(54) Title: GONOCOCCAL PROTEINS AND NUCLEIC ACIDS


(57) Abstract: The invention provides proteins from gonococcus (Neisseria gonorrhoeae), including amino acid sequences, the
corresponding nucleotide sequences, expression data, and serological data. The proteins are useful antigens for vaccines, immunogenic
compositions, and/or diagnostics. They are also useful for distinguishing between gonococcus and meningococcus and, in particular,
between gonococcus and serogroup B meningococcus.



WO 02/079243 PCT/IB02/02069 


GONOCOCCAL PROTEINS AND NUCLEIC ACIDS 
All documents cited herein are incorporated by reference in their entirety. 
TECHNICAL FIELD 

This invention relates to proteins from the bacterium Neisseria gonorrhoeae, and more particularly to
such proteins which do not have corresponding homologs or orthologs in serogroup B N.meningitidis.

BACKGROUND ART

Neisseria gonorrhoeae is a bacterial pathogen. There is currently no effective vaccine against
N.gonorrhoeae infection. It is an object of the invention to provide proteins and nucleic acid useful in
vaccine study and/or manufacture.

N.gonorrhoeae is related to N.meningitidis. Sequence data are now available for serogroup B of
meningococcus [e.g. WO99/24578; WO99/36544; WO99/57280; WO00/22430; WO00/66791; Tettelin et
al. (2000) Science 287:1809-1815] and also for serogroup A [Parkhill et al. (2000) Nature 404:502-506].
It is a further object of the invention to provide proteins and nucleic acid useful in distinguishing between
gonococcus and meningococcus and, in particular, between gonococcus and serogroup B meningococcus.

DISCLOSURE OF THE INVENTION

The invention provides proteins comprising the N.gonorrhoeae amino acid sequences disclosed in the
examples (the even-numbered SEQ IDs 2 to 8622). 159 of these have no homolog in serogroup B
meningococcus and these have been given a name in the form "NGSn".

It also provides proteins comprising amino acid sequences having sequence identity to the N.gonorrhoeae
amino acid sequences disclosed in the examples. Depending on the particular sequence, the degree of
sequence identity is preferably greater than 50% (e.g. 60%, 70%, 80%, 90%, 95%, 99% or more). These
proteins include homologs, orthologs, allelic variants and functional mutants. Typically, 50% identity or
more between two proteins is considered to be an indication of functional equivalence. Identity between
proteins is preferably determined by the Smith-Waterman homology search algorithm as implemented in
the MPSRCH program (Oxford Molecular), using an affine gap search with parameters gap open
penalty=12 and gap extension penalty=1.
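
As an illustration only, the identity calculation described above can be sketched as a Smith-Waterman local alignment with affine gaps. This is not the MPSRCH implementation: the flat match/mismatch scores (+5/-4) stand in for a substitution matrix, the gap cost convention (12 for the first gap position, 1 for each further position) and the definition of identity (matches divided by aligned columns, gaps included) are assumptions made for the sketch, and the function name is hypothetical.

```python
NEG_INF = float("-inf")


def percent_identity(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Percent identity over the best local alignment of a and b.

    Assumed scoring: flat match/mismatch scores instead of a substitution
    matrix; a gap of length k costs gap_open + (k - 1) * gap_extend.
    """
    n, m = len(a), len(b)
    # H: best score ending at (i, j); E: ending in a gap in a; F: gap in b.
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    F = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    best, bi, bj = 0, 0, 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            s = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            if H[i][j] > best:
                best, bi, bj = H[i][j], i, j

    # Trace back from the best cell, counting aligned columns and matches.
    i, j, state = bi, bj, "H"
    matches = columns = 0
    while i > 0 and j > 0:
        if state == "H":
            if H[i][j] == 0:
                break  # start of the local alignment
            s = match if a[i - 1] == b[j - 1] else mismatch
            if H[i][j] == H[i - 1][j - 1] + s:
                matches += a[i - 1] == b[j - 1]
                columns += 1
                i -= 1
                j -= 1
            elif H[i][j] == E[i][j]:
                state = "E"
            else:
                state = "F"
        elif state == "E":
            columns += 1  # residue of b aligned against a gap
            if E[i][j] == H[i][j - 1] - gap_open:
                state = "H"
            j -= 1
        else:
            columns += 1  # residue of a aligned against a gap
            if F[i][j] == H[i - 1][j] - gap_open:
                state = "H"
            i -= 1
    return 100.0 * matches / columns if columns else 0.0


if __name__ == "__main__":
    # Two made-up peptides differing at one position.
    print(percent_identity("ACDEFGHIKL", "ACDEFAHIKL"))
```

A real comparison would substitute a scoring matrix such as BLOSUM or PAM for the flat scores; the choice of matrix and of the identity denominator both change the reported percentage.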

The invention further provides proteins comprising fragments of the N.gonorrhoeae amino acid
sequences disclosed in the examples. The fragments should comprise at least n consecutive amino acids
from the sequences and, depending on the particular sequence, n is 7 or more (e.g. 8, 10, 12, 14, 16, 18,
20, 30, 40, 50, 60, 70, 80, 90, 100 or more). Preferably the fragments comprise one or more epitopes from
the sequence. Other preferred fragments are (a) the N-terminal signal peptides of the proteins disclosed in
the examples, and (b) the proteins disclosed in the examples, but without their N-terminal signal peptides.
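
By way of a minimal sketch, the fragments of n consecutive amino acids described above can be enumerated with a sliding window, and a sequence can be split into a signal peptide and the remaining protein once a cleavage position is known. The function names, the example sequence and the cleavage position are hypothetical; predicting the actual cleavage site is outside the scope of this sketch.

```python
def fragments(sequence, n=7):
    """Return every run of n consecutive residues from sequence."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return [sequence[i:i + n] for i in range(len(sequence) - n + 1)]


def split_signal_peptide(sequence, cleavage_index):
    """Split into (signal peptide, protein without signal peptide).

    cleavage_index is the number of N-terminal residues in the signal
    peptide; it is supplied by the caller (an assumption of this sketch).
    """
    if not 0 <= cleavage_index <= len(sequence):
        raise ValueError("cleavage_index out of range")
    return sequence[:cleavage_index], sequence[cleavage_index:]


if __name__ == "__main__":
    seq = "MKKLLIAAG"  # made-up sequence, for illustration only
    print(fragments(seq, 7))
    print(split_signal_peptide(seq, 4))
```

A sequence shorter than n yields an empty list, which matches the requirement that a fragment contain at least n residues.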

The proteins of the invention can, of course, be prepared by various means (e.g. recombinant expression,
purification from Neisseria, chemical synthesis etc.) and in various forms (e.g. native, fusions etc.). They
are preferably prepared in substantially pure form (i.e. substantially free from other N.gonorrhoeae or
host cell proteins).

The proteins of the invention are preferably Neisserial proteins, more preferably N.gonorrhoeae proteins.

The invention provides antibodies which bind to these proteins. These may be polyclonal or monoclonal
and may be produced by any suitable means. The antibodies may include a detectable label. 

The invention provides nucleic acid comprising the N. gonorrhoeae nucleotide sequences disclosed in the
examples. In addition, the invention provides nucleic acid comprising nucleotide sequences having
sequence identity to the N. gonorrhoeae nucleotide sequences disclosed in the examples.

Furthermore, the invention provides nucleic acid which can hybridise to the N. gonorrhoeae nucleic acid
disclosed in the examples, preferably under "high stringency" conditions (e.g. 65°C in a 0.1xSSC, 0.5%
SDS solution).

Nucleic acid comprising fragments of these sequences is also provided. These should comprise at least n
consecutive nucleotides from the N.gonorrhoeae sequences and, depending on the particular sequence, n
is 10 or more (e.g. 12, 14, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200 or more).

The invention also provides nucleic acid encoding the proteins and protein fragments of the invention.

The invention includes nucleic acid comprising sequences complementary to those described above (e.g. 
for antisense or probing purposes). 
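
As a small illustration of a complementary sequence, the reverse complement of a DNA strand can be computed as below. The function name and the example sequence are hypothetical, and only unambiguous DNA bases (A, C, G, T) are handled; RNA and IUPAC ambiguity codes would need a larger table.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    probe_target = "ATGCGT"  # made-up sequence, for illustration only
    print(reverse_complement(probe_target))
```

Applying the function twice returns the original strand, which is a quick sanity check when designing probe or antisense sequences.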

Nucleic acid according to the invention can be prepared in many ways (e.g. by chemical synthesis, from
genomic or cDNA libraries, from the organism itself etc.) and can take various forms (e.g. single
stranded, double stranded, vectors, probes etc.).

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, such as those
containing modified backbones, and also peptide nucleic acids (PNA) etc. 

The invention provides vectors comprising nucleotide sequences of the invention (e.g. cloning or
expression vectors) and host cells transformed with such vectors. 

The invention provides compositions comprising protein, antibody, and/or nucleic acid according to the 
invention. These compositions may be suitable as immunogenic compositions, for instance, or as 
diagnostic reagents, or as vaccines. 

The invention also provides nucleic acid, protein, or antibody according to the invention for use as
medicaments (e.g. as vaccines) or as diagnostic reagents. It also provides the use of nucleic acid, protein, 
or antibody according to the invention in the manufacture of: (i) a medicament for treating or preventing 
infection due to Neisseria; (ii) a diagnostic reagent for detecting the presence of Neisseria or of antibodies 
raised against Neisseria; and/or (iii) a reagent which can raise antibodies against Neisseria. Said Neisseria
may be any species, but is preferably N.gonorrhoeae.

The invention also provides a method of treating a patient, comprising administering to the patient a 
therapeutically effective amount of nucleic acid, protein, and/or antibody of the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of culturing a host cell
according to the invention under conditions which induce protein expression.

A process for producing protein or nucleic acid of the invention is provided, wherein the protein or
nucleic acid is synthesised in part or in whole using chemical means. 

A process for detecting polynucleotides of the invention is provided, comprising the steps of: (a)
contacting a nucleic acid probe according to the invention with a biological sample under hybridising
conditions to form duplexes; and (b) detecting said duplexes.

A process for detecting proteins of the invention is provided, comprising the steps of: (a) contacting an
antibody of the invention with a biological sample under conditions suitable for the formation of
antibody-antigen complexes; and (b) detecting said complexes.

A process for distinguishing N. gonorrhoeae from N.meningitidis is provided, comprising the steps of: (a)
contacting an antibody of the invention with a biological sample under conditions suitable for the
formation of antibody-antigen complexes; and (b) detecting said complexes. Alternatively, the steps
may be (a) contacting nucleic acid of the invention with a biological sample under conditions suitable for
nucleic acid hybridisation; and (b) detecting any such hybridisation. Alternatively, the steps may be
(a) contacting a protein of the invention with a biological sample (e.g. blood or serum) under conditions
suitable for the formation of antibody-antigen complexes; and (b) detecting said complexes.

A summary of standard techniques and procedures which may be employed in order to perform the 
invention (e.g. to utilise the disclosed sequences for vaccination or diagnostic purposes) follows. This 
summary is not a limitation on the invention, but gives examples that may be used, but are not required. 

General 

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular
biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are
explained fully in the literature e.g. Sambrook Molecular Cloning: A Laboratory Manual, Second Edition (1989) or
Third Edition (2000); DNA Cloning, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait
ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription and Translation (B.D.
Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney ed. 1986); Immobilized Cells and Enzymes
(IRL Press, 1986); B. Perbal, A Practical Guide to Molecular Cloning (1984); the Methods in Enzymology series
(Academic Press, Inc.), especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), Immunochemical Methods in
Cell and Molecular Biology (Academic Press, London); Scopes, (1987) Protein Purification: Principles and
Practice, Second Edition (Springer-Verlag, N.Y.), and Handbook of Experimental Immunology, Volumes I-IV (D.M.
Weir and C.C. Blackwell eds. 1986).

Standard abbreviations for nucleotides and amino acids are used in this specification. 
Definitions

A composition containing X is "substantially free of" Y when at least 85% by weight of the total X+Y in the
composition is X. Preferably, X comprises at least about 90% by weight of the total of X+Y in the composition, more
preferably at least about 95% or even 99% by weight.
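
The weight criterion in this definition reduces to simple arithmetic, sketched below. The function names and the example weights are hypothetical; the 85% threshold is the one stated in the definition, and the units only need to be the same for X and Y.

```python
def weight_fraction_x(x_weight, y_weight):
    """Fraction of the combined X+Y weight that is X."""
    total = x_weight + y_weight
    if x_weight < 0 or y_weight < 0 or total <= 0:
        raise ValueError("weights must be non-negative with a positive total")
    return x_weight / total


def is_substantially_free(x_weight, y_weight, threshold=0.85):
    """True when X makes up at least `threshold` of the X+Y weight."""
    return weight_fraction_x(x_weight, y_weight) >= threshold


if __name__ == "__main__":
    # Example: 9.2 mg of X with 0.8 mg of Y (made-up numbers).
    print(weight_fraction_x(9.2, 0.8), is_substantially_free(9.2, 0.8))
```

Passing threshold=0.90, 0.95 or 0.99 checks the preferred levels mentioned in the same definition.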

The term "comprising" means "including" as well as "consisting" e.g. a composition "comprising" X may consist
exclusively of X or may include something additional e.g. X + Y. 






The term "heterologous" refers to two biological components that are not found together in nature. The components
may be host cells, genes, or regulatory regions, such as promoters. Although the heterologous components are not
found together in nature, they can function together, as when a promoter heterologous to a gene is operably linked to
the gene. Another example is where a Neisseria sequence is heterologous to a mouse host cell. A further example
would be two epitopes from the same or different proteins which have been assembled in a single protein in an
arrangement not found in nature.

An "origin of replication" is a polynucleotide sequence that initiates and regulates replication of polynucleotides,
such as an expression vector. The origin of replication behaves as an autonomous unit of polynucleotide replication
within a cell, capable of replication under its own control. An origin of replication may be needed for a vector to
replicate in a particular host cell. With certain origins of replication, an expression vector can be reproduced at a high
copy number in the presence of the appropriate proteins within the cell. Examples of origins are the autonomously
replicating sequences, which are effective in yeast; and the viral T-antigen, effective in COS-7 cells.

A "mutant" sequence is defined as DNA, RNA or amino acid sequence differing from but having sequence identity
with the native or disclosed sequence. Depending on the particular sequence, the degree of sequence identity between
the native or disclosed sequence and the mutant sequence is preferably greater than 50% (e.g. 60%, 70%, 80%, 90%,
95%, 99% or more, calculated using the Smith-Waterman algorithm as described above). As used herein, an "allelic
variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is provided herein is a nucleic acid
molecule, or region, that occurs essentially at the same locus in the genome of another or second isolate, and that, due
to natural variation caused by, for example, mutation or recombination, has a similar but not identical nucleic acid
sequence. A coding region allelic variant typically encodes a protein having similar activity to that of the protein
encoded by the gene to which it is being compared. An allelic variant can also comprise an alteration in the 5' or 3'
untranslated regions of the gene, such as in regulatory control regions (e.g. see US patent 5,753,235).

Expression systems 

The Neisseria nucleotide sequences can be expressed in a variety of different expression systems; for example those
used with mammalian cells, baculoviruses, plants, bacteria, and yeast.

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any DNA sequence capable of
binding mammalian RNA polymerase and initiating the downstream (3') transcription of a coding sequence (e.g.
structural gene) into mRNA. A promoter will have a transcription initiating region, which is usually placed proximal
to the 5' end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream of the
transcription initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA synthesis at the
correct site. A mammalian promoter will also contain an upstream promoter element, usually located within 100 to
200 bp upstream of the TATA box. An upstream promoter element determines the rate at which transcription is
initiated and can act in either orientation [Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian Cells."
In Molecular Cloning: A Laboratory Manual, 2nd ed.].

Mammalian viral genes are often highly expressed and have a broad host range; therefore sequences encoding
mammalian viral genes provide particularly useful promoter sequences. Examples include the SV40 early promoter,
mouse mammary tumor virus LTR promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus
promoter. In addition, sequences derived from non-viral genes, such as the murine metallothionein gene, also
provide useful promoter sequences. Expression may be either constitutive or regulated (inducible), depending on the
promoter; for example, a regulated promoter can be induced with glucocorticoid in hormone-responsive cells.

The presence of an enhancer element (enhancer), combined with the promoter elements described above, will usually
increase expression levels. An enhancer is a regulatory DNA sequence that can stimulate transcription up to 1000-fold
when linked to homologous or heterologous promoters, with synthesis beginning at the normal RNA start site.
Enhancers are also active when they are placed upstream or downstream from the transcription initiation site, in either
normal or flipped orientation, or at a distance of more than 1000 nucleotides from the promoter [Maniatis et al. (1987)
Science 236:1237; Alberts et al. (1989) Molecular Biology of the Cell, 2nd ed.]. Enhancer elements derived from
viruses may be particularly useful, because they usually have a broader host range. Examples include the SV40 early
gene enhancer [Dijkema et al. (1985) EMBO J. 4:761] and the enhancer/promoters derived from the long terminal
repeat (LTR) of the Rous Sarcoma Virus [Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79:6777] and from human
cytomegalovirus [Boshart et al. (1985) Cell 41:521]. Additionally, some enhancers are regulatable and become active
only in the presence of an inducer, such as a hormone or metal ion [Sassone-Corsi and Borelli (1986) Trends Genet.
2:215; Maniatis et al. (1987) Science 236:1237].

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter sequence may be directly linked
with the DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will always
be a methionine, which is encoded by the ATG start codon. If desired, the N-terminus may be cleaved from the
protein by in vitro incubation with cyanogen bromide.

Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric DNA
molecules that encode a fusion protein comprised of a leader sequence fragment that provides for secretion of the
foreign protein in mammalian cells. Preferably, there are processing sites encoded between the leader fragment and
the foreign gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. The adenovirus
tripartite leader is an example of a leader sequence that provides for secretion of a foreign protein in mammalian cells.

Usually, transcription termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements, flank the coding
sequence. The 3' terminus of the mature mRNA is formed by site-specific post-transcriptional cleavage and
polyadenylation [Birnstiel et al. (1985) Cell 41:349; Proudfoot and Whitelaw (1988) "Termination and 3' end processing of
eukaryotic RNA." In Transcription and splicing (ed. B.D. Hames and D.M. Glover); Proudfoot (1989) Trends
Biochem. Sci. 14:105]. These sequences direct the transcription of an mRNA which can be translated into the
polypeptide encoded by the DNA. Examples of transcription terminator/polyadenylation signals include those derived
from SV40 [Sambrook et al. (1989) "Expression of cloned genes in cultured mammalian cells." In Molecular Cloning:
A Laboratory Manual].

Usually, the above described components, comprising a promoter, polyadenylation signal, and transcription
termination sequence are put together into expression constructs. Enhancers, introns with functional splice donor and
acceptor sites, and leader sequences may also be included in an expression construct, if desired. Expression constructs
are often maintained in a replicon, such as an extrachromosomal element (e.g. plasmids) capable of stable
maintenance in a host, such as mammalian cells or bacteria. Mammalian replication systems include those derived
from animal viruses, which require trans-acting factors to replicate. For example, plasmids containing the replication
systems of papovaviruses, such as SV40 [Gluzman (1981) Cell 23:175] or polyomavirus, replicate to extremely high
copy number in the presence of the appropriate viral T antigen. Additional examples of mammalian replicons include
those derived from bovine papillomavirus and Epstein-Barr virus. Additionally, the replicon may have two replication
systems, thus allowing it to be maintained, for example, in mammalian cells for expression and in a prokaryotic host
for cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include pMT2 [Kaufman et al.
(1989) Mol. Cell. Biol. 9:946] and pHEBO [Shimizu et al. (1986) Mol. Cell. Biol. 6:1074].

The transformation procedure used depends upon the host to be transformed. Methods for introduction of
heterologous polynucleotides into mammalian cells are known in the art and include dextran-mediated transfection,
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of
the polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

Mammalian cell lines available as hosts for expression are known in the art and include many immortalized cell lines
available from the American Type Culture Collection (ATCC), including but not limited to, Chinese hamster ovary
(CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular
carcinoma cells (e.g. Hep G2), and a number of other cell lines.

ii. Baculovirus Systems

The polynucleotide encoding the protein can also be inserted into a suitable insect expression vector, and is operably
linked to the control elements within that vector. Vector construction employs techniques which are known in the art.
Generally, the components of the expression system include a transfer vector, usually a bacterial plasmid, which
contains both a fragment of the baculovirus genome, and a convenient restriction site for insertion of the heterologous
gene or genes to be expressed; a wild type baculovirus with a sequence homologous to the baculovirus-specific
fragment in the transfer vector (this allows for the homologous recombination of the heterologous gene into the
baculovirus genome); and appropriate insect host cells and growth media.

After inserting the DNA sequence encoding the protein into the transfer vector, the vector and the wild type viral
genome are transfected into an insect host cell where the vector and viral genome are allowed to recombine. The
packaged recombinant virus is expressed and recombinant plaques are identified and purified. Materials and methods
for baculovirus/insect cell expression systems are commercially available in kit form from, inter alia, Invitrogen, San
Diego CA ("MaxBac" kit). These techniques are generally known to those skilled in the art and fully described in
Summers & Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987) ("Summers & Smith").

Prior to inserting the DNA sequence encoding the protein into the baculovirus genome, the above described
components, comprising a promoter, leader (if desired), coding sequence, and transcription termination sequence, are
usually assembled into an intermediate transplacement construct (transfer vector). This may contain a single gene and
operably linked regulatory elements; multiple genes, each with its own set of operably linked regulatory elements;
or multiple genes, regulated by the same set of regulatory elements. Intermediate transplacement constructs are often
maintained in a replicon, such as an extra-chromosomal element (e.g. plasmids) capable of stable maintenance in a
host, such as a bacterium. The replicon will have a replication system, thus allowing it to be maintained in a suitable
host for cloning and amplification.

Currently, the most commonly used transfer vector for introducing foreign genes into AcNPV is pAc373. Many other
vectors, known to those of skill in the art, have also been designed. These include, for example, pVL985 (which alters
the polyhedrin start codon from ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream
from the ATT; see Luckow and Summers, Virology (1989) 170:31).

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. (1988) Ann. Rev. Microbiol., 
42:177) and a prokaryotic ampicillin-resistance (amp) gene and origin of replication for selection and propagation in 
E.coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus promoter is any DNA sequence
capable of binding a baculovirus RNA polymerase and initiating the downstream (5' to 3') transcription of a coding
sequence (e.g. structural gene) into mRNA. A promoter will have a transcription initiation region which is usually
placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA
polymerase binding site and a transcription initiation site. A baculovirus transfer vector may also have a second
domain called an enhancer, which, if present, is usually distal to the structural gene. Expression may be either
regulated or constitutive.

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide particularly useful promoter
sequences. Examples include sequences derived from the gene encoding the viral polyhedrin protein, Friesen et al.,
(1986) "The Regulation of Baculovirus Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter
Doerfler); EPO Publ. Nos. 127 839 and 155 476; and the gene encoding the p10 protein, Vlak et al., (1988), J. Gen.
Virol. 69:165.

DNA encoding suitable signal sequences can be derived from genes for secreted insect or baculovirus proteins, such
as the baculovirus polyhedrin gene (Carbonell et al. (1988) Gene, 73:409). Alternatively, since the signals for
mammalian cell posttranslational modifications (such as signal peptide cleavage, proteolytic cleavage, and
phosphorylation) appear to be recognized by insect cells, and the signals required for secretion and nuclear
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, leaders of non-insect
origin, such as those derived from genes encoding human α-interferon, Maeda et al., (1985), Nature 315:592; human
gastrin-releasing peptide, Lebacq-Verheyden et al., (1988), Molec. Cell. Biol. 8:3129; human IL-2, Smith et al.,
(1985) Proc. Nat'l Acad. Sci. USA, 82:8404; mouse IL-3, Miyajima et al., (1987) Gene 55:273; and human
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion in insects.

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is expressed with the proper
regulatory sequences, it can be secreted. Good intracellular expression of nonfused foreign proteins usually requires
heterologous genes that ideally have a short leader sequence containing suitable translation initiation signals
preceding an ATG start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein by in
vitro incubation with cyanogen bromide.

Alternatively, recombinant polyproteins or proteins which are not naturally secreted can be secreted from the insect
cell by creating chimeric DNA molecules that encode a fusion protein comprised of a leader sequence fragment that
provides for secretion of the foreign protein in insects. The leader sequence fragment usually encodes a signal peptide
comprised of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic reticulum.

After insertion of the DNA sequence and/or the gene encoding the expression product precursor of the protein, an
insect cell host is co-transformed with the heterologous DNA of the transfer vector and the genomic DNA of
wild-type baculovirus, usually by co-transfection. The promoter and transcription termination sequence of the construct
will usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing heterologous DNA into the
desired site in the baculovirus are known in the art. (See Summers & Smith supra; Ju et al. (1987); Smith et al.,
Mol. Cell. Biol. (1983) 3:2156; and Luckow and Summers (1989)). For example, the insertion can be into a gene such
as the polyhedrin gene, by homologous double crossover recombination; insertion can also be into a restriction
enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), Bioessays 4:91. The DNA sequence,
when cloned in place of the polyhedrin gene in the expression vector, is flanked both 5' and 3' by polyhedrin-specific
sequences and is positioned downstream of the polyhedrin promoter.

The newly formed baculovirus expression vector is subsequently packaged into an infectious recombinant
baculovirus. Homologous recombination occurs at low frequency (between about 1% and about 5%); thus, the
majority of the virus produced after cotransfection is still wild-type virus. Therefore, a method is necessary to identify
recombinant viruses. An advantage of the expression system is a visual screen allowing recombinant viruses to be
distinguished. The polyhedrin protein, which is produced by the native virus, is produced at very high levels in the
nuclei of infected cells at late times after viral infection. Accumulated polyhedrin protein forms occlusion bodies that
also contain embedded particles. These occlusion bodies, up to 15 µm in size, are highly refractile, giving them a
bright shiny appearance that is readily visualized under the light microscope. Cells infected with recombinant viruses
lack occlusion bodies. To distinguish recombinant virus from wild-type virus, the transfection supernatant is plaqued
onto a monolayer of insect cells by techniques known to those skilled in the art. Namely, the plaques are screened
under the light microscope for the presence (indicative of wild-type virus) or absence (indicative of recombinant
virus) of occlusion bodies. "Current Protocols in Microbiology" Vol. 2 (Ausubel et al. eds) at 16.8 (Supp. 10, 1990);
Summers & Smith, supra; Miller et al. (1989).
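
As a rough planning aid, the recombination frequency quoted above (about 1% to about 5%) can be turned into an estimate of how many plaques need to be screened. The sketch below assumes that each plaque is independently recombinant with a fixed probability equal to the recombination frequency, and the 95% confidence target is an illustrative choice; neither assumption comes from the text, and the function names are hypothetical.

```python
import math


def prob_at_least_one_recombinant(n_plaques, recomb_freq):
    """P(at least one recombinant among n_plaques independent plaques)."""
    return 1.0 - (1.0 - recomb_freq) ** n_plaques


def plaques_needed(recomb_freq, confidence=0.95):
    """Smallest number of plaques giving the requested confidence of
    seeing at least one recombinant (occlusion-negative) plaque."""
    if not 0.0 < recomb_freq < 1.0:
        raise ValueError("recomb_freq must be between 0 and 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be between 0 and 1")
    # Solve 1 - (1 - f)^n >= confidence for the smallest integer n.
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - recomb_freq))


if __name__ == "__main__":
    for freq in (0.01, 0.05):
        print(freq, plaques_needed(freq))
```

Under these assumptions, roughly 300 plaques are needed at a 1% frequency and roughly 60 at 5%, which illustrates why a quick visual screen for occlusion bodies is useful.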

Recombinant baculovirus expression vectors have been developed for infection into several insect cells. For example,
recombinant baculoviruses have been developed for, inter alia: Aedes aegypti, Autographa californica, Bombyx
mori, Drosophila melanogaster, Spodoptera frugiperda, and Trichoplusia ni (WO 89/046699; Carbonell et al., (1985)
J. Virol. 56:153; Wright (1986) Nature 321:718; Smith et al., (1983) Mol. Cell. Biol. 3:2156; and see generally,
Fraser, et al. (1989) In Vitro Cell. Dev. Biol. 25:225).

Cells and cell culture media are commercially available for both direct and fusion expression of heterologous
polypeptides in a baculovirus/expression system; cell culture technology is generally known to those skilled in the art.
See, e.g. Summers & Smith supra.

The modified insect cells may then be grown in an appropriate nutrient medium, which allows for stable maintenance
of the plasmid(s) present in the modified insect host. Where the expression product gene is under inducible control,
the host may be grown to high density, and expression induced. Alternatively, where expression is constitutive, the
product will be continuously expressed into the medium and the nutrient medium must be continuously circulated,
while removing the product of interest and augmenting depleted nutrients. The product may be purified by such
techniques as chromatography, e.g. HPLC, affinity chromatography, ion exchange chromatography, etc.;
electrophoresis; density gradient centrifugation; solvent extraction, etc. As appropriate, the product may be further
purified, as required, so as to remove substantially any insect proteins which are also present in the medium, so as to
provide a product which is at least substantially free of host debris, e.g. proteins, lipids and polysaccharides.

In order to obtain protein expression, recombinant host cells derived from the transformants are incubated under 
conditions which allow expression of the recombinant protein encoding sequence. These conditions will vary, 
dependent upon the host cell selected. However, the conditions are readily ascertainable to those of ordinary skill in 
the art, based upon what is known in the art. 

iii. Plant Systems

There are many plant cell culture and whole plant genetic expression systems known in the art. Exemplary plant
cellular genetic expression systems include those described in patents, such as: US 5,693,506; US 5,659,122; and US
5,608,143. Additional examples of genetic expression in plant cell culture have been described by Zenk,
Phytochemistry 30:3861-3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the
references described above in Vaulcombe et al., Mol. Gen. Genet. 209:33-40 (1987); Chandler et al., Plant Molecular
Biology 3:407-418 (1984); Rogers, J. Biol. Chem. 260:3731-3738 (1985); Rothstein et al., Gene 55:353-356 (1987);
Whittier et al., Nucleic Acids Research 15:2515-2535 (1987); Wirsel et al., Molecular Microbiology 3:3-14 (1989);
Yu et al., Gene 122:247-253 (1992). A description of the regulation of plant gene expression by the phytohormone
gibberellic acid and secreted enzymes induced by gibberellic acid can be found in R.L. Jones and J. MacMillin,
Gibberellins, in: Advanced Plant Physiology, Malcolm B. Wilkins, ed., 1984 Pitman Publishing Limited, London,
pp. 21-52. References that describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038 (1990); Maas
et al., EMBO J. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl. Acad. Sci. 84:1337-1339 (1987).

Typically, using techniques known in the art, a desired polynucleotide sequence is inserted into an expression cassette
comprising genetic regulatory elements designed for operation in plants. The expression cassette is inserted into a
desired expression vector with companion sequences upstream and downstream from the expression cassette suitable
for expression in a plant host. The companion sequences will be of plasmid or viral origin and provide necessary
characteristics to the vector to permit the vectors to move DNA from an original cloning host, such as bacteria, to the
desired plant host. The basic bacterial/plant vector construct will preferably provide a broad host range prokaryote
replication origin; a prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not readily amenable to
detection, the construct will preferably also have a selectable marker gene suitable for determining if a plant cell has
been transformed. A general review of suitable markers, for example for the members of the grass family, is found in
Wilmink and Dons, 1993, Plant Mol. Biol. Reptr, 11(2):165-185.

Sequences suitable for permitting integration of the heterologous sequence into the plant genome are also
recommended. These might include transposon sequences and the like for homologous recombination as well as Ti
sequences which permit random insertion of a heterologous expression cassette into a plant genome. Suitable
prokaryote selectable markers include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA
sequences encoding additional functions may also be present in the vector, as is known in the art.

The nucleic acid molecules of the subject invention may be included into an expression cassette for expression of the
protein(s) of interest. Usually, there will be only one expression cassette, although two or more are feasible. The
recombinant expression cassette will contain, in addition to the heterologous protein encoding sequence, the following
elements: a promoter region, plant 5' untranslated sequences, an initiation codon (depending upon whether or not the
structural gene comes equipped with one), and a transcription and translation termination sequence. Unique restriction
enzyme sites at the 5' and 3' ends of the cassette allow for easy insertion into a pre-existing vector.

A heterologous coding sequence may be for any protein relating to the present invention. The sequence encoding the
protein of interest will encode a signal peptide which allows processing and translocation of the protein, as
appropriate, and will usually lack any sequence which might result in the binding of the desired protein of the
invention to a membrane. Since, for the most part, the transcriptional initiation region will be for a gene which is
expressed and translocated during germination, by employing the signal peptide which provides for translocation, one
may also provide for translocation of the protein of interest. In this way, the protein(s) of interest will be translocated
from the cells in which they are expressed and may be efficiently harvested. Typically, secretion in seeds is across
the aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required that the protein be
secreted from the cells in which the protein is produced, this facilitates the isolation and purification of the
recombinant protein.

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it is desirable to determine
whether any portion of the cloned gene contains sequences which will be processed out as introns by the host's
spliceosome machinery. If so, site-directed mutagenesis of the "intron" region may be conducted to prevent losing a
portion of the genetic message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985.

The vector can be microinjected directly into plant cells by use of micropipettes to mechanically transfer the
recombinant DNA. Crossway, Mol. Gen. Genet., 202:179-185, 1985. The genetic material may also be transferred into
the plant cell by using polyethylene glycol, Krens, et al., Nature, 296, 72-74, 1982. Another method of introduction of
nucleic acid segments is high velocity ballistic penetration by small particles with the nucleic acid either within the
matrix of small beads or particles, or on the surface, Klein, et al., Nature, 327, 70-73, 1987 and Knudsen and Muller,
1991, Planta, 185:330-336 teaching particle bombardment of barley endosperm to create transgenic barley. Yet
another method of introduction would be fusion of protoplasts with other entities, either minicells, cells, lysosomes or
other fusible lipid-surfaced bodies, Fraley, et al., Proc. Natl. Acad. Sci. USA, 79, 1859-1863, 1982.

The vector may also be introduced into the plant cells by electroporation (Fromm et al., Proc. Natl. Acad. Sci. USA
82:5824, 1985). In this technique, plant protoplasts are electroporated in the presence of plasmids containing the gene
construct. Electrical impulses of high field strength reversibly permeabilize biomembranes allowing the introduction
of the plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus.

All plants from which protoplasts can be isolated and cultured to give whole regenerated plants can be transformed by
the present invention so that whole plants are recovered which contain the transferred gene. It is known that
practically all plants can be regenerated from cultured cells or tissues, including but not limited to all major species of
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable plants include, for
example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus,
Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura,
Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca,
Bromus, Asparagus, Antirrhinum, Hemerocallis, Nemesia, Pelargonium, Panicum, Pennisetum, Ranunculus, Senecio,
Salpiglossis, Cucumis, Browallia, Glycine, Lolium, Zea, Triticum, Sorghum, and Datura.

Means for regeneration vary from species to species of plants, but generally a suspension of transformed protoplasts
containing copies of the heterologous gene is first provided. Callus tissue is formed and shoots may be induced from
callus and subsequently rooted. Alternatively, embryo formation can be induced from the protoplast suspension.
These embryos germinate as natural embryos to form plants. The culture media will generally contain various amino
acids and hormones, such as auxin and cytokinins. It is also advantageous to add glutamic acid and proline to the
medium, especially for such species as corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient
regeneration will depend on the medium, on the genotype, and on the history of the culture. If these three variables
are controlled, then regeneration is fully reproducible and repeatable.

In some plant cell culture systems, the desired protein of the invention may be excreted or alternatively, the protein
may be extracted from the whole plant. Where the desired protein of the invention is secreted into the medium, it may
be collected. Alternatively, the embryos and embryoless-half seeds or other plant tissue may be mechanically
disrupted to release any secreted protein between cells and tissues. The mixture may be suspended in a buffer solution
to retrieve soluble proteins. Conventional protein isolation and purification methods will then be used to purify the
recombinant protein. Parameters of time, temperature, pH, oxygen, and volumes will be adjusted through routine
methods to optimize expression and recovery of heterologous protein.

iv. Bacterial Systems

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA sequence capable of binding
bacterial RNA polymerase and initiating the downstream (3') transcription of a coding sequence (eg. structural gene)
into mRNA. A promoter will have a transcription initiation region which is usually placed proximal to the 5' end of
the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a
transcription initiation site. A bacterial promoter may also have a second domain called an operator, that may overlap
an adjacent RNA polymerase binding site at which RNA synthesis begins. The operator permits negative regulated
(inducible) transcription, as a gene repressor protein may bind the operator and thereby inhibit transcription of a
specific gene. Constitutive expression may occur in the absence of negative regulatory elements, such as the operator.
In addition, positive regulation may be achieved by a gene activator protein binding sequence, which, if present, is
usually proximal (5') to the RNA polymerase binding sequence. An example of a gene activator protein is the
catabolite activator protein (CAP), which helps initiate transcription of the lac operon in Escherichia coli (E. coli)
[Raibaud et al (1984) Annu. Rev. Genet. 18:173]. Regulated expression may therefore be either positive or negative,
thereby either enhancing or reducing transcription.

Sequences encoding metabolic pathway enzymes provide particularly useful promoter sequences. Examples include
promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose (lac) [Chang et al (1977)
Nature 198:1056], and maltose. Additional examples include promoter sequences derived from biosynthetic enzymes
such as tryptophan (trp) [Goeddel et al (1980) Nuc. Acids Res. 8:4057; Yelverton et al (1981) Nucl. Acids Res.
9:731; US patent 4,738,921; EP-A-0036776 and EP-A-0121775]. The β-lactamase (bla) promoter system
[Weissmann (1981) "The cloning of interferon and other mistakes." In Interferon 3 (ed. I. Gresser)], bacteriophage
lambda PL [Shimatake et al. (1981) Nature 292:128] and T5 [US patent 4,689,406] promoter systems also provide
useful promoter sequences.

In addition, synthetic promoters which do not occur in nature also function as bacterial promoters. For example,
transcription activation sequences of one bacterial or bacteriophage promoter may be joined with the operon
sequences of another bacterial or bacteriophage promoter, creating a synthetic hybrid promoter [US
patent 4,551,433]. For example, the tac promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac
operon sequences that is regulated by the lac repressor [Amann et al. (1983) Gene 25:167; de Boer et al (1983) Proc.
Natl. Acad. Sci. 80:21]. Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial
origin that have the ability to bind bacterial RNA polymerase and initiate transcription. A naturally occurring
promoter of non-bacterial origin can also be coupled with a compatible RNA polymerase to produce high levels of
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter system is an example of a
coupled promoter system [Studier et al (1986) J. Mol. Biol. 189:113; Tabor et al (1985) Proc. Natl. Acad. Sci.
82:1074]. In addition, a hybrid promoter can also be comprised of a bacteriophage promoter and an E. coli operator
region (EPO-A-0 267 851).

In addition to a functioning promoter sequence, an efficient ribosome binding site is also useful for the expression of
foreign genes in prokaryotes. In E. coli, the ribosome binding site is called the Shine-Dalgarno (SD) sequence and
includes an initiation codon (ATG) and a sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the
initiation codon [Shine et al (1975) Nature 254:34]. The SD sequence is thought to promote binding of mRNA to the
ribosome by the pairing of bases between the SD sequence and the 3' end of E. coli 16S rRNA [Steitz et al (1979)
"Genetic signals and nucleotide sequences in messenger RNA." In Biological Regulation and Development: Gene
Expression (ed. R.F. Goldberger)]. To express eukaryotic genes and prokaryotic genes with weak ribosome-binding
sites, see [Sambrook et al (1989) "Expression of cloned genes in Escherichia coli." In Molecular Cloning: A Laboratory
Manual].
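
The spacing rule stated above can be illustrated with a short sketch. The Python fragment below is not part of the disclosure. It assumes the commonly cited SD consensus 5'-TAAGGAGGT-3' (written as DNA); the function name sd_candidates and its parameters are hypothetical. For each ATG it reports the longest fragment of that consensus, 3-9 nucleotides long, that ends 3-11 nucleotides upstream of the codon.

```python
# Assumption: consensus SD sequence written as DNA (complementary to the
# 3' end of E. coli 16S rRNA). Real ribosome binding sites vary.
SD_CONSENSUS = "TAAGGAGGT"


def sd_candidates(seq, min_len=3, min_gap=3, max_gap=11):
    """Return (atg_index, sd_fragment, gap) for each ATG with an SD-like
    fragment ending min_gap..max_gap nucleotides upstream of it."""
    seq = seq.upper()
    # Every consensus substring of at least min_len nt, longest first.
    motifs = sorted(
        {SD_CONSENSUS[i:j]
         for i in range(len(SD_CONSENSUS))
         for j in range(i + min_len, len(SD_CONSENSUS) + 1)},
        key=len, reverse=True)
    hits = []
    atg = seq.find("ATG")
    while atg != -1:
        best = None
        for motif in motifs:
            for gap in range(min_gap, max_gap + 1):
                end = atg - gap              # fragment ends here (exclusive)
                begin = end - len(motif)
                if begin >= 0 and seq[begin:end] == motif:
                    best = (atg, motif, gap)
                    break
            if best:
                break
        if best:
            hits.append(best)
        atg = seq.find("ATG", atg + 1)
    return hits


if __name__ == "__main__":
    print(sd_candidates("TTTAGGAGGTTTTTTATGAAA"))
```

An exact-match scan of this kind is only a first filter; it does not score partial complementarity or mRNA secondary structure.
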

A DNA molecule may be expressed intracellularly. A promoter sequence may be directly linked with the DNA
molecule, in which case the first amino acid at the N-terminus will always be a methionine, which is encoded by the
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by in vitro incubation
with cyanogen bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal peptidase
(EPO-A-0 219 237).
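
As an illustration only, the following Python sketch shows why a directly expressed coding sequence yields a protein beginning with methionine, and models removal of that residue as a simple string operation. It assumes the standard genetic code; the function names translate and remove_initiator_met are hypothetical, and the sketch does not model the chemistry of either cleavage route.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds):
    """Translate a DNA coding sequence up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def remove_initiator_met(protein):
    """Drop a leading methionine, if present."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    product_seq = translate("ATGGCTAAATAA")
    print(product_seq, remove_initiator_met(product_seq))
```
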

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence encoding the N-terminal portion
of an endogenous bacterial protein, or other stable protein, is fused to the 5' end of heterologous coding sequences.
Upon expression, this construct will provide a fusion of the two amino acid sequences. For example, the
bacteriophage lambda cII gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The
resulting fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the bacteriophage
protein from the foreign gene [Nagai et al (1984) Nature 309:810]. Fusion proteins can also be made with sequences
from the lacZ [Jia et al (1987) Gene 60:197], trpE [Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al (1989) J.
Gen. Microbiol. 135:11], and Chey [EP-A-0 324 647] genes. The DNA sequence at the junction of the two amino
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion protein. Such a fusion
protein is made with the ubiquitin region that preferably retains a site for a processing enzyme (eg. ubiquitin specific
processing-protease) to cleave the ubiquitin from the foreign protein. Through this method, native foreign protein can
be isolated [Miller et al (1989) Bio/Technology 7:698].

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric DNA molecules that encode a
fusion protein comprised of a signal peptide sequence fragment that provides for secretion of the foreign protein in
bacteria [US patent 4,336,336]. The signal sequence fragment usually encodes a signal peptide comprised of
hydrophobic amino acids which direct the secretion of the protein from the cell. The protein is either secreted into the
growth media (gram-positive bacteria) or into the periplasmic space, located between the inner and outer membrane
of the cell (gram-negative bacteria). Preferably there are processing sites, which can be cleaved either in vivo or in
vitro, encoded between the signal peptide fragment and the foreign gene.

DNA encoding suitable signal sequences can be derived from genes for secreted bacterial proteins, such as the E. coli
outer membrane protein gene (ompA) [Masui et al (1983), in: Experimental Manipulation of Gene Expression;
Ghrayeb et al (1984) EMBO J. 3:2437] and the E. coli alkaline phosphatase signal sequence (phoA) [Oka et al
(1985) Proc. Natl. Acad. Sci. 82:7212]. As an additional example, the signal sequence of the alpha-amylase gene
from various Bacillus strains can be used to secrete heterologous proteins from B. subtilis [Palva et al. (1982) Proc.
Natl. Acad. Sci. USA 79:5582; EP-A-0 244 042].

Usually, transcription termination sequences recognized by bacteria are regulatory regions located 3' to the translation
stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the transcription
of an mRNA which can be translated into the polypeptide encoded by the DNA. Transcription termination sequences
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop structures that aid in
terminating transcription. Examples include transcription termination sequences derived from genes with strong
promoters, such as the trp gene in E. coli as well as other biosynthetic genes.
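
A sequence can form a stem loop when a stretch of bases is followed, after a short spacer, by its own reverse complement. The Python sketch below illustrates that idea with a naive exact-match search; it is not taken from the disclosure. The function name find_stem_loops and the default stem and loop lengths are assumptions chosen for the example, and no thermodynamic scoring is attempted.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_stem_loops(seq, stem=6, min_loop=3, max_loop=8):
    """Return (left_start, left_arm, loop_length, right_start) for every
    perfect inverted repeat with the given stem length and loop range."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        left = seq[i:i + stem]
        target = reverse_complement(left)   # what the right arm must be
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            if seq[j:j + stem] == target:
                hits.append((i, left, loop, j))
    return hits


if __name__ == "__main__":
    # Hypothetical GC-rich hairpin followed by a run of T residues.
    for hit in find_stem_loops("AAGCCCGCCTTTTGGCGGGCTTTTTTT"):
        print(hit)
```
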

Usually, the above described components, comprising a promoter, signal sequence (if desired), coding sequence of
interest, and transcription termination sequence, are put together into expression constructs. Expression constructs are
often maintained in a replicon, such as an extrachromosomal element (eg. plasmids) capable of stable maintenance in
a host, such as bacteria. The replicon will have a replication system, thus allowing it to be maintained in a prokaryotic
host either for expression or for cloning and amplification. In addition, a replicon may be either a high or low copy
number plasmid. A high copy number plasmid will generally have a copy number ranging from about 5 to about 200,
and usually about 10 to about 150. A host containing a high copy number plasmid will preferably contain at least
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number vector may be selected,
depending upon the effect of the vector and the foreign protein on the host.
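
The ordering of components in an expression construct can be sketched as a small data structure. The Python example below is illustrative only: the class name ExpressionConstruct, its fields, and the placeholder sequences are hypothetical. The three sanity checks (ATG start, in-frame junction, terminal stop codon) are assumptions added for the example and are not requirements stated in the text.

```python
from dataclasses import dataclass
from typing import List, Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}


@dataclass
class ExpressionConstruct:
    promoter: str
    coding_sequence: str
    terminator: str
    signal_sequence: Optional[str] = None  # included only "if desired"

    def open_reading_frame(self) -> str:
        """Signal sequence (when present) followed by the coding sequence."""
        return (self.signal_sequence or "") + self.coding_sequence

    def problems(self) -> List[str]:
        """Return a list of simple design problems (empty if none found)."""
        orf = self.open_reading_frame().upper()
        issues = []
        if not orf.startswith("ATG"):
            issues.append("reading frame does not begin with ATG")
        if len(orf) % 3 != 0:
            issues.append("signal and coding sequence are not in frame")
        if orf[-3:] not in STOP_CODONS:
            issues.append("no stop codon at the 3' end")
        return issues

    def assemble(self) -> str:
        """Promoter, then reading frame, then terminator, 5' to 3'."""
        return self.promoter + self.open_reading_frame() + self.terminator


if __name__ == "__main__":
    construct = ExpressionConstruct(
        promoter="TTGACA",            # placeholder, not a real promoter
        signal_sequence="ATGAAA",     # placeholder signal fragment
        coding_sequence="GCTAAATAA",  # placeholder coding region
        terminator="GCGC",            # placeholder terminator
    )
    print(construct.assemble(), construct.problems())
```
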

Alternatively, the expression constructs can be integrated into the bacterial genome with an integrating vector. 
Integrating vectors usually contain at least one sequence homologous to the bacterial chromosome that allows the 
vector to integrate. Integrations appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various Bacillus strains integrate 
into the Bacillus chromosome (EP-A-0 127 328). Integrating vectors may also be comprised of bacteriophage or
transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow for the
selection of bacterial strains that have been transformed. Selectable markers can be expressed in the bacterial host and
may include genes which render bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin,
kanamycin (neomycin), and tetracycline [Davies et al. (1978) Annu. Rev. Microbiol. 32:469]. Selectable markers may
also include biosynthetic genes, such as those in the histidine, tryptophan, and leucine biosynthetic pathways.

Alternatively, some of the above described components can be put together in transformation vectors. Transformation
vectors are usually comprised of a selectable marker that is either maintained in a replicon or developed into an
integrating vector, as described above.

Expression and transformation vectors, either extra-chromosomal replicons or integrating vectors, have been
developed for transformation into many bacteria. For example, expression vectors have been developed for, inter alia,
the following bacteria: Bacillus subtilis [Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EP-A-0 036 259 and
EP-A-0 063 953; WO 84/04541], Escherichia coli [Shimatake et al (1981) Nature 292:128; Amann et al. (1985)
Gene 40:183; Studier et al. (1986) J. Mol. Biol. 189:113; EP-A-0 036 776, EP-A-0 136 829 and EP-A-0 136 907],
Streptococcus cremoris [Powell et al. (1988) Appl. Environ. Microbiol. 54:655]; Streptococcus lividans [Powell et al.
(1988) Appl. Environ. Microbiol. 54:655], Streptomyces lividans [US patent 4,745,056].

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, and usually include either the
transformation of bacteria treated with CaCl2 or other agents, such as divalent cations and DMSO. DNA can also be
introduced into bacterial cells by electroporation. Transformation procedures usually vary with the bacterial species to
be transformed. See eg. [Masson et al. (1989) FEMS Microbiol. Lett. 60:273; Palva et al. (1982) Proc. Natl. Acad.
Sci. USA 79:5582; EP-A-0 036 259 and EP-A-0 063 953; WO 84/04541, Bacillus], [Miller et al (1988) Proc. Natl.
Acad. Sci. 85:856; Wang et al. (1990) J. Bacteriol. 172:949, Campylobacter], [Cohen et al. (1973) Proc. Natl. Acad.
Sci. 69:2110; Dower et al. (1988) Nucleic Acids Res. 16:6127; Kushner (1978) "An improved method for
transformation of Escherichia coli with ColE1-derived plasmids." In Genetic Engineering: Proceedings of the
International Symposium on Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol.
Biol. 53:159; Taketo (1988) Biochim. Biophys. Acta 949:318; Escherichia], [Chassy et al. (1987) FEMS Microbiol.
Lett. 44:173, Lactobacillus]; [Fiedler et al (1988) Anal. Biochem. 170:38, Pseudomonas]; [Augustin et al (1990)
FEMS Microbiol. Lett. 66:203, Staphylococcus], [Barany et al. (1980) J. Bacteriol. 144:698; Harlander (1987)
"Transformation of Streptococcus lactis by electroporation," in: Streptococcal Genetics (ed. J. Ferretti and R. Curtiss
III); Perry et al. (1981) Infect. Immun. 32:1295; Powell et al. (1988) Appl. Environ. Microbiol. 54:655; Somkuti et al
(1987) Proc. 4th Eur. Cong. Biotechnology 1:412, Streptococcus].

v. Yeast Expression

Yeast expression systems are also known to one of ordinary skill in the art. A yeast promoter is any DNA sequence
capable of binding yeast RNA polymerase and initiating the downstream (3') transcription of a coding sequence (eg.
structural gene) into mRNA. A promoter will have a transcription initiation region which is usually placed proximal
to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding
site (the "TATA Box") and a transcription initiation site. A yeast promoter may also have a second domain called an
upstream activator sequence (UAS), which, if present, is usually distal to the structural gene. The UAS permits
regulated (inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated expression may
be either positive or negative, thereby either enhancing or reducing transcription.

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences encoding enzymes in the
metabolic pathway provide particularly useful promoter sequences. Examples include alcohol dehydrogenase (ADH)
(EP-A-0 284 044), enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and pyruvate kinase (PyK)
(EPO-A-0 329 203). The yeast PHO5 gene, encoding acid phosphatase, also provides useful promoter sequences
[Myanohara et al. (1983) Proc. Natl. Acad. Sci. USA 80:1].

In addition, synthetic promoters which do not occur in nature also function as yeast promoters. For example, UAS
sequences of one yeast promoter may be joined with the transcription activation region of another yeast promoter,
creating a synthetic hybrid promoter. Examples of such hybrid promoters include the ADH regulatory sequence
linked to the GAP transcription activation region (US Patent Nos. 4,876,197 and 4,880,734). Other examples of
hybrid promoters include promoters which consist of the regulatory sequences of either the ADH2, GAL4, GAL10,
OR PHO5 genes, combined with the transcriptional activation region of a glycolytic enzyme gene such as GAP or
PyK (EP-A-0 164 556). Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of such promoters include,
inter alia, [Cohen et al. (1980) Proc. Natl. Acad. Sci. USA 77:1078; Henikoff et al. (1981) Nature 283:835;
Hollenberg et al. (1981) Curr. Topics Microbiol. Immunol. 96:119; Hollenberg et al. (1979) "The Expression of
Bacterial Antibiotic Resistance Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical,
Environmental and Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al. (1980)
Gene 11:163; Panthier et al. (1980) Curr. Genet. 2:109].

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may be directly linked with the
DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will always be a
methionine, which is encoded by the ATG start codon. If desired, methionine at the N-terminus may be cleaved from
the protein by in vitro incubation with cyanogen bromide.

Fusion proteins provide an alternative for yeast expression systems, as well as in mammalian, baculovirus, and
bacterial expression systems. Usually, a DNA sequence encoding the N-terminal portion of an endogenous yeast
protein, or other stable protein, is fused to the 5' end of heterologous coding sequences. Upon expression, this
construct will provide a fusion of the two amino acid sequences. For example, the yeast or human superoxide
dismutase (SOD) gene can be linked at the 5' terminus of a foreign gene and expressed in yeast. The DNA sequence
at the junction of the two amino acid sequences may or may not encode a cleavable site. See eg. EP-A-0 196 056.
Another example is a ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that preferably
retains a site for a processing enzyme (eg. ubiquitin-specific processing protease) to cleave the ubiquitin from the
foreign protein. Through this method, therefore, native foreign protein can be isolated (eg. WO88/024066).

Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric DNA
molecules that encode a fusion protein comprised of a leader sequence fragment that provides for secretion in yeast of
the foreign protein. Preferably, there are processing sites encoded between the leader fragment and the foreign gene
that can be cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal peptide
comprised of hydrophobic amino acids which direct the secretion of the protein from the cell.

DNA encoding suitable signal sequences can be derived from genes for secreted yeast proteins, such as the yeast 
invertase gene (EP-A-0 012 873; JPO. 62,096,086) and the A-factor gene (US patent 4,588,684). Alternatively, 
leaders of non-yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EP-A-0 060 
057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast alpha-factor gene, which contains
both a "pre" signal sequence, and a "pro" region. The types of alpha-factor fragments that can be employed include
the full-length pre-pro alpha factor leader (about 83 amino acid residues) as well as truncated alpha-factor leaders
(usually about 25 to about 50 amino acid residues) (US Patents 4,546,083 and 4,870,008; EP-A-0 324 274).
Additional leaders employing an alpha-factor leader fragment that provides for secretion include hybrid alpha-factor
leaders made with a presequence of a first yeast, but a pro-region from a second yeast alpha-factor (eg. see WO
89/02463).

Usually, transcription termination sequences recognized by yeast are regulatory regions located 3' to the translation
stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the transcription
of an mRNA which can be translated into the polypeptide encoded by the DNA. Examples of transcription terminator
sequences and other yeast-recognized termination sequences include those of genes coding for glycolytic enzymes.

Usually, the above described components, comprising a promoter, leader (if desired), coding sequence of interest, and
transcription termination sequence, are put together into expression constructs. Expression constructs are often
maintained in a replicon, such as an extrachromosomal element (eg. plasmids) capable of stable maintenance in a
host, such as yeast or bacteria. The replicon may have two replication systems, thus allowing it to be maintained, for
example, in yeast for expression and in a prokaryotic host for cloning and amplification. Examples of such
yeast-bacteria shuttle vectors include YEp24 [Botstein et al (1979) Gene 3:17-24], pCl/1 [Brake et al (1984) Proc. Natl.
Acad. Sci. USA 81:4642-4646], and YRp17 [Stinchcomb et al (1982) J. Mol. Biol. 158:157]. In addition, a replicon
may be either a high or low copy number plasmid. A high copy number plasmid will generally have a copy number
ranging from about 5 to about 200, and usually about 10 to about 150. A host containing a high copy number plasmid
will preferably have at least about 10, and more preferably at least about 20. Either a high or low copy number vector
may be selected, depending upon the effect of the vector and the foreign protein on the host. See eg. Brake et al,
supra.

Alternatively, the expression constructs can be integrated into the yeast genome with an integrating vector.
Integrating vectors usually contain at least one sequence homologous to a yeast chromosome that allows the vector to
integrate, and preferably contain two homologous sequences flanking the expression construct. Integrations appear to
result from recombinations between homologous DNA in the vector and the yeast chromosome [Orr-Weaver et al
(1983) Methods in Enzymol. 101:228-245]. An integrating vector may be directed to a specific locus in yeast by
selecting the appropriate homologous sequence for inclusion in the vector. See Orr-Weaver et al, supra. One or more
expression constructs may integrate, possibly affecting levels of recombinant protein produced [Rine et al. (1983)
Proc. Natl. Acad. Sci. USA 80:6750]. The chromosomal sequences included in the vector can occur either as a single
segment in the vector, which results in the integration of the entire vector, or as two segments homologous to adjacent
segments in the chromosome and flanking the expression construct in the vector, which can result in the stable
integration of only the expression construct.

Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow for the
selection of yeast strains that have been transformed. Selectable markers may include biosynthetic genes that can be
expressed in the yeast host, such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, which
confer resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable selectable marker may
also provide yeast with the ability to grow in the presence of toxic compounds, such as metal. For example, the
presence of CUP1 allows yeast to grow in the presence of copper ions [Butt et al. (1987) Microbiol. Rev. 51:351].

Alternatively, some of the above described components can be put together into transformation vectors.
Transformation vectors are usually comprised of a selectable marker that is either maintained in a replicon or
developed into an integrating vector, as described above.

Expression and transformation vectors, either extrachromosomal replicons or integrating vectors, have been
developed for transformation into many yeasts. For example, expression vectors have been developed for, inter alia,
the following yeasts: Candida albicans [Kurtz, et al. (1986) Mol. Cell. Biol. 6:142], Candida maltosa [Kunze, et al.
(1985) J. Basic Microbiol. 25:141], Hansenula polymorpha [Gleeson, et al. (1986) J. Gen. Microbiol. 132:3459;
Roggenkamp et al. (1986) Mol. Gen. Genet. 202:302], Kluyveromyces fragilis [Das, et al. (1984) J. Bacteriol.
158:1165], Kluyveromyces lactis [De Louvencourt et al. (1983) J. Bacteriol. 154:737; Van den Berg et al. (1990)
Bio/Technology 8:135], Pichia guillerimondii [Kunze et al. (1985) J. Basic Microbiol. 25:141], Pichia pastoris
[Cregg, et al. (1985) Mol. Cell. Biol. 5:3376; US Patent Nos. 4,837,148 and 4,929,555], Saccharomyces cerevisiae
[Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75:1929; Ito et al. (1983) J. Bacteriol. 153:163],
Schizosaccharomyces pombe [Beach and Nurse (1981) Nature 300:706], and Yarrowia lipolytica [Davidow, et al.
(1985) Curr. Genet. 10:39; Gaillardin, et al. (1985) Curr. Genet. 10:49].

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, and usually include either the
transformation of spheroplasts or of intact yeast cells treated with alkali cations. Transformation procedures usually
vary with the yeast species to be transformed. See eg. [Kurtz et al. (1986) Mol. Cell. Biol. 6:142; Kunze et al. (1985)
J. Basic Microbiol. 25:141; Candida]; [Gleeson et al. (1986) J. Gen. Microbiol. 132:3459; Roggenkamp et al. (1986)
Mol. Gen. Genet. 202:302; Hansenula]; [Das et al. (1984) J. Bacteriol. 158:1165; De Louvencourt et al. (1983) J.
Bacteriol. 154:1165; Van den Berg et al. (1990) Bio/Technology 8:135; Kluyveromyces]; [Cregg et al. (1985) Mol.
Cell. Biol. 5:3376; Kunze et al. (1985) J. Basic Microbiol. 25:141; US Patent Nos. 4,837,148 and 4,929,555; Pichia];
[Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75:1929; Ito et al. (1983) J. Bacteriol. 153:163; Saccharomyces];
[Beach and Nurse (1981) Nature 300:706; Schizosaccharomyces]; [Davidow et al. (1985) Curr. Genet. 10:39;
Gaillardin et al. (1985) Curr. Genet. 10:49; Yarrowia].

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides composed of at least one
antibody combining site. An "antibody combining site" is the three-dimensional binding space with an internal
surface shape and charge distribution complementary to the features of an epitope of an antigen, which allows a
binding of the antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid antibodies,
chimeric antibodies, humanised antibodies, altered antibodies, univalent antibodies, Fab proteins, and single domain
antibodies.

Antibodies against the proteins of the invention are useful for affinity chromatography, immunoassays, and
distinguishing/identifying Neisseria proteins.

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be prepared by conventional
methods. In general, the protein is first used to immunize a suitable animal, preferably a mouse, rat, rabbit or goat.
Rabbits and goats are preferred for the preparation of polyclonal sera due to the volume of serum obtainable, and the
availability of labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing or
emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete adjuvant, and injecting the
mixture or emulsion parenterally (generally subcutaneously or intramuscularly). A dose of 50-200 µg/injection is
typically sufficient. Immunization is generally boosted 2-6 weeks later with one or more injections of the protein in
saline, preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by in vitro
immunization using methods known in the art, which for the purposes of this invention is considered equivalent to in
vivo immunization. Polyclonal antiserum is obtained by bleeding the immunized animal into a glass or plastic container,
incubating the blood at 25°C for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by
centrifugation (eg. 1,000g for 10 minutes). About 20-50 ml per bleed may be obtained from rabbits.
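
The centrifugation step above is stated as a relative centrifugal force (g) rather than a rotor speed. As a minimal sketch, the usual conversion RCF = 1.118 x 10^-5 x r x rpm^2 (r in centimetres) can be applied to find the speed for a given rotor. The function names and the 10 cm rotor radius in the usage lines are illustrative assumptions and do not come from this document.

```python
import math

# Standard constant in RCF = K * radius_cm * rpm**2 (radius in centimetres).
RCF_CONSTANT = 1.118e-5


def rcf_from_rpm(rpm: float, rotor_radius_cm: float) -> float:
    """Relative centrifugal force (multiples of g) for a given rotor speed."""
    if rpm < 0 or rotor_radius_cm <= 0:
        raise ValueError("rpm must be >= 0 and rotor radius must be > 0")
    return RCF_CONSTANT * rotor_radius_cm * rpm ** 2


def rpm_from_rcf(rcf: float, rotor_radius_cm: float) -> float:
    """Rotor speed (rpm) needed to reach a target RCF at the given radius."""
    if rcf < 0 or rotor_radius_cm <= 0:
        raise ValueError("rcf must be >= 0 and rotor radius must be > 0")
    return math.sqrt(rcf / (RCF_CONSTANT * rotor_radius_cm))


# Hypothetical usage: a rotor with an assumed 10 cm radius, target 1,000g.
speed = rpm_from_rcf(1000.0, 10.0)
print(f"about {speed:.0f} rpm")
```

The speed depends on the rotor radius, so the value printed here applies only to the assumed rotor.
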
Monoclonal antibodies are prepared using the standard method of Kohler & Milstein [Nature (1975) 256:495-96], or
a modification thereof. Typically, a mouse or rat is immunized as described above. However, rather than bleeding the
animal to extract serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into single
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent cells) by applying a cell
suspension to a plate or well coated with the protein antigen. B-cells expressing membrane-bound immunoglobulin
specific for the antigen bind to the plate, and are not rinsed away with the rest of the suspension. Resulting B-cells, or
all dissociated spleen cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a
selective medium (eg. hypoxanthine, aminopterin, thymidine medium, "HAT"). The resulting hybridomas are plated
by limiting dilution, and are assayed for production of antibodies which bind specifically to the immunizing antigen
(and which do not bind to unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in
vitro (eg. in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice).
If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using conventional techniques. Suitable
labels include fluorophores, chromophores, radioactive atoms (particularly 32P and 125I), electron-dense reagents,
enzymes, and ligands having specific binding partners. Enzymes are typically detected by their activity. For example,
horseradish peroxidase is usually detected by its ability to convert 3,3',5,5'-tetramethylbenzidine (TMB) to a blue
pigment, quantifiable with a spectrophotometer. "Specific binding partner" refers to a protein capable of binding a
ligand molecule with high specificity, as for example in the case of an antigen and a monoclonal antibody specific
therefor. Other specific binding partners include biotin and avidin or streptavidin, IgG and protein A, and the
numerous receptor-ligand couples known in the art. It should be understood that the above description is not meant to
categorize the various labels into distinct classes, as the same label may serve in several different modes. For
example, 125I may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or as
antigen for a MAb. Further, one may combine various labels for desired effect. For example, MAbs and avidin also
require labels in the practice of this invention: thus, one might label a MAb with biotin, and detect its presence with
avidin labeled with 125I, or with an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be
readily apparent to those of ordinary skill in the art, and are considered as equivalents within the scope of the instant
invention.

Pharmaceutical Compositions

Pharmaceutical compositions can comprise either polypeptides, antibodies, or nucleic acid of the invention. The 
pharmaceutical compositions will comprise a therapeutically effective amount of either polypeptides, antibodies, or 
polynucleotides of the claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a therapeutic agent to treat,
ameliorate, or prevent a desired disease or condition, or to exhibit a detectable therapeutic or preventative effect. The
effect can be detected by, for example, chemical markers or antigen levels. Therapeutic effects also include reduction
in physical symptoms, such as decreased body temperature. The precise effective amount for a subject will depend
upon the subject's size and health, the nature and extent of the condition, and the therapeutics or combination of
therapeutics selected for administration. Thus, it is not useful to specify an exact effective amount in advance.
However, the effective amount for a given situation can be determined by routine experimentation and is within the
judgement of the clinician.

For purposes of the present invention, an effective dose will be from about 0.01 mg/kg to 50 mg/kg or 0.05 mg/kg to
about 10 mg/kg of the DNA constructs in the individual to which it is administered.
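
As a worked illustration of the per-kilogram ranges just stated, the sketch below scales them to a body weight. It is arithmetic only, not dosing guidance; the function name and the 70 kg example weight are assumptions introduced here, while the two ranges are the ones given in the text.

```python
# Dose ranges from the text, in mg of DNA construct per kg of body weight.
BROAD_RANGE_MG_PER_KG = (0.01, 50.0)
NARROW_RANGE_MG_PER_KG = (0.05, 10.0)


def dose_range_mg(body_weight_kg: float,
                  range_mg_per_kg: tuple[float, float]) -> tuple[float, float]:
    """Scale a (low, high) mg/kg range to total mg for one individual."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = range_mg_per_kg
    return (low * body_weight_kg, high * body_weight_kg)


# Hypothetical usage: an assumed 70 kg individual.
broad = dose_range_mg(70.0, BROAD_RANGE_MG_PER_KG)
narrow = dose_range_mg(70.0, NARROW_RANGE_MG_PER_KG)
print(f"broad range:  {broad[0]:.2f} to {broad[1]:.2f} mg")
print(f"narrow range: {narrow[0]:.2f} to {narrow[1]:.2f} mg")
```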

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. The term "pharmaceutically
acceptable carrier" refers to a carrier for administration of a therapeutic agent, such as antibodies or a polypeptide,
genes, and other therapeutic agents. The term refers to any pharmaceutical carrier that does not itself induce the
production of antibodies harmful to the individual receiving the composition, and which may be administered without
undue toxicity. Suitable carriers may be large, slowly metabolized macromolecules such as proteins, polysaccharides,
polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus particles. Such
carriers are well known to those of ordinary skill in the art.

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts such as hydrochlorides,
hydrobromides, phosphates, sulfates, and the like; and the salts of organic acids such as acetates, propionates,
malonates, benzoates, and the like. A thorough discussion of pharmaceutically acceptable excipients is available in
Remington's Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991).

Pharmaceutically acceptable carriers in therapeutic compositions may contain liquids such as water, saline, glycerol
and ethanol. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and
the like, may be present in such vehicles. Typically, the therapeutic compositions are prepared as injectables, either as
liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection
may also be prepared. Liposomes are included within the definition of a pharmaceutically acceptable carrier.

Delivery Methods

Once formulated, the compositions of the invention can be administered directly to the subject. The subjects to be 
treated can be animals; in particular, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either subcutaneously,
intraperitoneally, intravenously or intramuscularly or delivered to the interstitial space of a tissue. The compositions
can also be administered into a lesion. Other modes of administration include oral and pulmonary administration,
suppositories, and transdermal or transcutaneous applications (eg. see WO98/20734), needles, and gene guns or
hyposprays. Dosage treatment may be a single dose schedule or a multiple dose schedule.

Vaccines

Vaccines according to the invention may either be prophylactic (ie. to prevent infection) or therapeutic (ie. to treat
disease after infection).

Such vaccines comprise immunising antigen(s), immunogen(s), polypeptide(s), protein(s) or nucleic acid, usually in
combination with "pharmaceutically acceptable carriers" which include any carrier that does not itself induce the
production of antibodies harmful to the individual receiving the composition. Suitable carriers are typically large,
slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric
amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or liposomes), and inactive virus particles.
Such carriers are well known to those of ordinary skill in the art. Additionally, these carriers may function as
immunostimulating agents ("adjuvants"). Furthermore, the antigen or immunogen may be conjugated to a bacterial
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens.

Preferred adjuvants to enhance effectiveness of the composition include, but are not limited to: (1) aluminum salts
(alum), such as aluminum hydroxide, aluminum phosphate, aluminum sulfate, etc; (2) oil-in-water emulsion
formulations (with or without other specific immunostimulating agents such as muramyl peptides (see below) or
bacterial cell wall components), such as for example (a) MF59™ (WO 90/14837; Chapter 10 in Vaccine design: the
subunit and adjuvant approach, eds. Powell & Newman, Plenum Press 1995), containing 5% Squalene, 0.5% Tween
80, and 0.5% Span 85 (optionally containing various amounts of MTP-PE (see below), although not required)
formulated into submicron particles using a microfluidizer such as Model 110Y microfluidizer (Microfluidics,
Newton, MA), (b) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger particle size
emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 0.2%
Tween 80, and one or more bacterial cell wall components from the group consisting of monophosphorylipid A
(MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); (3) saponin
adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be used or particles generated therefrom
such as ISCOMs (immunostimulating complexes); (4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's
Adjuvant (IFA); (5) cytokines, such as interleukins (eg. IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), interferons (eg.
gamma interferon), macrophage colony stimulating factor (M-CSF), tumor necrosis factor (TNF), etc; and (6) other
substances that act as immunostimulating agents to enhance the effectiveness of the composition. Alum and MF59™
are preferred.
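
The emulsion compositions listed under item (2) can be turned into component quantities for a given batch size. The sketch below is illustrative only: the text does not state whether the percentages are by volume or by weight, so treating them as volume fractions of the final batch is an assumption, as are the function name and the 100 ml batch in the usage lines. Components listed without a quantity (thr-MDP, MTP-PE, bacterial cell wall components) are left out.

```python
# Percentages as listed in the text. The basis (v/v or w/v) is not stated
# there; this sketch ASSUMES percent of final batch volume.
EMULSIONS = {
    "MF59": {"squalene": 5.0, "Tween 80": 0.5, "Span 85": 0.5},
    "SAF": {"squalane": 10.0, "Tween 80": 0.4,
            "pluronic-blocked polymer L121": 5.0},
    "RAS": {"squalene": 2.0, "Tween 80": 0.2},
}


def component_amounts(name: str, batch_volume_ml: float) -> dict[str, float]:
    """Amount of each listed component (ml, under the v/v assumption)."""
    if batch_volume_ml <= 0:
        raise ValueError("batch volume must be positive")
    percentages = EMULSIONS[name]  # KeyError for an unknown formulation
    return {component: pct * batch_volume_ml / 100.0
            for component, pct in percentages.items()}


# Hypothetical usage: an assumed 100 ml batch of the MF59 composition.
for component, amount in component_amounts("MF59", 100.0).items():
    print(f"{component}: {amount:.2f} ml")
```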

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl-muramyl-L-threonyl-D-isoglutamine
(thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-
L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (MTP-PE), etc.

The immunogenic compositions (eg. the immunising antigen/immunogen/polypeptide/protein/nucleic acid,
pharmaceutically acceptable carrier, and adjuvant) typically will contain diluents, such as water, saline, glycerol,
ethanol, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and
the like, may be present in such vehicles.

Typically, the immunogenic compositions are prepared as injectables, either as liquid solutions or suspensions; solid
forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be prepared. The
preparation also may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed above
under pharmaceutically acceptable carriers.

Immunogenic compositions used as vaccines comprise an immunologically effective amount of the antigenic or
immunogenic polypeptides, as well as any other of the above-mentioned components, as needed. By
"immunologically effective amount", it is meant that the administration of that amount to an individual, either in a
single dose or as part of a series, is effective for treatment or prevention. This amount varies depending upon the
health and physical condition of the individual to be treated, the taxonomic group of individual to be treated (eg.
nonhuman primate, primate, etc.), the capacity of the individual's immune system to synthesize antibodies, the degree
of protection desired, the formulation of the vaccine, the treating doctor's assessment of the medical situation, and
other relevant factors. It is expected that the amount will fall in a relatively broad range that can be determined
through routine trials.

The immunogenic compositions are conventionally administered parenterally, eg. by injection, either subcutaneously,
intramuscularly, or transdermally/transcutaneously (eg. WO98/20734). Additional formulations suitable for other
modes of administration include oral and pulmonary formulations, suppositories, and transdermal applications.
Dosage treatment may be a single dose schedule or a multiple dose schedule. The vaccine may be administered in
conjunction with other immunoregulatory agents.

As an alternative to protein-based vaccines, DNA vaccination may be used [eg. Robinson & Torres (1997) Seminars
in Immunol 9:271-283; Donnelly et al (1997) Annu Rev Immunol 15:617-648; later herein].

Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs including a coding sequence of a therapeutic of the invention, to be
delivered to the mammal for expression in the mammal, can be administered either locally or systemically. These
constructs can utilize viral or non-viral vector approaches in in vivo or ex vivo modality. Expression of such coding
sequence can be induced using endogenous mammalian or heterologous promoters. Expression of the coding
sequence in vivo can be either constitutive or regulated.

The invention includes gene delivery vehicles capable of expressing the contemplated nucleic acid sequences. The
gene delivery vehicle is preferably a viral vector and, more preferably, a retroviral, adenoviral, adeno-associated viral
(AAV), herpes viral, or alphavirus vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus,
papovavirus, paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, Jolly (1994)
Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; Connelly (1995) Human Gene
Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 6:148-153.

Retroviral vectors are well known in the art and we contemplate that any retroviral gene therapy vector is employable
in the invention, including B, C and D type retroviruses, xenotropic retroviruses (for example, NZB-X1, NZB-X2 and
NZB9-1 (see O'Neill (1985) J. Virol. 53:160)), polytropic retroviruses eg. MCF and MCF-MLV (see Kelly (1983) J.
Virol. 45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold Spring Harbor
Laboratory, 1985.

Portions of the retroviral gene therapy vector may be derived from different retroviruses. For example, retrovector
LTRs may be derived from a Murine Sarcoma Virus, a tRNA binding site from a Rous Sarcoma Virus, a packaging
signal from a Murine Leukemia Virus, and an origin of second strand synthesis from an Avian Leukosis Virus.
These recombinant retroviral vectors may be used to generate transduction competent retroviral vector particles by
introducing them into appropriate packaging cell lines (see US patent 5,591,624). Retrovirus vectors can be
constructed for site-specific integration into host cell DNA by incorporation of a chimeric integrase enzyme into the
retroviral particle (see WO96/37626). It is preferable that the recombinant viral vector is a replication defective
recombinant virus.

Packaging cell lines suitable for use with the above-described retrovirus vectors are well known in the art, are readily
prepared (see WO95/30763 and WO92/05266), and can be used to create producer cell lines (also termed vector cell
lines or "VCLs") for the production of recombinant vector particles. Preferably, the packaging cell lines are made
from human parent cells (eg. HT1080 cells) or mink parent cell lines, which eliminates inactivation in human serum.
Preferred retroviruses for the construction of retroviral gene therapy vectors include Avian Leukosis Virus, Bovine
Leukemia Virus, Murine Leukemia Virus, Mink-Cell Focus-Inducing Virus, Murine Sarcoma Virus,
Reticuloendotheliosis Virus and Rous Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include
4070A and 1504A (Hartley and Rowe (1976) J. Virol. 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC No.
VR-245), Graffi, Gross (ATCC No. VR-590), Kirsten, Harvey Sarcoma Virus and Rauscher (ATCC No. VR-998)
and Moloney Murine Leukemia Virus (ATCC No. VR-190). Such retroviruses may be obtained from depositories or
collections such as the American Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known
sources using commonly available techniques.

Exemplary known retroviral gene therapy vectors employable in this invention include those described in patent
applications GB2200651, EP0415731, EP0345242, EP0334301, WO89/02468; WO89/05349, WO89/09271,
WO90/02806, WO90/07936, WO94/03622, W 093/256^
WO91/02825, WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 53:962-967; Ram (1993) Cancer
Res 53 (1993) 83-88; Takamiya (1992) J Neurosci Res 33:493-503; Baba (1993) J Neurosurg 79:729-735; Mann
(1983) Cell 33:153; Cane (1984) Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy 1.

Human adenoviral gene therapy vectors are also known in the art and employable in this invention. See, for example,
Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) Science 252:431, and WO93/07283, WO93/06223, and
WO93/07282. Exemplary known adenoviral gene therapy vectors employable in this invention include those
described in the above referenced documents and in WO94/12649, WO93/03769, WO93/19191, WO94/28938,
WO95/11984, WO95/00655, WO95/27071, WO95/29993, WO95/34671, WO9
WO93/06223, WO94/24299, WO95/14102, WO95/24297, WO95/02697, WO94/28152, WO94/24299, WO95/09241,
WO95/25807, WO95/05835, WO94/18922 and WO95/09654. Alternatively, administration of DNA linked to killed
adenovirus as described in Curiel (1992) Hum. Gene Ther. 3:147-154 may be employed. The gene delivery vehicles
of the invention also include adenovirus associated virus (AAV) vectors. Leading and preferred examples of such
vectors for use in this invention are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred
AAV vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are modified by
substitution of nucleotides, such that at least 5 native nucleotides and up to 18 native nucleotides, preferably at least
10 native nucleotides up to 18 native nucleotides, most preferably 10 native nucleotides are retained and the
remaining nucleotides of the D-sequence are deleted or replaced with non-native nucleotides. The native D-sequences
of the AAV inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted terminal
repeat (ie. there is one sequence at each end) which are not involved in HP formation. The non-native replacement
nucleotide may be any nucleotide other than the nucleotide found in the native D-sequence in the same position.
Other employable exemplary AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993)
Gene 124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. Virol. 61:3096).
Another exemplary AAV vector is the Double-D ITR vector. Construction of the Double-D ITR vector is disclosed in
US Patent 5,478,745. Still other vectors are those disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent
5,139,941, Chartejee US Patent 5,474,935, and Kotin WO94/288157. Yet a further example of an AAV vector
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and albumin promoter and
directs expression predominantly in the liver. Its structure and construction are disclosed in Su (1996) Human Gene
Therapy 7:463-470. Additional AAV gene therapy vectors are described in US 5,354,678, US 5,173,414, US
5,139,941, and US 5,252,479.

The gene therapy vectors of the invention also include herpes vectors. Leading and preferred examples are herpes
simplex virus vectors containing a sequence encoding a thymidine kinase polypeptide such as those disclosed in US
5,288,641 and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include HFEM/ICP6-LacZ
disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller (1988) Science 241:1667-1669 and in
WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ described in Fink (1992) Human Gene Therapy 3:11-19 and
HSV 7134, 2 RH 105 and GAL4 described in EP 0453242 (Breakefield), and those deposited with the ATCC with
accession numbers VR-977 and VR-260.

Also contemplated are alpha virus gene therapy vectors that can be employed in this invention. Preferred alpha virus
vectors are Sindbis viruses vectors. Togaviruses, Semliki Forest virus (ATCC VR-67; ATCC VR-1247), Middleberg
virus (ATCC VR-370), Ross River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis virus
(ATCC VR-923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US patents 5,091,309,
5,217,879, and WO92/10578. More particularly, those alpha virus vectors described in US Serial No. 08/405,627,
filed March 15, 1995, WO94/21792, WO92/10578, WO95/07994, US 5,091,309 and US 5,217,879 are employable.
Such alpha viruses may be obtained from depositories or collections such as the ATCC in Rockville, Maryland or
isolated from known sources using commonly available techniques. Preferably, alphavirus vectors with reduced
cytotoxicity are used (see USSN 08/679640).

DNA vector systems such as eukaryotic layered expression systems are also useful for expressing the nucleic acids of
the invention. See WO95/07994 for a detailed description of eukaryotic layered expression systems. Preferably, the
eukaryotic layered expression systems of the invention are derived from alphavirus vectors and most preferably from
Sindbis viral vectors.

Other viral vectors suitable for use in the present invention include those derived from poliovirus, for example ATCC
VR-58 and those described in Evans, Nature 339 (1989) 385 and Sabin (1973) J. Biol. Standardization 1:115;
rhinovirus, for example ATCC VR-1110 and those described in Arnold (1990) J Cell Biochem L401; pox viruses
such as canary pox virus or vaccinia virus, for example ATCC VR-111 and ATCC VR-2010 and those described in
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, Flexner (1990) Vaccine
8:17; in US 4,603,112 and US 4,769,330 and WO89/01973; SV40 virus, for example ATCC VR-305 and those
described in Mulligan (1979) Nature 277:108 and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example
ATCC VR-797 and recombinant influenza viruses made employing reverse genetics techniques as described in US
5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) J Virol 65:2711-2713 and
Luytjes (1989) Cell 59:110, (see also McMichael (1983) NEJ Med 309:13, and Yap (1978) Nature 273:238 and
Nature (1979) 277:108); human immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J.
Virol. 66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP-0440219; Aura
virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 and ATCC VR-1240; Cabassou virus,
for example ATCC VR-922; Chikungunya virus, for example ATCC VR-64 and ATCC VR-1241; Fort Morgan
Virus, for example ATCC VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus,
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for example ATCC VR-580
and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; Pixuna virus, for example ATCC VR-372 and
ATCC VR-1245; Tonate virus, for example ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for
example ATCC VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC VR-375;
O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC VR-1242; Western encephalitis
virus, for example ATCC VR-70, ATCC VR-1251, ATCC VR-622 and ATCC VR-1252; and coronavirus, for
example ATCC VR-740 and those described in Hamre (1966) Proc Soc Exp Biol Med 121:190.

Delivery of the compositions of this invention into cells is not limited to the above mentioned viral vectors. Other
delivery methods and media may be employed such as, for example, nucleic acid expression vectors, polycationic
condensed DNA linked or unlinked to killed adenovirus alone, for example see US Serial No. 08/366,787, filed
December 30, 1994 and Curiel (1992) Hum Gene Ther 3:147-154, ligand linked DNA, for example see Wu (1989) J
Biol Chem 264:16983-16987, eucaryotic cell delivery vehicles cells, for example see US Serial No. 08/240,030, filed
May 9, 1994, and US Serial No. 08/404,796, deposition of photopolymerized hydrogel materials, hand-held gene
transfer particle gun, as described in US Patent 5,149,655, ionizing radiation as described in US 5,206,152 and in
WO92/11033, nucleic charge neutralization or fusion with cell membranes. Additional approaches are described in
Philip (1994) Mol Cell Biol 14:2411-2418 and in Woffendin (1994) Proc Natl Acad Sci 91:1581-1585.

Particle mediated gene transfer may be employed, for example see US Serial No. 60/023,867. Briefly, the sequence
can be inserted into conventional vectors that contain conventional control sequences for high level expression, and
then incubated with synthetic gene transfer molecules such as polymeric DNA-binding cations like polylysine,
protamine, and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & Wu (1987)
J. Biol. Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem Pharmacol 40:253-263, galactose as
described in Plank (1992) Bioconjugate Chem 3:533-539, lactose or transferrin.

Naked DNA may also be employed. Exemplary naked DNA introduction methods are described in WO 90/11092 and
US 5,580,859. Uptake efficiency may be improved using biodegradable latex beads. DNA coated latex beads are
efficiently transported into cells after endocytosis initiation by the beads. The method may be improved further by
treatment of the beads to increase hydrophobicity and thereby facilitate disruption of the endosome and release of the
DNA into the cytoplasm.

Liposomes that can act as gene delivery vehicles are described in US 5,422,120, WO95/13796, WO94/23697,
WO91/14445 and EP-524,968. As described in USSN. 60/023,867, on non-viral delivery, the nucleic acid sequences
encoding a polypeptide can be inserted into conventional vectors that contain conventional control sequences for high
level expression, and then be incubated with synthetic gene transfer molecules such as polymeric DNA-binding
cations like polylysine, protamine, and albumin, linked to cell targeting ligands such as asialoorosomucoid, insulin,
galactose, lactose, or transferrin. Other delivery systems include the use of liposomes to encapsulate DNA comprising
the gene under the control of a variety of tissue-specific or ubiquitously-active promoters. Further non-viral delivery
suitable for use includes mechanical delivery systems such as the approach described in Woffendin et al (1994) Proc.
Natl. Acad. Sci. USA 91(24):11581-11585. Moreover, the coding sequence and the product of expression of such can
be delivered through deposition of photopolymerized hydrogel materials. Other conventional methods for gene
delivery that can be used for delivery of the coding sequence include, for example, use of hand-held gene transfer
particle gun, as described in US 5,149,655; use of ionizing radiation for activating transferred gene, as described in
US 5,206,152 and WO92/11033.

Exemplary liposome and polycationic gene delivery vehicles are those described in US 5,422,120 and 4,762,915; in
WO 95/13796; WO94/23697; and WO91/14445; in EP-0524968; and in Stryer, Biochemistry, pages 236-240 (1975)
W.H. Freeman, San Francisco; Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta
550:464; Rivnay (1987) Meth Enzymol 149:119; Wang (1987) Proc Natl Acad Sci 84:7851; Plant (1989) Anal
Biochem 176:420.

A polynucleotide composition can comprise a therapeutically effective amount of a gene therapy vehicle, as the term
is defined above. For purposes of the present invention, an effective dose will be from about 0.01 mg/kg to 50 mg/kg
or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to which it is administered.

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be administered (1) directly to the subject; (2)
delivered ex vivo, to cells derived from the subject; or (3) in vitro for expression of recombinant proteins. The 
subjects to be treated can be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either subcutaneously,
intraperitoneally, intravenously or intramuscularly or delivered to the interstitial space of a tissue. The compositions
can also be administered into a lesion. Other modes of administration include oral and pulmonary administration,
suppositories, and transdermal or transcutaneous applications (eg. see WO98/20734), needles, and gene guns or
hyposprays. Dosage treatment may be a single dose schedule or a multiple dose schedule.

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject are known in the art and
described in eg. WO93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells,
particularly hematopoietic, lymph cells, macrophages, dendritic cells, or tumor cells.

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be accomplished by the following
procedures, for example, dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated
transfection, protoplast fusion, electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct
microinjection of the DNA into nuclei, all well known in the art.

Polynucleotide and polypeptide pharmaceutical compositions

In addition to the pharmaceutically acceptable carriers and salts described above, the following additional agents can
be used with polynucleotide and/or polypeptide compositions.

A. Polypeptides

One example is polypeptides, which include, without limitation: asialoorosomucoid (ASOR); transferrin;
asialoglycoproteins; antibodies; antibody fragments; ferritin; interleukins; interferons; granulocyte-macrophage
colony stimulating factor (GM-CSF), granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating
factor (M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can also be used. Also,
proteins from other invasive organisms can be used, such as the 17 amino acid peptide from the circumsporozoite
protein of Plasmodium falciparum known as RII.

B. Hormones, Vitamins, etc.

Other groups that can be included are, for example: hormones, steroids, androgens, estrogens, thyroid hormone, or
vitamins, folic acid.

C. Polyalkylenes, Polysaccharides, etc.

Also, polyalkylene glycol can be included with the desired polynucleotides/polypeptides. In a preferred embodiment,
the polyalkylene glycol is polyethylene glycol. In addition, mono-, di-, or polysaccharides can be included. In a
preferred embodiment of this aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and
poly(lactide-co-glycolide) can be included.

D. Lipids, and Liposomes 

The desired polynucleotide/polypeptide can also be encapsulated in lipids or packaged in liposomes prior to delivery 
to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to stably bind or entrap and retain 
nucleic acid. The ratio of condensed polynucleotide to lipid preparation can vary but will generally be around 1:1 (mg
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for delivery of nucleic 
acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; Straubinger (1983) Meth. Enzymol. 
101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively charged), anionic (negatively
charged) and neutral preparations. Cationic liposomes have been shown to mediate intracellular delivery of plasmid
DNA (Felgner (1987) Proc. Natl. Acad. Sci. USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl. Acad. Sci. USA
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol. Chem. 265:10189-10192), in functional form.

Cationic liposomes are readily available. For example, N-[1-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium
(DOTMA) liposomes are available under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also,
Felgner supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and DOTAP/DOPE
(Boehringer). Other cationic liposomes can be prepared from readily available materials using techniques well known
in the art. See, eg. Szoka (1978) Proc. Natl. Acad. Sci. USA 75:4194-4198; WO90/11092 for a description of the
synthesis of DOTAP (1,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes.

Similarly, anionic and neutral liposomes are readily available, such as from Avanti Polar Lipids (Birmingham, AL), 
or can be easily prepared using readily available materials. Such materials include phosphatidyl choline, cholesterol,
phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG),
dioleoylphosphatidyl ethanolamine (DOPE), among others. These materials can also be mixed with the DOTMA and
DOTAP starting materials in appropriate ratios. Methods for making liposomes using these materials are well known 
in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar vesicles (SUVs), or large unilamellar
vesicles (LUVs). The various liposome-nucleic acid complexes are prepared using methods known in the art. See eg.
Straubinger (1983) Meth. Enzymol. 101:512-527; Szoka (1978) Proc. Natl. Acad. Sci. USA 75:4194-4198;
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77; Deamer & Bangham (1976)
Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. Commun. 76:836; Fraley (1979) Proc. Natl.
Acad. Sci. USA 76:3348; Enoch & Strittmatter (1979) Proc. Natl. Acad. Sci. USA 76:145; Fraley (1980) J. Biol.
Chem. 255:10431; Szoka & Papahadjopoulos (1978) Proc. Natl. Acad. Sci. USA 75:145; and Schaefer-Ridder
(1982) Science 215:166.

E. Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide/polypeptide to be delivered. Examples of
lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, and VLDL. Mutants, fragments, or fusions of
these proteins can also be used. Also, modifications of naturally occurring lipoproteins can be used, such as
acetylated LDL. These lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein
receptors. Preferably, if lipoproteins are included with the polynucleotide to be delivered, no other targeting ligand is
included in the composition.

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein portions are known as apoproteins.
At present, apoproteins A, B, C, D, and E have been isolated and identified. At least two of these contain several
proteins, designated by Roman numerals: AI, AII, AIV; CI, CII, CIII.

A lipoprotein can comprise more than one apoprotein. For example, naturally occurring chylomicrons comprise
A, B, C & E; over time these lipoproteins lose A and acquire C & E. VLDL comprises A, B, C & E apoproteins, LDL
comprises apoprotein B; and HDL comprises apoproteins A, C, & E.

The amino acid sequences of these apoproteins are known and are described in, for example, Breslow (1985) Annu Rev.
Biochem 54:699; Law (1986) Adv. Exp Med. Biol. 151:162; Chen (1986) J Biol Chem 261:12918; Kane (1980) Proc
Natl Acad Sci USA 77:2465; and Utermann (1984) Hum Genet 65:232.

Lipoproteins contain a variety of lipids including triglycerides, cholesterol (free and esters), and phospholipids. The
composition of the lipids varies in naturally occurring lipoproteins. For example, chylomicrons comprise mainly
triglycerides. A more detailed description of the lipid content of naturally occurring lipoproteins can be found, for
example, in Meth. Enzymol. 128 (1986). The composition of the lipids is chosen to aid in conformation of the
apoprotein for receptor binding activity. The composition of lipids can also be chosen to facilitate hydrophobic
interaction and association with the polynucleotide binding molecule.

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for instance. Such methods are
described in Meth. Enzymol. (supra); Pitas (1980) J. Biochem. 255:5454-5460 and Mahey (1979) J Clin. Invest
64:743-750. Lipoproteins can also be produced by in vitro or recombinant methods by expression of the apoprotein
genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev Biophys Chem 15:403 and Radding (1958)
Biochim Biophys Acta 30: 443. Lipoproteins can also be purchased from commercial suppliers, such as Biomedical
Technologies, Inc., Stoughton, Massachusetts, USA. Further description of lipoproteins can be found in Zuckermann
et al. PCT/US97/14465.

F. Polycationic Agents

Polycationic agents can be included, with or without lipoprotein, in a composition with the desired 
polynucleotide/polypeptide to be delivered. 

Polycationic agents typically exhibit a net positive charge at physiologically relevant pH and are capable of
neutralizing the electrical charge of nucleic acids to facilitate delivery to a desired location. These agents have in
vitro, ex vivo, and in vivo applications. Polycationic agents can be used to deliver nucleic acids to a living subject
either intramuscularly, subcutaneously, etc.

The following are examples of useful polypeptides as polycationic agents: polylysine, polyarginine, polyornithine,
and protamine. Other examples include histones, protamines, human serum albumin, DNA binding proteins,
non-histone chromosomal proteins, and coat proteins from DNA viruses, such as ΦX174. Transcriptional factors also
contain domains that bind DNA and therefore may be useful as nucleic acid condensing agents. Briefly, transcriptional
factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, Oct-1, Oct-2, CREP, and TFIID contain
basic domains that bind DNA sequences.

Organic polycationic agents include: spermine, spermidine, and putrescine.

The dimensions and the physical properties of a polycationic agent can be extrapolated from the list above, to
construct other polypeptide polycationic agents or to produce synthetic polycationic agents.

Synthetic polycationic agents which are useful include, for example, DEAE-dextran and polybrene. Lipofectin™ and
lipofectAMINE™ are monomers that form polycationic complexes when combined with
polynucleotides/polypeptides.

Immunodiagnostic Assays

Neisseria antigens of the invention can be used in immunoassays to detect antibody levels (or, conversely,
anti-Neisseria antibodies can be used to detect antigen levels). Immunoassays based on well defined, recombinant
antigens can be developed to replace invasive diagnostic methods. Antibodies to Neisseria proteins within biological
samples, including for example, blood or serum samples, can be detected. Design of the immunoassays is subject to a
great deal of variation, and a variety of these are known in the art. Protocols for the immunoassay may be based, for
example, upon competition, or direct reaction, or sandwich type assays. Protocols may also, for example, use solid
supports, or may be by immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays which amplify the
signals from the probe are also known; examples of which are assays which utilize biotin and avidin, and
enzyme-labeled and mediated immunoassays, such as ELISA assays.

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by packaging the
appropriate materials, including the compositions of the invention, in suitable containers, along with the remaining
reagents and materials (for example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well
as a suitable set of assay instructions.

Nucleic Acid Hybridisation

"Hybridization" refers to the association of two nucleic acid sequences to one another by hydrogen bonding. 
Typically, one sequence will be fixed to a solid support and the other will be free in solution. Then, the two sequences 
will be placed in contact with one another under conditions that favor hydrogen bonding. Factors that affect this
bonding include: the type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to
block the non-specific attachment of the liquid phase sequence to the solid support (Denhardt's reagent or BLOTTO); 
concentration of the sequences; use of compounds to increase the rate of association of sequences (dextran sulfate or 
polyethylene glycol); and the stringency of the washing conditions following hybridization. See Sambrook et al 
[supra] Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of very similar sequences over
sequences that differ. For example, a combination of temperature and salt concentration should be chosen that is
approximately 12° to 20°C below the calculated Tm of the hybrid under study. The temperature and salt conditions
can often be determined empirically in preliminary experiments in which samples of genomic DNA immobilized on
filters are hybridized to the sequence of interest and then washed under conditions of different stringencies. See
Sambrook et al at page 9.50.

Variables to consider when performing, for example, a Southern blot are (1) the complexity of the DNA being blotted
and (2) the homology between the probe and the sequences being detected. The total amount of the fragment(s) to be
studied can vary by a magnitude of 10, from 0.1 to 1 µg for a plasmid or phage digest to 10⁻⁹ to 10⁻⁸ g for a single copy
gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, substantially shorter blotting,
hybridization, and exposure times, a smaller amount of starting polynucleotides, and lower specific activity of probes
can be used. For example, a single-copy yeast gene can be detected with an exposure time of only 1 hour starting with
1 µg of yeast DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10⁸ cpm/µg. For a
single-copy mammalian gene a conservative approach would start with 10 µg of DNA, blot overnight, and hybridize
overnight in the presence of 10% dextran sulfate using a probe of greater than 10⁸ cpm/µg, resulting in an exposure
time of ~24 hours.

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid between the probe and the fragment 
of interest, and consequently, the appropriate conditions for hybridization and washing. In many cases the probe is 
not 100% homologous to the fragment. Other commonly encountered variables include the length and total G+C 
content of the hybridizing sequences and the ionic strength and formamide content of the hybridization buffer. The 
effects of all of these factors can be approximated by a single equation: 

Tm = 81 + 16.6(log10 Ci) + 0.4[%(G + C)] - 0.6(%formamide) - 600/n - 1.5(%mismatch)

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid in base pairs (slightly modified
from Meinkoth & Wahl (1984) Anal. Biochem. 138: 267-284).
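
By way of illustration only, the approximation above can be evaluated numerically. The following Python sketch is not part of the original disclosure: the function name approximate_tm and the example input values are hypothetical, chosen only to show how each term of the equation contributes.

```python
import math


def approximate_tm(salt_molar, gc_percent, formamide_percent, length_bp,
                   mismatch_percent=0.0):
    """Approximate Tm (deg C) of a DNA-DNA hybrid using the equation in the text.

    salt_molar        -- Ci, monovalent ion concentration in mol/L
    gc_percent        -- %(G + C) of the hybridizing sequence
    formamide_percent -- % formamide in the hybridization buffer
    length_bp         -- n, length of the hybrid in base pairs
    mismatch_percent  -- % mismatch between probe and target
    """
    if salt_molar <= 0:
        raise ValueError("salt concentration must be positive")
    if length_bp <= 0:
        raise ValueError("hybrid length must be positive")
    return (
        81.0
        + 16.6 * math.log10(salt_molar)
        + 0.4 * gc_percent
        - 0.6 * formamide_percent
        - 600.0 / length_bp
        - 1.5 * mismatch_percent
    )


# Hypothetical inputs: 1 M salt, 50% G+C, 50% formamide, 600 bp hybrid.
perfect = approximate_tm(1.0, 50.0, 50.0, 600)
mismatched = approximate_tm(1.0, 50.0, 50.0, 600, mismatch_percent=10.0)
print(round(perfect, 1), round(mismatched, 1))
```

With these inputs, each 1% of mismatch lowers the estimate by 1.5°C, which is consistent with the need to reduce hybridization temperature for probes that are not completely homologous.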

In designing a hybridization experiment, some factors affecting nucleic acid hybridization can be conveniently
altered. The temperature of the hybridization and washes and the salt concentration during the washes are the simplest
to adjust. As the temperature of the hybridization increases (ie. stringency), it becomes less likely for hybridization to
occur between strands that are nonhomologous, and as a result, background decreases. If the radiolabeled probe is not
completely homologous with the immobilized fragment (as is frequently the case in gene family and interspecies
hybridization experiments), the hybridization temperature must be reduced, and background will increase. The
temperature of the washes affects the intensity of the hybridizing band and the degree of background in a similar
manner. The stringency of the washes is also increased with decreasing salt concentrations.

In general, convenient hybridization temperatures in the presence of 50% formamide are 42°C for a probe which is
95% to 100% homologous to the target fragment, 37°C for 90% to 95% homology, and 32°C for 85% to 90%
homology. For lower homologies, formamide content should be lowered and temperature adjusted accordingly, using
the equation above. If the homology between the probe and the target fragment is not known, the simplest approach
is to start with both hybridization and wash conditions which are nonstringent. If non-specific bands or high
background are observed after autoradiography, the filter can be washed at high stringency and reexposed. If the time
required for exposure makes this approach impractical, several hybridization and/or washing stringencies should be
tested in parallel.
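
The rule of thumb for hybridization temperature in 50% formamide can be expressed as a small lookup. The Python sketch below is illustrative only: the function name is hypothetical, and because the stated ranges share their end points (90% and 95%), it assumes that a boundary value falls in the higher band.

```python
def suggested_hybridization_temp_c(homology_percent):
    """Rule-of-thumb hybridization temperature (deg C) in 50% formamide.

    Returns None below 85% homology, where the text advises lowering the
    formamide content and adjusting temperature using the Tm equation.
    """
    if not 0.0 <= homology_percent <= 100.0:
        raise ValueError("homology must be between 0 and 100 percent")
    # Assumption: a value on a shared boundary (90 or 95) goes to the higher band.
    if homology_percent >= 95.0:
        return 42
    if homology_percent >= 90.0:
        return 37
    if homology_percent >= 85.0:
        return 32
    return None


for homology in (98.0, 92.0, 86.0, 70.0):
    print(homology, suggested_hybridization_temp_c(homology))
```

Returning None rather than a number for homologies below 85% keeps the sketch from implying a temperature that the text does not give.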
Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing nucleic acid probes according to 
the invention can determine the presence of cDNA or mRNA. A probe is said to "hybridize" with a sequence of the 
invention if it can form a duplex or double stranded complex, which is stable enough to be detected. 
The nucleic acid probes will hybridize to the Neisseria nucleotide sequences of the invention (including both sense 
and antisense strands). Though many different nucleotide sequences will encode the amino acid sequence, the native 
Neisseria sequence is preferred because it is the actual sequence present in cells. mRNA represents a coding sequence 
and so a probe should be complementary to the coding sequence; single-stranded cDNA is complementary to mRNA, 
and so a cDNA probe should be complementary to the non-coding sequence. 
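
The strand relationships described above can be illustrated with a short Python sketch. It is an illustration only: the helper name reverse_complement and the six-base example sequence are hypothetical and are not taken from the sequence listing.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    if set(dna.upper()) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G and T")
    return dna.translate(_COMPLEMENT)[::-1]


coding_strand = "ATGAAC"  # hypothetical fragment, same sense as the mRNA

# A probe intended to detect mRNA is complementary to the coding sequence.
mrna_probe = reverse_complement(coding_strand)

# Single-stranded cDNA is itself complementary to the mRNA, so a probe for
# it is complementary to the non-coding strand, i.e. it has the coding sense.
cdna_probe = reverse_complement(mrna_probe)

print(mrna_probe, cdna_probe)  # GTTCAT ATGAAC
```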

The probe sequence need not be identical to the Neisseria sequence (or its complement) - some variation in the
sequence and length can lead to increased assay sensitivity if the nucleic acid probe can form a duplex with target
nucleotides, which can be detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the
formed duplex. Additional Neisseria sequence may also be helpful as a label to detect the formed duplex. For
example, a non-complementary nucleotide sequence may be attached to the 5' end of the probe, with the remainder of
the probe sequence being complementary to a Neisseria sequence. Alternatively, non-complementary bases or longer
sequences can be interspersed into the probe, provided that the probe sequence has sufficient complementarity with
a Neisseria sequence in order to hybridize therewith and thereby form a duplex which can be detected.

The exact length and sequence of the probe will depend on the hybridization conditions (e.g. temperature, salt
condition etc.). For example, for diagnostic applications, depending on the complexity of the analyte sequence, the
nucleic acid probe typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30
nucleotides, although it may be shorter than this. Short primers generally require cooler temperatures to form
sufficiently stable hybrid complexes with the template.

Probes may be produced by synthetic procedures, such as the triester method of Matteucci et al. [J. Am. Chem. Soc.
(1981) 103:3185], or according to Urdea et al. [Proc. Natl. Acad. Sci. USA (1983) 80: 7461], or using commercially
available automated oligonucleotide synthesizers.

The chemical nature of the probe can be selected according to preference. For certain applications, DNA or RNA are
appropriate. For other applications, modifications may be incorporated eg. backbone modifications, such as
phosphorothioates or methylphosphonates, can be used to increase in vivo half-life, alter RNA affinity, increase
nuclease resistance etc. [eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) TIBTECH 15:224-229;
Buchardt et al. (1993) TIBTECH 11:384-386].

Alternatively, the polymerase chain reaction (PCR) is another well-known means for detecting small amounts of
target nucleic acid. The assay is described in Mullis et al. [Meth. Enzymol. (1987) 155:335-350] & US patents
4,683,195 & 4,683,202. Two "primer" nucleotides hybridize with the target nucleic acids and are used to prime the
reaction. The primers can comprise sequence that does not hybridize to the sequence of the amplification target (or its
complement) to aid with duplex stability or, for example, to incorporate a convenient restriction site. Typically, such
sequence will flank the desired Neisseria sequence.

A thermostable polymerase creates copies of target nucleic acids from the primers using the original target nucleic
acids as a template. After a threshold amount of target nucleic acids are generated by the polymerase, they can be
detected by more traditional methods, such as Southern blots. When using the Southern blot method, the labelled
probe will hybridize to the Neisseria sequence (or its complement).

Also, mRNA or cDNA can be detected by traditional blotting techniques described in Sambrook et al [supra].
mRNA, or cDNA generated from mRNA using a polymerase enzyme, can be purified and separated using gel
electrophoresis. The nucleic acids on the gel are then blotted onto a solid support, such as nitrocellulose. The solid
support is exposed to a labelled probe and then washed to remove any unhybridized probe. Next, the duplexes
containing the labeled probe are detected. Typically, the probe is labelled with a radioactive moiety.

BRIEF DESCRIPTION OF DRAWINGS 

There are no drawings.

MODES FOR CARRYING OUT THE INVENTION

The following examples describe nucleic acid sequences which have been identified in N.gonorrhoeae, 
along with their inferred translation products. 

The examples are generally in the following format:
o a nucleotide sequence which has been identified in N.gonorrhoeae. The strain used is FA1090
  [Dempsey et al. (1991) J. Bacteriol. 173:5476-5486]
o the inferred translation product of this sequence
o a computer analysis (e.g. PSORT output) of the translation product, indicating antigenicity
o homologous sequences (where relevant)
o results of expression and purification (where relevant)

These examples typically include details of sequence homology between species and strains. Proteins that 
are similar in sequence are generally similar in both structure and function, and the homology often 
indicates a common evolutionary origin. Comparison with sequences of proteins of known function is 
widely used as a guide for the assignment of putative protein function to a new sequence and has proved 
particularly useful in whole-genome analyses.

Open reading frames (ORFs) within nucleotide sequences were predicted using the GLIMMER program
[Salzberg et al (1998) Nucleic Acids Res 26:544-8]. All predicted open-reading frames longer than 60 aa
were screened against the meningococcus serotype B ('MenB') ORFs (accession NC002183) using the
BLASTP algorithm [Altschul et al. (1990) J. Mol. Biol. 215:403-410]. ORFs were considered to be
gonococcus-specific if they showed sequence identity to a MenB ORF lower than 60% over the whole
protein length, or matched the MenB ORF over less than 30% of the length.
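
The screening rule above can be sketched in Python as follows. This is a minimal sketch, not the procedure actually used: the class and function names are hypothetical, the way identity and matched length are summarised from BLASTP output is an assumption, and treating an ORF with no MenB hit as gonococcus-specific is likewise an assumption.

```python
from dataclasses import dataclass
from typing import Optional

MIN_ORF_LENGTH_AA = 60          # only ORFs longer than this were screened
IDENTITY_CUTOFF_PERCENT = 60.0  # identity over the whole protein length
COVERAGE_CUTOFF = 0.30          # fraction of the ORF length that is matched


@dataclass
class BestMenBHit:
    """Hypothetical summary of the best BLASTP hit against the MenB ORFs."""
    identity_percent: float  # identity taken over the whole protein length
    matched_fraction: float  # share of the gonococcal ORF covered by the match


def is_gonococcus_specific(orf_length_aa: int,
                           hit: Optional[BestMenBHit]) -> Optional[bool]:
    """Apply the rule in the text; None means the ORF was too short to screen."""
    if orf_length_aa <= MIN_ORF_LENGTH_AA:
        return None
    if hit is None:
        # Assumption: an ORF with no MenB hit at all counts as specific.
        return True
    return (hit.identity_percent < IDENTITY_CUTOFF_PERCENT
            or hit.matched_fraction < COVERAGE_CUTOFF)


print(is_gonococcus_specific(200, BestMenBHit(22.0, 0.90)))  # low identity
print(is_gonococcus_specific(200, BestMenBHit(95.0, 0.95)))  # close homolog
```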

Open reading frames are usually shown with a N-terminal methionine. Where this is not the case (e.g. 
SEQ IDs 12, 18, 20, 32, 54, 62, 66, 84, 98, 102, 104, 112, 116, 118, 126, 128, 130, 134, 136, 138, 146, 
152, 162, 186, 228, 238, 240, 278, 280, 288, 290, 298, 300, 308, 314), a non-ATG start codon is present, 
but the N-terminus amino acid will be methionine when translated using this start codon. If an upstream
start codon is used, however, the "native" amino acid will be translated (e.g. if the sequence is expressed 
with N-terminus fusion sequences). Even where the first amino acid is not shown as methionine, the 
invention encompasses sequences in which the first amino acid is methionine. 
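
The effect of a non-ATG start codon on the first residue can be illustrated as follows. This Python sketch assumes the standard genetic code for all internal codons; the function name and the nine-base example sequence are hypothetical and do not correspond to any SEQ ID.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate_orf(dna, initiator_as_met=True):
    """Translate an ORF up to the first stop codon.

    initiator_as_met=True  -- first codon is used as the start codon, so the
                              first residue is methionine whatever the codon.
    initiator_as_met=False -- first codon is read as an internal codon, as
                              when an upstream start (e.g. a fusion) is used.
    """
    dna = dna.upper()
    protein = []
    for index in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[index:index + 3]
        if codon not in CODON_TABLE:
            raise ValueError("unrecognised codon: " + codon)
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if index == 0 and initiator_as_met:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


example = "GTGAAATAA"  # hypothetical ORF beginning with a GTG start codon
print(translate_orf(example))                          # MK
print(translate_orf(example, initiator_as_met=False))  # VK
```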

Various tests can be used to assess the in vivo immunogenicity of the proteins identified in the examples.
For example, the proteins can be expressed recombinantly and used to screen patient sera by immunoblot.
A positive reaction between the protein and patient serum indicates that the patient has previously
mounted an immune response to the protein in question i.e. the protein is an immunogen. This method
can also be used to identify immunodominant proteins.

The recombinant protein can also be conveniently used to prepare antibodies e.g. in a mouse. These can 
be used for direct confirmation that a protein is located on the cell-surface. Labelled antibody (e.g.
fluorescent labelling for FACS) can be incubated with intact bacteria and the presence of label on the 
bacterial surface confirms the location of the protein.

For protein expression of 14 antigens, sequences were amplified using the following primers:

Antigen      Primer  Sequence                                  Restriction site
[illegible]  Fwd     [illegible]-GCCGGTAAAGAGCAATTTAC          NdeI
             Rev     CCCGCTCGAG-AGCCAAGAAGAACCCGTTAT           XhoI
NGS13        Fwd     [illegible]                               [illegible]
             Rev     CCCGCTCGAG-CATGTGCCGTGCGGCGT              XhoI
NGS36        Fwd     [illegible]                               [illegible]
             Rev     [illegible]                               [illegible]
NGS37        Fwd     CGCGGATCCCATATG-GTAGAAGTTAAAGGCGGGG       NdeI
             Rev     [illegible]                               XhoI
NGS38        Fwd     CGCGGATCCCATATG-[illegible]               NdeI
             Rev     CCCGCTCGAG-[illegible]                    XhoI
NGS39        Fwd     CGCGGATCCGCTAGC-AACCAAGAAGGGATTACCG       NheI
             Rev     CCCGCTCGAG-[illegible]                    XhoI
NGS67        Fwd     CGCGGATCCCATATG-CGTGCGCACGGACACG          NdeI
             Rev     CCCGCTCGAG-GGCGGCGAGTTT[illegible]CGC     XhoI
NGS106       Fwd     CGCGGATCCCATATG-GCAAACAGCGGAACGATAG       NdeI
             Rev     [illegible]                               XhoI
NGS115       Fwd     CGCGGATCCCATATG-GGGGGCGGCTCCGGC           NdeI
             Rev     CCCGCTCGAG-[illegible]                    XhoI
NGSΔG115     Fwd     CGCGGATCCCATATG-GATGCCCAATC[illegible]    NdeI
             Rev     [illegible]                               XhoI
NGS118       Fwd     CGCGGATCCCATATG-[illegible]               NdeI
             Rev     [illegible]                               XhoI
NGS122       Fwd     CGCGGATCCCATATG-AACCCGAACGATGCGTTTT       NdeI
             Rev     CCCGCTCGAG-AGGGTAAAAC[?]TATTCAAATCGGCAA   XhoI
NGS144       Fwd     CGCGGATCCCATATG-GC[?]TCTGAAAATTCTGTAGC    NdeI
             Rev     CCCGCTCGAG-GAACACGCT[?]TTCATTACACCCA      XhoI
NGS151       Fwd     CGCGGATCCCATATG-CACGGTATCCATAAGAGCA       NdeI
             Rev     CCCGCTCGAG-[illegible]                    XhoI


Example 1 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 1> which encodes amino acid sequence
<SEQ ID 2; NGS1>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
     Signal Score (-7.5): -5.47
     Possible cleavage site: 36
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
     calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
     count: 0 value: 4.72 threshold: 0.0
     PERIPHERAL Likelihood = 4.72
     modified ALOM score: -1.44
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
     bacterial cytoplasm --- Certainty= 0.326 (Affirmative) < succ>
     bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
     bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
     bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>sp|P45941|YQCF_BACSU HYPOTHETICAL 21.5 KD PROTEIN IN CWLA-CISA INTERGENIC REGION
pir||E69949 hypothetical protein yqcF - Bacillus subtilis
dbj|BAA06963.1| (D32216) ORF95 [Bacillus subtilis]
dbj|BAA12427.1| (D84432) YqcF [Bacillus subtilis]
emb|CAB14528.1| (Z99117) yqcF [Bacillus subtilis]
Length = 192

Score = 35.5 bits (81), Expect = 0.45
Identities = 36/162 (22%), Positives = 77/162 (47%), Gaps = 5/162 (3%)

Query: 19  DSGSQYKLNIAAIPSSPNRDLKTYITLGLSKHDLNYK---SRFEILFVCSLKYDENQIFP 75
           D      ++I ++   P   + +Y TLGLS H +NY+   +   I  V +++   +
Sbjct: 29  DDNKNSSIDILSVSDQPQEGITSYSTLGLSDHSINYEVNGTPLRIEIVAAMESASDIYAN 88

Query: 76 FLRWLAETIIENKKILLRGQVVYLPRSrWS-TKMDALYVSAPFYFDDDFQVCYGEHYNI 134
L A II + G + S+ + T M + PF +++D ++ + N+
Sbjct: 89 VLSTCAFNIINSNFTCAPGVIFKimSMYDQETDMKHIMFW 148

Query: 135 VFPLLVPLYKQEAELVEKKGWNAFEQFLLDNEVGNLSDMNRK 176
+ + + P + + E++EKG + +Q LL+++ ++ D+ R+
Sbjct: 149 TWLMALPISEGELQVAEKHG-SDYLQDLLESKQIDIFDIKRE 189

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
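
The localization report above can also be read programmatically. The following is a minimal Python sketch, not part of the original analysis, that extracts the "Final Results" lines of a PSORT-style report and ranks the predicted compartments by certainty. The function name `parse_final_results` and the regular expression are assumptions made for illustration, and the input is assumed to follow the text layout shown in Example 1.

```python
import re

# Matches one "Final Results" line, e.g.
#   bacterial cytoplasm Certainty= 0.326 (Affirmative) < succ>
# Optional hyphens between the compartment name and "Certainty=" are tolerated.
LINE_RE = re.compile(
    r"^\s*bacterial\s+(?P<site>[a-z ]+?)[\s-]*Certainty=\s*"
    r"(?P<score>\d*\.\d+)\s*\((?P<call>[^)]+)\)"
)


def parse_final_results(text):
    """Return (compartment, certainty, call) tuples, highest certainty first."""
    results = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            results.append(
                (match.group("site"), float(match.group("score")), match.group("call"))
            )
    # sorted() is stable, so ties keep the order in which they were reported.
    return sorted(results, key=lambda item: item[1], reverse=True)
```

Applied to the report of Example 1, the first tuple returned would be the cytoplasm entry with certainty 0.326; lines that do not match the assumed layout are simply skipped.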

Example 2 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 3> which encodes amino acid sequence
<SEQ ID 4; NGS2>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -7.2
Possible cleavage site: 18
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.89 threshold: 0.0
PERIPHERAL Likelihood = 5.89
modified ALOM score: -1.68
Rule: cytoplasmic protein



*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.367 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>emb|CAC01359.1| (AL390975) hypothetical protein SCP8.21 [Streptomyces coelicolor A3(2)]
Length = 198

Score = 37.1 bits (85), Expect = 0.15
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 3/107 (2%)

Query: 73 ETPEHIETLAMLAS ASMHYPDQFQ 1 SLFYFYGQELEY — 130 

+T + + LA+LA++ 6 ++++G P + F L++ P
Sbjct: 88 DTDKVLRPIAVLAASP^ 147

Query: 131 -MDNVRFFWLLPITQTERLFLNTHSVE 176
+P VRF LLP+T E + H L+ ++ G D D +R S
Sbjct: 148 PLDFVRFLPLLFMTPNEAAV^VHGAPALQBRWLNHGTDLRDPSRRS 194

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
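
The alignment summary lines in these examples give identities, positives and gaps as fractions of the aligned length. As an illustration only, the Python sketch below recovers those fractions and the exact percentages from one such line; the name `parse_alignment_stats` and the returned dictionary layout are hypothetical, and the report itself shows each percentage only as a whole number.

```python
import re

# Captures "Identities = 29/107", "Positives = 51/107" and "Gaps = 3/107".
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_alignment_stats(line):
    """Map each statistic name to (count, aligned_length, exact_percentage)."""
    stats = {}
    for name, count, length in FRACTION_RE.findall(line):
        count, length = int(count), int(length)
        stats[name.lower()] = (count, length, 100.0 * count / length)
    return stats
```

For the hit in Example 2 this yields 29 identical and 51 similar positions over an aligned length of 107, with 3 gap positions.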

Example 3 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 5> which encodes amino acid sequence
<SEQ ID 6; NGS3>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.69
Possible cleavage site: 32
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 33
ALOM: Finding transmembrane regions (Klein et al.)
count: 5 value: -10.56 threshold: 0.0
INTEGRAL Likelihood = -10.56 Transmembrane 182 - 198 ( 171 - 201)
INTEGRAL Likelihood = -7.86 Transmembrane 251 - 267 ( 244 - 273)
INTEGRAL Likelihood = -7.48 Transmembrane 142 - 158 ( 136 - 167)
INTEGRAL Likelihood = -6.32 Transmembrane 55 - 71 ( 50 - 82)
INTEGRAL Likelihood = -2.97 Transmembrane 100 - 116 ( 99 - 117)
PERIPHERAL Likelihood = 4.72
modified ALOM score: 2.61
Rule: cytoplasmic membrane protein



*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.522 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>sp|P19845|NOSY_PSEST MEMBRANE PROTEIN NOSY PRECURSOR
pir||S13585 nosY protein precursor - Pseudomonas stutzeri
emb|CAA37717.1| (X53676) nosY [Pseudomonas stutzeri]
Length = 276

Score = 163 bits (413), Expect = 2e-39
Identities = 117/275 (42%), Positives = 174/275 (62%), Gaps = 2/275 (0%)

Query: 1 MNPVWI ITGKEARDSLRNRWNflCiAAVLLLAALALSLGFLGSSPTGSV^ 60 

MNVWIKED LRNRW+IA LL A LA+ + +LG++ +G + + T+ SL+S 
Sbjct: 1 MNQVWNIARKELSDGLRNRWIiLAI SLLFAVLAVG IAWLGAAASGQIX3FTSI PATIASLAS 60 

Query: 61 LSIFLI PLI AMLL STOALIGEIERGTMALLIiSYPIWRNQILAGKFVGHLI ILALATTAGY 120

L+ FL+PLIA+LL+YDA++GE B GT+ LLL+YP+ R QIIi GKFVGH +ILALA G+ 
Sbjct: 61 LATFLMPLIALLLAYPAIVGEDEGGTLMLI^TYP 120 

Query: 121 GLAGITLQLANGGFDIAA-WKPFALL IAASVILGAAFLSMGYLI SAKVKERGTAAGI S IG 179 

G A + + L G ++ + F + 4-S +LG FL+ Y++S KV E+ +AAG+++G 
Sbjct: 121 GCAALAIAIJjVEGVELGMIjFWAFGRFMI S STLLGWVFLAFAYVLSGKVNEKS SAAGLALG 180 

Query: 180 VWLFFWIFDMALLGILVADSKQVIT^^ 239 

VW F V+ +L L+ S+ ++ +LL NPTDIYRL+NL+G+E + G+ 

Sbjct: 181 VW-FLFVLVFDLVLLALLVLSEGKFOT 239 

Query: 240 GLSGQIGLTVPNTLLTAQVLVAni IPLVLAAGIFRKR 274 

Ii + + VL + W+ + L+LA IFR+R 
Sbjct: 240 SLGADLPVPAAVLVOjCLIAWIGVSLLLAYAIFRRR 274 

A homolog (amino acids 226-276) was found in serogroup A N.meningitidis but not in serogroup B, so 
NGS3 protein and nucleic acid are useful for distinguishing between gonococcus and serogroup B 
N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
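
The ALOM lines in this example list each predicted membrane-spanning segment with its likelihood and residue range. As a sketch only, the following Python extracts those segments from the report text and orders them along the sequence. The name `parse_tm_segments` and the dictionary keys are hypothetical, and reading the parenthesized range as an extended region around the core segment is an assumption rather than something stated in the report.

```python
import re

# Matches lines such as:
#   INTEGRAL Likelihood = -10.56 Transmembrane 182 - 198 ( 171 - 201)
TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_tm_segments(text):
    """Return predicted transmembrane segments sorted by start position."""
    segments = []
    for match in TM_RE.finditer(text):
        segments.append(
            {
                "likelihood": float(match.group(1)),
                "core": (int(match.group(2)), int(match.group(3))),
                # Assumed meaning: wider region reported in parentheses.
                "extended": (int(match.group(4)), int(match.group(5))),
            }
        )
    return sorted(segments, key=lambda seg: seg["core"][0])
```

For NGS3 this gives five segments starting at residues 55, 100, 142, 182 and 251, which matches the "count: 5" line of the report.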

Example 4

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 7> which encodes amino acid sequence
<SEQ ID 8; NGS4>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 1.53
Possible cleavage site: 58
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.63 threshold: 0.0
PERIPHERAL Likelihood = 0.63
modified ALOM score: -0.63
Rule: cytoplasmic protein

*** Reasoning Step: 2 

Final Results 

bacterial cytoplasm Certainty= 0.103 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>sp|Q59746|NOSZ_RHIME NITROUS-OXIDE REDUCTASE PRECURSOR (N(2)OR) (N2O REDUCTASE)
gb|AAC44023.1| (U47133) nitrous-oxide reductase [Sinorhizobium meliloti]
prf||2209347B nitrous-oxide reductase [Rhizobium meliloti]
Length = 639

Score = 660 bits (1704), Expect = 0.0
Identities = 344/536 (64%), Positives = 407/536 (75%), Gaps = 23/536 (4%)

Query: 1 MSDEKLEQNGLSRRSFLGTAA- - ASGAGIAGAGLLGLAGCSKDGEQAAANASGAAPVAKA 58 

MS+E+ + L+RR LGT A A+ + G L L+G G A A+A 

Sbjct: 1 MSNEETKMR-LNRRQMLGTTAFMAAAGAVGAGGALTLSG GTATPARA 46

Query: 59 QGESKPGQLSSEVGPGELDQYYGFfcSGGQSGEMRijJ.uii*' bmusu-m-i-r v r mwaii^ 118
n s s ev PGELD+YY P S GQSGE+R++G PSMRE+MRIPVFN SATGWG+
Sbjct: 47 QBTSGS SYEVKPGEliDEYYVFFSSGQSGEIRIVGAI'SMREMMRIPVFNRCSATQWGQ 103

Query: 119 TraSUWI^ITEETRKFIiKDSGI*^ 178
TNES KV+ + ET +FLKD G Y NGDLHHPH SFTD TYDGRY YANDK+N+RV
Sbjct: 104 TNESKKVMTEGLIiPETVEFIiKDQG-GIiYIjNGDIjHH^ 162

Query: 179 CRVRLDVMKMKIIDIPITOSGIHGLRPQRYPKTGYWAN^ 237
CR+RLDVMK DKII +PN +HGIiR Q+YPKTGYVF NGE PV GK + D ++
Sbjct: 163 CRIRLDVMKCDKIICLPNQHTVHGIiKVQKYPKT^ 222

Query: 238 AvmiWBSMBIlWQVLVIXM^ 297
A++TA+DGETME+AVJQV+VDGNLDN DADYQGKY FATCYNSE T+ ++EQDW V
Sbjct: 223 AIFTAVDGETMEVAWQVMVDGNLDNVDADYQGKYC^^ 282

Query: 298 VFDLFAIEEGIKAGDFKEVNGVKMjDGEAEAKSK 357
W ^* +F+LK 1KB + GD+KE+ GV +LDGR S YTRY+ PVPHSPHG N +PDG +++ N
Sbjct: 283 IFNLKRIEEAVAKGDYKEIGGVPVLDGR — KGSPYTRYVFVPNSPHGINTAPDGIHWAN 340

Query: 358 GKLPPTVTVLBVSKLDDIJAGKIKER^^ 417
QKL PTVTV DV K DDLF KI+ RD WAEP+LGLGPI^A+m4<5NAYTTLFIDSQ+
Sbjct: 341 GKLSPTVTVFDVRKFDDIJDDKIQARDaV^ 400

Query: 418 VK^IDDAIKAYKGEKlDPIKQKLDVHYQPGHNHTTMGETOEADGQ>)LVSIiNKFSKDRPL 477
KWNI+DA +AY GEK+DPI+ KLDVHYQPGHNHT+MG+TKEADG+WI>+ SLMKPSKDR+li
Sbjct: 401 CKWNIEDAKRAYAGEKVDPIRHKLDVHYQPGHN^ 460

Query: 478 NAGPLKPECDQLIGISGDEMRLVHDNOT^^ 533
N GPLKPE DQLI ISGDEM LVHDNPTFAEPHD +V ASK+NP W+R DP+F
Sbjct: 461 NVGPLKPENI^LIDISGDEMVLVHDNPTFAEPHDATIVHASKINPVH^ 516

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
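
For orientation, the bit score and Expect value reported for each hit are related by the standard BLAST (Karlin-Altschul) statistics. This is general background rather than something stated in the present analysis; the symbols m (effective query length), n (effective database length), S (raw score) and the scoring-system parameters λ and K are introduced here only for illustration.

```latex
S' = \frac{\lambda S - \ln K}{\ln 2}, \qquad E = m\,n\,2^{-S'}
```

Under this relation each additional bit of score halves the number of chance matches expected, which is consistent with the 660-bit hit in Example 4 being reported with Expect = 0.0 while the 35.5-bit hit in Example 1 has Expect = 0.45.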

Example 5

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 9> which encodes amino acid sequence
<SEQ ID 10; NGS5>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 1.09
Possible cleavage site: 19
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 20
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.43 threshold: 0.0
PERIPHERAL Likelihood = 7.43
modified ALOM score: -1.99
Score for OM-PP discrimination: 4.97
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 4.97
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2 

Outer membrane? Score: 0.496525

Final Results

bacterial outer membrane Certainty= 0.781 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.138 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

The protein was expressed in E.coli as an insoluble 43.56kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
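
In the GvH output of this example, a cleavable signal sequence with a possible cleavage site at residue 19 is followed by "calculated from 20", so the predicted mature form begins at residue 20. The Python sketch below shows that slicing step under stated assumptions: `mature_form` is a hypothetical helper, positions are taken to be 1-based as in the report, and the short sequence in the usage note is invented for illustration and is not SEQ ID 10.

```python
def mature_form(precursor, mature_start):
    """Return the predicted mature sequence.

    precursor    -- full-length amino acid sequence (one-letter codes)
    mature_start -- 1-based first residue of the mature form, i.e. the
                    number given after "calculated from" in the report
    """
    if not 1 <= mature_start <= len(precursor):
        raise ValueError("mature_start lies outside the sequence")
    # Convert the 1-based report position to a 0-based Python index.
    return precursor[mature_start - 1:]
```

With the invented sequence "MKKLLAGSTA" and a mature start of 4, the helper returns "LLAGSTA"; a mature start of 1, as reported for proteins without a predicted signal sequence, returns the sequence unchanged.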

Example 6 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 11> which encodes amino acid sequence 

<SEQ ID 12; NGS6>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.93
Possible cleavage site: 36
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.42 threshold: 0.0
PERIPHERAL Likelihood = 6.42
modified ALOM score: -1.78
Rule: cytoplasmic protein



*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.447 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>pir||F83173 outer membrane protein OprC PA3790 [imported] - Pseudomonas aeruginosa (strain PAO1)
dbj|BAA05664.1| (D28119) outer membrane protein C [Pseudomonas aeruginosa]
gb|AAG07177.1|AE004797_12 (AE004797) outer membrane protein OprC [Pseudomonas aeruginosa]
Length = 723

Score = 77.9 bits (191), Expect = 1e-13
Identities = 58/188 (30%), Positives = 89/188 (46%), Gaps = 13/188 (6%)

Query: 49 VKDLI IFDRAHGQSGTASKDGGI ITRNVDARLFTAQAYARYNFNPHWAAGIKAAYNYGHN 108 

V+D I+F G G++++ NVDAR+ + A Y +W AY +G N 

Sbjct: 546 VQDPILFSYREGMMGS STQ ATNVDARIMGGELGASYQLTGNWKTDASLAYAWGKN 600 



Query: 109 ETDGRPPYQIRPFEAAVQADYKNYPAHGSYNIGAATRFVAKQTRGDFEMAS 168
+D R QI P EA Y+ G ++ G+ R VA Q R D ++GD H 

Sbjct: 601 SSDDRALPQIPPLEARFGLTYE EGDWSAGSLWRVVAPQNRIARDQGNVVGKDFDKS 656 

Query: 169 AKGFTVADWAGVNIKDKYGLRLGVNN^ — SGDHVLALSPS -WYAPGRTY 225 

A GF V + + L GV+N+F+K Y E++ +GD S + V PGRT+

Sbjct: 657 A-GFGVFSLNGAYRVTRNVKLSAGVDNLFD^ 715 

Query: 226 WLSLHAAF 233 
W + +F 
Sbjct: 716 WPKVDFSF 723

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 7 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 13> which encodes amino acid sequence
<SEQ ID 14; NGS7>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 4.94
Possible cleavage site: 26
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 27
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.79 threshold: 0.0
PERIPHERAL Likelihood = 0.79
modified ALOM score: -0.66
Score for OM-PP discrimination: -18.85
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -18.85
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2
Periplasmic space? Score: 1.8846

Final Results

bacterial periplasmic space Certainty= 0.929 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.211 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||D72405 hypothetical protein - Thermotoga maritima (strain MSB8)
^ilSsaSll^OnOSUS (AE001705) hypothetical protein [Thermotoga maritima]
Length = 300

Query: 38 pAMPSVTIAVAALQGKLRKQADVSIiKIWPJSPDQLRA^ 97
^ ^' p P++ V + GK+ DV ++IW++P++ A+S+ +PVGNL+G
Sbjct: 24 PIX3PALIPVVP:miXKIP--TDVKIEIWKN^^ 81

Query: 98 QKVGMVNILTNGITQLVCKGSAI^ 155
v ++ +v + + LV A + L p + D++++ L K +
Sbjct: 82 WIKLVGVHEWKVFYLVASDDATFMWES^ 141

Query: 156 HK-VSITYAATPPEAVGLFPSKGYHAVILPEPMATASLIJCGKTIGINVVHGFDLVKM^ 214
+ V1YAPEVLFS LPEP + L +GK + D K W8+
Sbjct: 142 DRDVKILYAP-PQEIVALFKSGKVKYAALPEPFVSMCUDP^KVV LDFQKEWGK 193

Query: 215 AFDTKPLIPMAGI lANEEYEHRHKAOFDIFI^JDLKMMJWIIiMJRQHAJkKIGKNYIiPAPE 274
IP+AG+ E K + + I< +++ W+ N ++ 1 t
Sbjct: 194 ELGVPGRIPIAGLFVRE GVDKETVEKVEKALIDSIRWMKENLDETVQLSSEKIjGIPA 250

Query: 275 PALVMGLDGARLTVSKGSEVKNEILKFVE1IiMQFHPEIiI/3GKIiPDNGFP 323
w L L+ + 4- E+ F + L + PE K+PD GF+
Sbjct: 251 KIIiKSSLERIEFEVVPVEKCREEVETFLKKLNELYPEGF-EKIPDEGFY 298

The protein was expressed in E.coli as an insoluble 32.89kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 8 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 15> which encodes amino acid sequence 
<SEQ ID 16; NGS8>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 2.39
Possible cleavage site: 15
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 16
ALOM: Finding transmembrane regions (Klein et al.)
count: 4 value: -8.23 threshold: 0.0
INTEGRAL Likelihood = -8.23 Transmembrane 49 - 65 ( 41 - 73)
INTEGRAL Likelihood = -7.38 Transmembrane 83 - 99 ( 75 - 106)
INTEGRAL Likelihood = -7.06 Transmembrane 110 - 126 ( 100 - 133)
INTEGRAL Likelihood = -4.41 Transmembrane 164 - 180 ( 163 - 187)
PERIPHERAL Likelihood = 5.89
modified ALOM score: 2.15
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2 

Final Results

bacterial inner membrane Certainty= 0.429 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>sp|P38044|NRTB_SYNP7 NITRATE TRANSPORT PERMEASE PROTEIN NRTB
pir||S30892 nitrate transport protein nrtB - Synechococcus sp. (strain PCC 7942)
emb|CAA43810.1| (X61625) nitrate transporter [Synechococcus sp.]
prf||1908370A nitrate transporter [Synechococcus sp.]
Length = 279

Score = 67.5 bits (164), Expect = 1e-10
Identities = 54/202 (26%), Positives = 96/202 (46%), Gaps = 7/202 (3%)

Query: 4 VALWAWGSAVFGEFMLPAPVEVFQKSL — DLLKHFQEN EIGISLWRSWGISVAL 56 

+A+W SA+ G+ LP P+ V + +++ F +N +G+ + S+ +++ 

Sbjct: 36 LAIWQVI SAILGQDRL PGP INWANTWMPYIVEPFFDNGGT SKGLGLQILI SLQRVAIGY 95 

Query: 57 I AGLAAGLVAGLVAGSFKTAMALLKPVITILIiAMP PI IWVVMALFWFGFGNPSVLFTI IV 116 

+ G++ G V G K L FVT +L +PP+ W ++L F N S +F I + 

Sbjct: 96 LLAACTGILVGGVLGMSKFLGKGLD^ 155 

Query: 117 LVAPLTFAS AAVGMASVNKQHEELFDAYKLGRLKKIRYLYI PHLTGYVI SSVGVAVAMGV 176 

+ AVG+ + + + KL+ I+IP YV+ + +AV + 
Sbjct: 156 TAIWPI IINTAVGINQI PDDYNNVARVLKLSKKDYILNIL IPSTVPYVFAGLRIAVGLAW 215 

Query: 177 KAVTMAELLGASKGVGARIADA 198 

A++ AE+L A G+G I DA 
Sbjct: 216 LAXVAAEMLKADGGIGYFIWDA 237 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 9

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 17> which encodes amino acid sequence
<SEQ ID 18; NGS9>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.07
Possible cleavage site: 29
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.81 threshold: 0.0
INTEGRAL Likelihood = -1.81 Transmembrane 97 - 113 ( 96 - 113)
PERIPHERAL Likelihood = 4.24
modified ALOM score: 0.86
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2 

Final Results

bacterial inner membrane Certainty= 0.172 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>sp|P97027|SSUB_BACSU PUTATIVE ALIPHATIC SULFONATES TRANSPORT ATP-BINDING PROTEIN SSUB
pir||G69816 nitrate ABC transporter (binding protein) homolog ygaL - B.subtilis
emb|CAB07520.1| (Z93102) hypothetical 30.6 kd protein [Bacillus subtilis]
emb|CAB12711.1| (Z99108) similar to nitrate ABC transporter (binding protein) [Bacillus subtilis]
Length = 274

Score = 99.5 bits (247), Expect = 3e-20
Identities = 68/181 (37%), Positives = 102/181 (55%), Gaps = 9/181 (4%)

Query 4 LFGPSGCGKTTVLRLIAGLETPKSGTIRNTFH KTGFLFQENRLPENLTAMQNI 56 

L GPSGCGK+T+L++IAGL++ G++ + GF+FQE+RL LT QNI 

Sbjct: 56 LIGPSGCGKSTLLKIIAGLDSEYTC 115 

Query- 57 A — IFMDNPDEGEIVALAAKVGLTAGDLNKYPTELSGGMA 114 

A + + +P + V ++ G YP ELSGGM++RVA R LL ++ LLDE 
Sbjct: 116 AADLNLKDPKVKQKVDELM 175 

Query 115 PFVGLDRDLRD^VAMLVEKIERQG!^^ 174 

PF LD R L +L++ ++ ILVTHD E+ L +E+ +L AK + ++ 
Sbjct: 176 PFGALDAFTRKHLQDVLLDIWRKK^ 235 

Query: 175 L 175 
+ 

Sbjct: 236 I 236 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 10 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 19> which encodes amino acid sequence 
<SEQ ID 20; NGS10>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 2.27
Possible cleavage site: 26
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.14 threshold: 0.0
PERIPHERAL Likelihood = 5.14
modified ALOM score: -1.53

*** Reasoning Step: 2 

imb2 HYPID: 2 CFP: 0.1 

Final Results 

bacterial inner membrane Certainty= 0.100 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||A82615 surface protein XF1981 [imported] - Xylella fastidiosa (strain 
9a5c) 

gb|AAF84783.1|AE004017_6 (AE004017) surface protein [Xylella fastidiosa]
Length = 1190

Score = 50.2 bits (119), Expect = 2e-05
Identities = 59/210 (28%), Positives = 92/210 (43%), Gaps = 5/210 (2%)

Query: 17 S IGTSAEANAPGALALGGSSEASKKFSIAEGYLASSDGYGAIAIGSAAKI -KQLEKGTIN 75 

++G A+A GA A+G + ASKSAG A + G++A+G AK +■ + 
Sbjct: 876 AVGVGTLASAEGATAVGSGAAASGKGSTAIGRNAVAS AIXS SYTG 935 

Query: 76 HIVGNDNKGLYTOAIXSNVTKITVRTESEKDILSRYGQTyGAVALG S 131 

GN++G++KRTS L + N + 

Sbjct: 936 KYSGLQNNTVGWSVGDASKGETRTVSlWAD^ 995 

Query: 132 SFGAFSTATAIE SLAVGDS SQSTGYRSATFGSHSRALAEESLALGYETRANAYGSVALGA 191 

S T +SL + +G + G+A +S+A+G + A+A +VA+G 

Sbjct: 996 SLSEGQTFVKVNSLNN SATPIAAGVDATAIGVGATASGADS IAMGNKASASADNAVAIGN 1055 

Query: 192 ESVANEENTVSVSSDTLKRKIVNVADGTED 221 

SVA+ NTVSV S +R++ NVA GT D 
Sbjct: 1056 HSVADRANTVSVGSAGSERQVTNVAAGTAD 1085 

>sp|P10858|YADA_YERPS INVASIN PRECURSOR (OUTER MEMBRANE ADHESIN)
pir||S04534 invasin precursor - Yersinia pseudotuberculosis plasmid pIB1
emb|CAA32088.1| (X13883) Yop1 preprotein (AA 1 - 434) [Yersinia pseudotuberculosis]
prf||1411295A invasin [Yersinia pseudotuberculosis]
Length = 434

Score = 42.1 bits (98), Expect = 0.006
Identities = 35/134 (26%), Positives = 68/134 (50%), Gaps = 28/134 (20%)

Query: 116 AVALGFRSSSHNLFASSFGAFSTATAIESLAVGDSSQSTGYRSATFGSHSRA 167 

++A+G + + A + G+ S AT + S+A+G S++ G + T-^G+ S A 
Sbjct: 107 S IAIGATAEAAKPAAVAVGSGSXATGVN SVAIGPLSKALGDSAVTYGAS STAQKDGVAIG 166 

Query: 168 LAEESLALGYETRANAYGSVALGA ESVANEENTVSVSSDT 207 

++ +A+G+ ++ +A SVA4G S + EN+VS+ ++ 

Sbjct: 167 ARASASDTGVAVGFNSKVDAQNSVAIGHS SHVAADHGY S IAXGDHSKTDRENSVS IGHES 226 



Query: 208 LKRKIVNVADGTED 221 
L R++ ++A GTED
Sbjct: 227 LNRQLTHLAAGTED 240

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 11

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 21> which encodes amino acid sequence
<SEQ ID 22; NGS11>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.16
Possible cleavage site: 60
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.67 threshold: 0.0
PERIPHERAL Likelihood = 4.67
modified ALOM score: -1.43
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.297 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>sp|P10858|YADA_YERPS INVASIN PRECURSOR (OUTER MEMBRANE ADHESIN)
pir||S04534 invasin precursor - Yersinia pseudotuberculosis plasmid pIB1
emb|CAA32088.1| (X13883) Yop1 preprotein (AA 1 - 434) [Yersinia pseudotuberculosis]
prf||1411295A invasin [Yersinia pseudotuberculosis]
Length = 434

Score = 41.3 bits (96), Expect = 0.007
Identities = 27/71 (38%), Positives = 48/71 (67%), Gaps = 4/71 (5%)



Query: 16 QLNRLSKRTNKVGASAAAI^ 74
+L++L KR ++ AS+AAL SL +P +GK + F+ Q G Y+++QA+A+G+ ++ E+
Sbjct: 353 RLDKLDKRVDKGLAS SAALNSLFQPYGVGKVN FTAGVGGYRSSQALAIGSGYRVNES 409

Query: 75 VLLNVAGSFAG 85
V L ++AG
Sbjct: 410 VALKAGVAYAG 420

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 12 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 23> which encodes amino acid sequence

<SEQ ID 24; NGS12>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.29
Possible cleavage site: 61
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 62
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.36 threshold: 0.0
PERIPHERAL Likelihood = 6.36
modified ALOM score: -1.77
Score for OM-PP discrimination: 6.03
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 6.03
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2 

Outer membrane? Score: 0.602784 

Final Results

bacterial outer membrane Certainty= 0.867 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.158 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 13

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 25> which encodes amino acid sequence
<SEQ ID 26; NGS13>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.64
Possible cleavage site: 51
>>> May be a lipoprotein
Amino Acid Composition of Predicted Mature Form:
calculated from 21
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.01 threshold: 0.0
INTEGRAL Likelihood = -1.01 Transmembrane 36 - 52 ( 36 - 52)
PERIPHERAL Likelihood = 5.14
modified ALOM score: 0.70
Rule: inner or outer membrane protein
Rule: inner or outer membrane protein
Rule: cytoplasmic membrane protein



*** Reasoning Step: 2

Lipoprotein?
Inner membrane?

Final Results

bacterial outer membrane Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.742 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>gb|AAC33455.1| (AF067083) outer membrane protein homolog [Vitreoscilla sp.]
Length = 217

Score = 236 bits (602), Expect = 2e-61
Identities = 134/217 (61%), Positives = 166/217 (75%)

Query 1 MTFFKPSTWLTASAI^ 60 

M +K -H-+ T +A+ALS C DP+TGQ N + + LGGAA CGIVGALTH KGAR 
Sbjct: 1 MKAWKKFSLMAWAAVALSACATDPMTGQV^ 60 

Query: 61 NSALACC^IGAGVGGYMDYQEQRI^ 120

NSALACGAIGAGVG YMD+QB++LRQ+LA TQ+B+ R G++IRLVMPBS+TFATGS L 
Sbjct: 61 NSMACGMGAGVGAVMDHQERBIi^ 120 

Query: 121 GSAQYAIOTAAQTLVQYPDTTLTIN^^ 180 

SA +LN+ + L QY DTT+ I GHTD+TGSDA+N PLS++RA AVA YL +R VA++ 
Sbjct: 121 S SASTSLNSVSSVLAQYTDTT INIVGHTDSTGSDAINEPL SRNRASAVANYLVSRNVASN 180 

Query: 181 BIjTVYGYGSHMPVASNATVEGRAQNRRVEIIjINPDQR 217 

r+T G G PVASN TV GRA+NRKVEI +NP QR 
Sbjct: 181 RITTVGAGCRQFVASNNTVAGRAENRRVEITVNPIQR 217 

>gb|AAD40344.1|U88088_22 (U88088) OmpA [Pseudomonas alcaligenes]
Length = 220

Score = 130 bits (328), Expect = 1e-29

Identities = 90/219 (41%), Positives = 127/219 (57%), Gaps = 6/219 (2%) 

Query- 7 STWLTASALALSGCVA DPVTGQQSPNKSAMYGLGGAAVCGIVGALTHSGKGARNSA 63 

S + + L+GC + + T + + A L GA ++G + +GA A 

Sbjct: 3 SVTAASLVIFTLTGCASIQNEDGTTKNTALYGA^ 61 

Query- 64 lACGAIGAGVGGYMDYQEQRLRQNIJ^^ 123 

G++GAG G Y D QE LR+ + G+ ++++RQG++I +VMP ++TFATG A + + 
Sbjct: 62 AVAGSLGAGYGYYADKQEAELREQMKGSGVQVERQGDEIVXVMPGAITFATGKAEIQPOT 121 I 

Query- 124 QYALJTOAQTLVQYPDTTLTra^ 183 

LN A + YPD+ L + GHTD+ GS N LSQ RAQ+VA +L+ GV R+ 
Sbjct: 122 ANTLNQLAGSFRNYPDSRLinm^DSVG 181 

Query: 184 WGYGSHMPVASNATVEGRAQNRRVEILINPDQRAVNAA 222 

V G G + PVASNAT EGRAQNRRVEI + P RAV A 
Sbjct: 182 VIGAGPNQPVASNATAEGRAQNRRVEIKLAP — RAVQQA 218 

The protein was expressed in E.coli as a soluble 22.55kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 14 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 27> which encodes amino acid sequence 
<SEQ ID 28; NGS14>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.32
Possible cleavage site: 40
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.39 threshold: 0.0
PERIPHERAL Likelihood = 3.39
modified ALOM score: -1.18
Rule: cytoplasmic protein

*** Reasoning Step: 2 

Final Results

bacterial cytoplasm Certainty= 0.254 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 15

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 29> which encodes amino acid sequence 

<SEQ ID 30; NGS15>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.75
Possible cleavage site: 45
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.89 threshold: 0.0
PERIPHERAL Likelihood = 5.89
modified ALOM score: -1.68
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.232 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>sp|P10487|RCI1_ECOLI SHUFFLON-SPECIFIC DNA RECOMBINASE
pir||S03815 probable integrase - Escherichia coli
dbj|BAA77989.1| (AB027308) shufflon-specific DNA recombinase [Plasmid R64]
Length = 384

Score = 67.1 bits (163), Expect = 3e-10
Identities = 75/301 (24%), Positives = 125/301 (40%), Gaps = 34/301 (11%)

Query: 68 KVKMMTLSEAMRKYLNETLGAGRSKKMGL RFLMEFPIGGIGIDKLKRSDFAEHVMQR 124 

+++ M+LS A+ KYL + + . + +PI D A + R 

Sbjct: 5 RIRKMSLSRALDKYLKWSVHKKGHQQEFYRSNVIKRYPIALRNMDE 64

Query: 125 RRGI PELDIAP IAASTALQELQYIRSVLKHAFYVWGLE IGWQELDFAANGLKRSNMVAKS 184 

I PI +T EL + S+ A WG N ++ S 

Sbjct: 65 LAEINPRTGKPITGNTVRLELALLSSLFNIARVEWG TCRTNPVELVRKPKVS 116 



Query: 185 AIRDRLPTTEELQTLTTYFLRQWQSRKSSIPMHLIMWLAIYTS^ 244 

+ RDR T+ E + L+ YF R+ ++ +++I LA+ T+ RQ EI L W 
Sbjct: 117 SGRDRRLTSSEERRLSRYF — REKNLMLYVIFHLALETAMRQGEILAL RWEHI 167 



Query: 245 DCTRPVTOLKNPNGSTGNNKEFT)ILPMALFVIDELPEE 304
D V L P G++++ + A + +P + ++ 

Sbjct: 168 DLRHGVAHL — PETKNGHSRDVPLSRRARNFLQMMP VNLHGNVFDYTAS 214 

Query: 305 SVSAAWTRACKVIX5IKDLRFHDLRHEAATRMAEDG-FT I PQMQRVTLHDGWNSLQRYVSVR 364 
AW A + L I+DL FHDLRHEA +R EG + ++ ++ H N L+RY +R
Sbjct: 215 GFKNAWRIATQRLRI EDLHFHDLRHEAI SRFFELGSLNVMEIAAI SGHRSMNMLKRYTHLR 275

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 16

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 31> which encodes amino acid sequence
<SEQ ID 32; NGS16>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.64
Possible cleavage site: 20
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.67 threshold: 0.0
PERIPHERAL Likelihood = 4.67
modified ALOM score: -1.43
Rule: cytoplasmic protein

*** Reasoning Step: 2 

20 Final Results 

bacterial cytoplasm Certainty* 0 .262 (Affirmative) < succ> 

bacterial periplasmic space Certainty* 0.000 (Not Clear) < suco 

bacterial outer membrane Certainty* 0.000 (Not Clear) < suco 

25 bacterial inner membrane Certainty* 0.000 (Not Clear) < succ> 

The protein has homology with the following sequences in the databases: 

>sp|P10484|T1M1_ECOLI TYPE I RESTRICTION ENZYME ECOR124II M PROTEIN (M.ECOR124II)
pir||S02166 type I site-specific deoxyribonuclease (EC 3.1.21.3) EcoR124/3
chain hsdM - Escherichia coli plasmid R124/3

emb|CAA31541.1| (X13145) hsdM protein (AA 1-520) [Escherichia coli]
Length = 520

Score = 44.4 bits (104), Expect = 0.002

Identities = 65/235 (27%), Positives = 99/235 (41%), Gaps = 55/235 (23%)

Query: 107 NRKKAGGYAEYITGGSLRRLVAAKVRRYCGEHPGOTDGAAGSG QLEQYIEPS 158 

N K+GG E+ T + +L+A ++D AAGSG Q + +1 

Sbjct: 191 NAGKSGG — EFFTPQHVSKLIAQLAMHGQTHVNKIYDPAAGSGSLLLQAKKQFDNHIIEE 248 

Query: 159 DFRAV^IQAEACKALLQNYPAAKVYNTSLFL YTDGBPQDC 198

F EI N+ + ++FL + D +P D 

Sbjct: 249 GFFGQEI NHTTYNLARMNMFLHNINYDKFDIKLGNTL 298 

Query: 199 TVMNPPFSIKLKDLSEDEKSRIAQEYPWKKSGV ADEIFVLKGLE — NARRFGFF 250 

V NPP+S+K + D+ + I E + +GV AD FVL L +A+

Sbjct: 299 IVSNPPYSVKW — IGSDDPTLINDER-FAPAGVIAPKSKADFAFVLHALNYLSAKGRAAI 355 

Query: 251 ILFPGIAYR-KSEQRFRE-IIGNRLAE — LNRIQNAFEDTPIEVLLLVIDKDKTD 301 
+ FPGI YR +EQ+ R+ ++ N E 4-+ N F T I V +LV+ K KTD 
Sbjct: 356 VCFPGIFYRGGAEQKIRQYLVDNNYVETVTSLAPNLFFGTO 410

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
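
The summary line that precedes each alignment (Identities, Positives, Gaps) can be read mechanically. The following is a minimal sketch, assuming the cleaned "Name = a/b (p%)" layout of that line; the function name parse_blast_stats is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches fields such as "Identities = 65/235" on a BLAST summary line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_blast_stats(line):
    """Return {field name: (count, alignment length)} for one summary line."""
    return {name: (int(a), int(b)) for name, a, b in STATS_RE.findall(line)}


# Summary line reported for NGS16 against the hsdM protein (Example 16).
line = "Identities = 65/235 (27%), Positives = 99/235 (41%), Gaps = 55/235 (23%)"
stats = parse_blast_stats(line)

# Exact fractions recomputed from the counts, for comparison with the
# rounded percentages printed in the listing.
fractions = {name: count / length for name, (count, length) in stats.items()}
```

Working from the raw counts rather than the printed percentages avoids depending on how the percentages were rounded or on character-recognition errors in the percent figures.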

Example 17

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 33> which encodes amino acid sequence
<SEQ ID 34; NGS17>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.29
Possible cleavage site: 16
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.60 threshold: 0.0
PERIPHERAL Likelihood = 2.60
modified ALOM score: -1.02
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.284 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>



The protein has homology with the following sequences in the databases:

>ref|NP_052389.1| translocator YopD [Yersinia enterocolitica]
sp|P37132|YOPD_YEREN YOPD PROTEIN
gb|AAD16812.1| (AF102990) translocator YopD [Yersinia enterocolitica]
Length = 306

Score = 32.1 bits (72), Expect = 8.2

Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 17/93 (18%)

Query: 13 MLAAKRAAKESTRQERAVKRAGTVRN 72 
+L R A+E Q+R ++ T+ AQKE +A M+SGAK+ A+

Sbjct: 89 LLELARKAREMGLQQRDIENKATI ■ - S AQKEQVAEMVSGAKLMIAMAVVS 136 

Query: 73 GIMMRLSLQDMRYACNQELINFAEHIVKQVQRL 105 
GUf S ++ +E+ IVKQ Q L 

Sbjct: 137 GIMAATSTVASAFSIAKEV KIVKQEQIL 164

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 18 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 35> which encodes amino acid sequence
<SEQ ID 36; NGS18>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.56
Possible cleavage site: 38
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.56 threshold: 0.0
PERIPHERAL Likelihood = 4.56
modified ALOM score: -1.41
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.397 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 
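
In the listings above, the "modified ALOM score" tracks the ALOM "value" closely. The figures reported in Examples 16 to 18 are consistent with a simple linear rescaling, sketched below. This relation is an observation fitted to the numbers printed here, not a formula taken from the prediction program's documentation, and the function name modified_alom is hypothetical.

```python
def modified_alom(value):
    """Linear rescaling that reproduces the reported modified ALOM scores.

    Assumption: inferred from the values printed in the examples, not from
    the source code or documentation of the prediction program.
    """
    return -(value + 2.5) / 5.0


# (ALOM value, reported modified ALOM score) for Examples 16, 17 and 18.
reported = [(4.67, -1.43), (2.60, -1.02), (4.56, -1.41)]

# Each reported score agrees with the rescaling to two decimal places.
consistent = all(abs(modified_alom(v) - m) < 0.0051 for v, m in reported)
```

A check of this kind is mainly useful for spotting character-recognition errors: a listing whose two figures disagree under the rescaling probably contains a misread digit.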

Example 19 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 37> which encodes amino acid sequence
<SEQ ID 38; NGS19>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.12
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.49 threshold: 0.0
PERIPHERAL Likelihood = 8.49
modified ALOM score: -2.20
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.250 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>ref|NP_043483.1| orf14 [Bacteriophage HP1]
sp|P51716|YO14_BPHP1 HYPOTHETICAL 14.9 KD PROTEIN IN REP-HOL INTERGENIC REGION
(ORF14)
pir||S69520 hypothetical protein 14 - phage HP1
gb|AAB09199.1| (U24159) orf14 [Bacteriophage HP1]
Length = 133

Score = 73.3 bits (179), Expect = 1e-12

Identities = 44/129 (34%), Positives = 74/129 (57%), Gaps = 6/129 (4%)

Query: 1 MFI PAALHKDEHSAYGVTIPDLPGCFSCGDTVEEAVANARSAAYM^ 60 
M P + K + Y V++PD+PGCFS GDT+ EA+ NA+ A HI+GM+ED + L 
Sbjct: 1 MLYPICIEK-VNIX3YWSVPDVPGCT 57

Query: 61 VSS-IADLSQEPDYHGATWVMIEIDPAKISRQQIRFNVSWPQYLLDRVDEY TSANHET 117

S+ I +P+Y ++++D + + + N++ P LL R+D++ T ++

Sbjct: 58 KSNPIEQYINQPEYKDFTVTVVBTOLTHLM^ * 17


Query: 118 RSGFLAKAA 126 

RS FL++ A 
Sbjct: 118 RSNFLSQLA 126 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 20 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 39> which encodes amino acid sequence
<SEQ ID 40; NGS20>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.1
Possible cleavage site: 19
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.58 threshold: 0.0
PERIPHERAL Likelihood = 7.58
modified ALOM score: -2.02
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.057 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
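
The "Final Results" table of each example assigns a certainty to every candidate compartment. The following is a minimal sketch of extracting those rows and selecting the highest-scoring compartment. It assumes the lines have already been cleaned to the "Certainty= value (label)" form; the names parse_final_results and top_localization are hypothetical.

```python
import re

# One row of a "Final Results" table, e.g.
# "bacterial cytoplasm Certainty= 0.057 (Affirmative) < succ>"
RESULT_RE = re.compile(
    r"bacterial (.+?)\s+Certainty=\s*([0-9.]+)\s*\((Affirmative|Not Clear)\)"
)


def parse_final_results(text):
    """Return a list of (compartment, certainty) pairs found in the text."""
    return [(m.group(1), float(m.group(2))) for m in RESULT_RE.finditer(text)]


def top_localization(text):
    """Return the (compartment, certainty) pair with the highest certainty."""
    return max(parse_final_results(text), key=lambda row: row[1])


# Final Results reported for NGS20 (Example 20).
sample = """\
bacterial cytoplasm Certainty= 0.057 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
"""

best = top_localization(sample)
```

Note that a low maximum, as in this example, still yields a "best" compartment; any downstream use would need its own threshold for how much certainty is enough.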

Example 21

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 41> which encodes amino acid sequence
<SEQ ID 42; NGS21>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.52
Possible cleavage site: 52
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.83 threshold: 0.0
PERIPHERAL Likelihood = 5.83
modified ALOM score: -1.67
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.311 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>



The protein has homology with the following sequences in the databases: 

>ref|NP_040628.1| cI (repressor; 237) [bacteriophage lambda]
ref|NP_061378.1| phage lambda repressor protein CI [Escherichia coli]
sp|P03034|RPC1_LAMBD REPRESSOR PROTEIN CI
pir||RPBPL repressor protein cI - phage lambda
emb|CAA24991.1| (X00166) coding sequence cI gene [bacteriophage lambda]
gb|AAA96581.1| (J02459) cI (repressor; 237) [bacteriophage lambda]
emb|CAB96428.1| (AJ277653) phage lambda repressor protein CI [Escherichia coli]
Length = 237

Score = 62.5 bits (151), Expect = 5e-09

Identities = 36/85 (42%), Positives = 51/85 (59%)

Query: 2 KKRELNEIETAECAELKRIFNSKKEEIjK^^ 61 
KK+ L+++ LKI+KKEL Ii+Q +A+ +G+ QS V NG NALNA A 
Sbjct: 4 JOOCPLTQEQLEDARRIiKAIYEKK^ 63

Query: 62 SQFAKILQIFVSDFSLRLAEBISSM 86 

+ AKIL++ V +PS +A BI M 
Sbjct: 64 AliliAKIIaKVSVEEFSPSIAREIYEM 88 


Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
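
Each database hit is reported with a bit score and an Expect value. Under the standard Karlin-Altschul statistics used by BLAST, the two are related by E = m x n x 2^(-S'), where S' is the bit score and m and n are the query and database lengths. The sketch below illustrates that relation only. BLAST itself applies effective (edge-corrected) lengths, and the database size used for the searches in these examples is not stated, so the function names and the back-calculated search space are illustrative assumptions.

```python
def expect_value(bit_score, query_len, db_len):
    """Expected number of chance hits scoring at least bit_score.

    Uses E = m * n * 2**(-S'); real BLAST output uses effective lengths,
    so this is an approximation for illustration.
    """
    return query_len * db_len * 2.0 ** (-bit_score)


def implied_search_space(bit_score, expect):
    """Search space m * n implied by a reported (bit score, Expect) pair."""
    return expect * 2.0 ** bit_score


# Pair reported for NGS21 against the lambda repressor (Example 21).
space = implied_search_space(62.5, 5e-09)

# Round trip: the implied search space reproduces the reported Expect value.
round_trip = expect_value(62.5, 1, space)
```

Because the relation is exponential in the bit score, each additional bit of score halves the Expect value for a fixed search space.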

Example 22 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 43> which encodes amino acid sequence
<SEQ ID 44; NGS22>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.6
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.74 threshold: 0.0
PERIPHERAL Likelihood = 7.74
modified ALOM score: -2.05
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.072 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||S30432 hypothetical protein - Streptomyces clavuligerus plasmid pSCL 
Length = 307

Score = 43.6 bits (102), Expect = 0.002

Identities = 25/86 (29%), Positives = 49/86 (56%), Gaps = 2/86 (2%) 

Query: 6 MGMAFKT-GIPRGQKFVLVKLCO^^ 64 
MGM F G+ ++ +L+ + + G C+PS++ L +D G + + V++ + L N

Sbjct: 1 MGMVFAAEGLIX5SKKLLLLGYTNWTDPYGY CWPSEDRLVDDCGTSRSTVQRTKRKLVKKN 60 

Query: 65 FIKSARRQRGR-ERKSDIYRINVAIjL 89 
++S RR+ + E S++ R+N+ LL 
Sbjct: 61 LLRSVRRKNSKGBPISNLSRVNLPLL 86

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 23 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 45> which encodes amino acid sequence
<SEQ ID 46; NGS23>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.8
Possible cleavage site: 59
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.05 threshold: 0.0
PERIPHERAL Likelihood = 0.05
modified ALOM score: -0.51
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.195 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>sp|P07905|DNAC_ECOLI DNA REPLICATION PROTEIN DNAC
pir||XMECNC DNA replication protein dnaC - Escherichia coli (strain K-12)
Length = 245

Score = 110 bits (275), Expect = 2e-23

Identities = 75/224 (33%), Positives = 116/224 (51%), Gaps = 23/224 (10%)

Query: 50 EAADEMAAYAETLRRGAMRDA LEKRIGRSGIAPRFRNCRIENYAV — SDS 97

+ +B+ A+ + +GA+R A +++ RSGI P +NC ENY V 

Sbjct: 24 KTGEELLAWQK — EQGAJRSAALERENRAMKMQRTFNRSGIRPLHQNCSFENYRVECEGQ 81 

Query: 98 IPC^IARAKAAAAEYAANFADVLQTGRSMIFSGRRGTGKN^ 157 
+ +++A+ E+ N A S IFSG+ GTGKNHLA I E++ GKS L+IT

Sbjct: 82 MNALSKARQYVEEFDGNIA - -SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIIT 134 

Query: 158 VGDMLRTVKDSF- -GGGGEAGAVGIFVKPDLLVLDEFGAGSLSETDGRILFSWNARYER 215 
V D++ +KD+F G E + DLLV+DE G + S+ +1+ +V+ R 

Sbjct: 135 VADIMSAMKDTFRNSGTSEEQLLNDLSNVDLLVIDE 194

Query: 216 LMPMLVLTNLTAEAFRENTDARI RDRLRDGGGKL I PFDWE SYRA 259 

P +LTN E + R+ DR+R G + F+W+SYR+ 
Sbjct: 195 KRPTGMLTNSNMEEMTKLLGERVMD^ 238 


Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 24 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 47> which encodes amino acid sequence
<SEQ ID 48; NGS24>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.76
Possible cleavage site: 26
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.43 threshold: 0.0
PERIPHERAL Likelihood = 1.43
modified ALOM score: -0.79
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.112 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>ref|NP_053228.1| pXO2-73 [Bacillus anthracis]
gb|AAF13678.1|AF188935_76 (AF188935) pXO2-73 [Bacillus anthracis]
Length = 541

Score = 125 bits (315), Expect = 9e-28

Identities = 139/535 (25%), Positives = 254/535 (46%), Gaps = 70/535 (13%)

Query: 14 PVIjF IGTGMSLRYIJDNSYTWDGLL SKIAIDLFGDDREYLNIKSRYCEDGRFQYEE IAEEL 73
P LFIG+G S RYL N W GL+ K + +L + BY Y + E+AE +
Sbjct: 19 PFLFlGSGFSKRYL-miEDWAGIiMKKFS-NLMPYEFEY YSSTANKDWAEVAELM 70

Query: 74 QSKFDKVXi — ENDPDGRFKEINDKFFENMRAGNTLSRFKXYISTIjLSQLNYK DNSN 127
F + E + KE D R + S K+ ++ L+ + YK + ++
Sbjct: 71 AKDFHPIWWKEQQFENNRKEFKD RI SSKQSPLKVEVAKYIjNS XEYKYGLDEKND 124

E++ LKK + IITTN+D Ii + XFE + IG ++L S+P +YKIHGC

p +++T DY+ + EK + A+IiL++FI +P+IFLGY++ D+NI++ILK I +

+Q+4- K R L+ E ED E++ +T G T+ I ++KT+++ +IY ALA
Sbjct: 243 DQDNIHKLKDR LIFVERAGQEEDSFET^SSLTIGKITVPITRVKTNDYEKIYNAIiA 298

S +R+ +S E+ ++ +D++ + V++GK +

Query: 358 TSEMMSN YFKIIEEENS QLLKLIDKHSIASTQYFPI — 395
Q+ B+ ++ + +++ +E ++K+ + S QY P+

Y D +++K + K+ F+E++ NS + + + ++ +
Sbjct: 418 YVTjESSVBDELLDSKIKNKLK^ 477

Query: 456 2LSEDEVENYL KNFVNKKN THYKRLLCMFDYKKYADT 499
L+ D+++++L FV +KN T +RL ++D+ KY +

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 25 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 49> which encodes amino acid sequence
<SEQ ID 50; NGS25>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 4.76837e-07
Possible cleavage site: 56
>>> May be a lipoprotein
Amino Acid Composition of Predicted Mature Form:
calculated from 21
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.68 threshold: 0.0
PERIPHERAL Likelihood = 6.68
modified ALOM score: -1.84
Rule: inner or outer membrane protein
Rule: inner or outer membrane protein

*** Reasoning Step: 2

Lipoprotein?
Inner membrane?

Final Results
bacterial outer membrane Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.700 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>gi|7433005|pir||B70313 cytochrome-c peroxidase (EC 1.11.1.5) - Aquifex aeolicus
gi|2982865|gb|AAC06485.1| (AE000675) cytochrome c peroxidase [Aquifex aeolicus]
Length = 355

Score = 345 bits (885), Expect = 7e-94

Identities = 180/336 (53%), Positives = 237/336 (69%), Gaps = 12/336 (3%)



Query: 59 EDQDLLKRAQGVFQPLPTVEEMQKIRPFTEE 118 
+D++LUK A+ F+PLP V B + P T E+VKLG L+Y+PRLSK +SCN+CHNLA

Sbjct: 22 DDKELLKMARQYFKPLPKVAEN^ 80. 

Query: 119 SAGVDNMPTSQGHKGQFGGRNSPTALNAAra 178 
GVDN+PTS GH+ G RN+PT NAA+ +QFWDGRA DVEEQA GP+VNP+EMAN 
Sbjct: 81 RYGVDNLPTS IGHRWAIGPRNAPTVYNAAIHIAQFWDGRAKDVEEQALGP IVNP IEMAN- 139

Query: 179 SQEAAAAKIAKVPEYQEMFKKAFP-EDGAVSFKNITTALGAFER 237 

++E A + +PEY E+FKKAFP E V ++NI A-K3AFERTL+TP+++DE+LKGN 
Sbjct: 140 TEENAVKTLKS IPEYVELFKKAFPNEKDPVKYENIGKAIGAFERTIjMTP 199 



Query: 238 NALSEQERKGVRAFMDNGCIACHNGVNLGGTTFQKFGLVQGPYWK — FIEDP — KR 289 

AL+EQE++G++ F++ GC+ACHNG +GG F KFG++ YWK + P K 

Sbjct: 200 KALTEQEKRGbKTFIEVGCVACHNGPGVGGNMFAKFGMIT-EYWKVTYPY^ 258 



Query: 290 DKGRADVTKKT EDEFFFRVPGLRNVAKTYP YFHNGSVWEI^ IMGKAQLGKD I PKED 349

D GR VTKK ED F F+VP LRN+ TYPYFH+GSVW h+ AV IM K QLGK++ + 
Sbjct: 259 DFGRFGVTKKEEDMFVFKVP SLRNIEHTYPYFHIX3SVWSLEDAVRIMAKTQLGKELTDQQ 318 

Query: 350 VDNrVWLNALSGNVSESARTMPELPLTAPMESKPD 385 
V +IV FIi AL+G + + A +PELP + KP+

Sbjct: 319 VKDIVAFLKALTGKIPKHALEVPELPPSTDKTPKPE 354 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 26 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 51> which encodes amino acid sequence
<SEQ ID 52; NGS26>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 0.610001
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.04 threshold: 0.0
PERIPHERAL Likelihood = 5.04
modified ALOM score: -1.51
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.127 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||T13296 hypothetical protein 8 - Streptococcus phage phi-O1205 
gb|AAC79524.1| (U88974) ORF8 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 157

Score = 62.5 bits (151), Expect = 2e-09
Identities = 53/161 (32%), Positives = 86/161 (52%), Gaps = 8/161 (4%)

Query: 5 TLYRCAADVQAAL0YYFDSETEREDTLEAV — IGQFBVKAQSVIAYIKNQEITEKMLBGH 62 

TLY + + D ET + DTLEA+ +E K + + IK+ E + + 

Sbjct: 3 TL YELTDQLLE IYNMDVDDET -KLDTLEAJCDWTTDYENKVEGYVKVIKSIiEADIEAEKNE 61


Query: 63 IRQMTGKLKAAKARNQSLKDYLARiraQAAGITE 122 

+++ G K+ +++ LKLA+M GT+D FK FRKSEAW+ +E ++P 
Sbjct: 62 KKRLIX3LNKSDQSKIDKLKTALAVSMAETCQTRV- -OTTLFKVGFRKSEAVW-NEEKLP 118 

Query: 123 AEFMREAVKTE PDKTAIRKAIESGRQVAGAKIEGRKNLQIR 163

E+ K PDK +++ ++SG+ + GA +E R+NL IR 

Sbjct: 119 KEYQIATYK — PDKKTLKELLKSGKHIEGATLEERRNLNIR 157 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 27 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 53> which encodes amino acid sequence 

<SEQ ID 54; NGS27>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.45
Possible cleavage site: 49
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.80 threshold: 0.0
PERIPHERAL Likelihood = 1.80
modified ALOM score: -0.86
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.559 (Affirmative) < succ>

motifs:

Subtilase_Asp (S,T,A,I,V)x(L,I,V,M,F)(L,I,V,M)D(D,S,T,A)G(L,I,V,M,F,C)x)
(A)x(L)(I)D(D)G(I)x{2}(D)
79: DDDFL AALIDDGIVFD V

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS27 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 
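
The Subtilase_Asp motif hit reported for NGS27 can be checked by translating the motif into a regular expression. The sketch below is illustrative only. The pattern string is the PROSITE-style subtilase aspartate active-site pattern as best recalled; its tail (x(2,3) followed by D, N or H) is truncated in the listing above and is therefore an assumption, although it agrees with the matched instance shown there. The helper prosite_to_regex is hypothetical and handles only simple elements (single residues, bracketed sets, x, and repeat counts).

```python
import re


def prosite_to_regex(pattern):
    """Translate a simple PROSITE-style pattern into a Python regex string.

    Supports single residues, [..] sets, 'x' wildcards and (n) or (n,m)
    repeat counts. Exclusions and anchors are not handled.
    """
    parts = []
    for element in pattern.split("-"):
        m = re.fullmatch(r"(\[[A-Z]+\]|[A-Zx])(?:\((\d+)(?:,(\d+))?\))?", element)
        if m is None:
            raise ValueError(f"unsupported element: {element}")
        residue, lo, hi = m.groups()
        token = "." if residue == "x" else residue
        if lo is not None:
            token += "{" + lo + ("," + hi if hi else "") + "}"
        parts.append(token)
    return "".join(parts)


# Assumed full form of the motif; the end of the pattern is cut off in the source.
SUBTILASE_ASP = "[STAIV]-x-[LIVMF]-[LIVM]-D-[DSTA]-G-[LIVMFC]-x(2,3)-[DNH]"
regex = re.compile(prosite_to_regex(SUBTILASE_ASP))

# Residues shown in the motif hit for NGS27, with the spaces removed.
fragment = "DDDFLAALIDDGIVFDV"
hit = regex.search(fragment)
```

Here hit.group() returns the matched stretch and hit.start() its offset within the fragment; the offset within the full protein would need the complete sequence of SEQ ID 54.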

Example 28 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 55> which encodes amino acid sequence 
<SEQ ID 56; NGS28>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.19
Possible cleavage site: 61
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 62
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.69 threshold: 0.0
PERIPHERAL Likelihood = 0.69
modified ALOM score: -0.64
Score for OM-PP discrimination: -24.78
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -24.78
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Periplasmic space? Score: 2.47798
Periplasmic space? Score: 2.47798

Final Results
bacterial periplasmic space Certainty= 0.916 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>prf||1306286A mobilization protein B [Escherichia coli]
Length = 529

Score = 34.7 bits (78), Expect = 2.4

Identities = 24/69 (34%), Positives = 31/69 (44%), Gaps = 12/69 (17%) 

Query: 344 QLRARQQEIPVDYARTAVCGRIPFRRHSRPTLRSRT^ 400

+LRA Q++P D++TV PRR+ GA GQGIRD 
Sbjct: 440 RLRAAGQDLPADFVKTTVLDNTPIR GQTGEIKVDVER 490 

Query: 401 RTPNTQRGT 409

R P +RGT 
Sbjct: 491 RGPAGRRGT 499 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS28 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 29 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 57> which encodes amino acid sequence
<SEQ ID 58; NGS29>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.61
Possible cleavage site: 31
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.03 threshold: 0.0
PERIPHERAL Likelihood = 4.03
modified ALOM score: -1.31
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.106 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>emb|CAB83930.1| (AL162753) hypothetical protein NMA0640 [Neisseria meningitidis
Z2491]
Length = 387

Score = 653 bits (1685), Expect = 0.0
Identities = 324/388 (83%), Positives = 351/388 (89%), Gaps = 1/388 (0%)

Query: 1 MNITIAAPYCSLPSEPHFNRFWYLAELLSQSHDVLLITSNFKHYDKSFRRPEDA 60 

MNITI APYCSLPSEP+FNRFWYLAE LSQSHDVLLITS F+HYDKSFRR EDA A S G 
Sbjct: 1 MNITIVAPYCSLPSEPYFNRFWYIiAERLSQSHDVL^ 60 






Query: 61 RLKVMLLEESGY SKNVSI^ROTSHHRFVKHFEKWLENCRPGEQDVVYSAYPL IATNLLLG 120 

RL+V LL+E GY KNVSL RV SH FV++ +WL + 4- EQD+VYSAYPL+ATNLLLG 
Sbjct: 61 RLRVKLLDEPGYRKNVSLARV7VSHRVFVRNL 120 



Query: 121 KHKARLGYKLIVDVQBVWPESFSSVVPFLKKIP 180

KHKARLGYKLIVWQDVWPESFSSVVPFLiKK+PH LLPFASRANRAYR ADAL+AVSQTY 
Sbjct: 121 KHKARLGYKLIVWQDVWPESFSSWPFIjKKTO 180 

Query: 181 LDRAKEANPNVPGEVVYIGADFAAIAPPPRFRSKTVRFFY^ 240 

LDRAKEANPNVPGE VYIG DFAAIA PPRFRSKTVR FYLGTLSY+YDVETVCKGVRKL

Sbjct: 181 LDRAKEANPNVPGETIVYIGTDFAAIA-PPR 239 

Query: 241 LDIXSENVELHiDlGGGPDLDRLKQYACTC 300 
LDDGENVELHIMGGGPDL++LKQY IKFYGY+FY+EMMS+AK CDIAVNAIHS+AMQ 
Sbjct: 240 tJ3IX5ENVEUIB*G^ 299

Query: 301 SITNKLSDYMALQKPILNSQVHDEVAEVLTLLPH^ 360 

S+TNKLSDYMALQKPILNSQ + EV ++L LLPHENYRSGDVD FVQAAK+ILKRK+DPV 
Sbjct: 300 SVTNKLSDYMALQKPILNSQNNAE^ 359 

Query: 361 QSDEIVRRFRHDISYRKIVNLIERLANE 388 

QSDEXVRRFR DISYRKIVNLIERLA+E 
Sbjct: 360 QSDEIVRRFRRDISYRKIVNLIERLAHE 387 

>emb|CAB58324.1| (AL121855) hypothetical protein SCF62.09 [Streptomyces
coelicolor A3(2)]
Length = 407

Score = 54.7 bits (130), Expect = 2e-06

Identities = 57/243 (23%), Positives = 105/243 (42%), Gaps = 24/243 (9%)

Query: 99 RPGEQDWYS AYP — - L IATNLLLGKHKARLGYKL IVDVQDVWPE SFSSWPFLKKI PHN 155 

r G DW++ P L LL RO++D D+PE+S K+ + 

Sbjct: 81 RVGPVDVVHACNPPDLLFLPALWL KRRGARFVFDQHDLIPELYLSRFGRGKDIiLYR 136 

Query: 156 LLPPASRANRAYRYADALVAVSQTYLDRAKEANPNVPG DFAAIAPPPR 210 

+ R YR AD ++A +*+Y DA P+V++ F+PP 

Sbjct: 137 AVCALERWT^-YRAADVVIjATNESYKE^AIRRGGROT 194 

Query: 211 F-RSKTVRFFm5TLSYTm>VETVCKG^ — 265 

R K YLG + V+ + + KLD+ +H +GG D + + + 

Sbjct: 195 LKRGKPHLLCYLGVMGPQDGVDYALRAIA^ 254 

Query: 266 CDGIKFYGYI PYAEMMSVAKGCDIAVNAIHSYAMQSIT — NKLSDYMALQKPILNSQ 320 

+ ++F G IP A+++ D+ ++ + <M- NK+ +YMA+ +PI++ 

Sbjct: 255 LGLDEQVRFTGRI PDADLVRHLSTADVCLS PDPRNPLNDVSTMNKVLEYMAMGRPIVSFD 314 

Query: 321 VHD 323 
+ + 

Sbjct: 315 LRE 317 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS29 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 30 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 59> which encodes amino acid sequence 
<SEQ ID 60; NGS30>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -4.8
Possible cleavage site: 46
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.54 threshold: 0.0
PERIPHERAL Likelihood = 1.54
modified ALOM score: -0.81
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.113 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

fastidiosa (strain 9a5c) 
gb|AAF84279.1|AE003977_2 (AE003977) conserved hypothetical protein [Xylella 
fastidiosa] 

Length = 376 

Score = 73.6 bits (179), Expect = 3e-12 

Identities = 82/354 (23%), Positives = 143/354 (40%), Gaps = 35/354 (9%) 

Query: 1 MKI ILTT SMSGLGGTET ATVRLGRLLKRHGHDI ILASSDG-PFVGEAQASGIRWQFVDFY 59 

MKI+ T + +G GGB R ++ GH + L GP A+ +G+ ++ + 
Sbjct: 1 MKILHTEAATGCGGEEreiYRHMLSMQAQ^ 60 

Query: 60 RGGLAGYLKSTFAYARMLRREQPDIIIXIQMARV^ 119 

G L +L+RE D+++ A AA++ +T++I SLA 

Sbjct: 61 — GPWRVLNGIHTVQHLLQRETFDV^ 116 

Query: 120 TYPKTAKLFDKLGAYI IGNCKHEREKLIRHGFPAGRIAYA YNTPPEFHFRK 170 

K+ 4- L +1 +H R+ LI+ G RI +T PE +++ 

Sbjct: 117 I — KSQLTYTYLPHRVIOTSQHVRDLLIKQGIQPT^ 174

Query: 171 TEK ECAVLGTLSRLDTVRAVHLMLDILKKMVGRN 217 

++ ++G L + +LD + + N + L +AG GE 

Sbjct: 175 LQQTRHWRTELGFNDNDI IVGCVAVIjREAKGHRET J iDAIAPLCQANPRLHLVTAGDGBP 234 

Query: 218 -MDNLKAQAKRLGIEDKVTFIrGGVRDLTGyFKEVDI 276 

M +L A K L +E ++ LG D DI + G LEA 

Sbjct: 235 VMQHLLAHRKTLTLETQIHLLGYRHDAPRLMSGFDIFA LATQKEAAGTVFLEAAQ 289 

Query: 277 YDTPWTYNMAGI SEWITGQTGYCI PFGDDEAFIEAVDTLIKHPELRSQMSKA 330 

P++ + G+ EM+ G + G+ A A+ TL+ + + R MG+A 

Sbjct: 290 AGIPIIATRVGGVPEMLQEGTNAILVTPGNQTA^ 343 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS30 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 31 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 61> which encodes amino acid sequence
<SEQ ID 62; NGS31>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.36
Possible cleavage site: 16
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.50 threshold: 0.0
PERIPHERAL Likelihood = 3.50
modified ALOM score: -1.20
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.299 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>gb|AAB49297.1| (U84350) hypothetical hydroxylase a [Amycolatopsis orientalis]
Length = 491

Score = 111 bits (278), Expect = 1e-23

Identities = 87/269 (32%), Positives = 123/269 (45%), Gaps = 15/269 (5%)

Query: 1 LKNGAAF SWG SRYTEFDF TDKFSIX3PGTVYQVRRAVFDKILIEEAAKQGVEVRFGH 56 

+K G F WG+R + F + K + YQV RA FD IL++ A +GV VR G 

Sbjct: 73 IKRGGTFRWGARPEPWTFHFGISAKMAGSTSHAYQVKRAKFDDI^ 132

Query: 57 GvTAFDNSGDFARLNIETDT-GBSYELTA 114 

V G+ TD G ++E++A+FV+DASG R+ S 

Sbjct: 133 SVNDWEIX3ERVTGARYTDAIX3NAHEVSAI^^ 192 

Query: 115 HFTHIDDNITHPKFDRNK IL ITTHPQHRDVWIWL I PFGDNRC SVGW GTPDKLAGES 171 

F + + P+ IL W W IP D SVG V DK+ G+ 

Sbjct: 193 LFGYFEGGKRLPBPVSGNILSVAFDSG VJFWYIPLSDTLTSVGAVVRREDADKIQGDR 249 



Query: 172 ETVLRKFVYEC PML SEILDKAVWENDFPFRS IQ — -GYSANVKSLHGRHFALLGNAAEFL 228

E L + ECP++SE LA + ++ . YS S L+G+AA F+ 

Sbjct: 250 EKALNTLIAECPLISEYLSNATRVTTGRYGELRVRKDYSYQQDSY^ 309 

Query: 229 DPVFSSGVTIALHSAELAADLLTKQLKGE 257 
DPVFSSGV +A +SA LAA +> L G+

Sbjct: 310 DFVFSSGVHLATYSALLAARS INSVLAGD 338 

A homolog (amino acids 280-341) was found in serogroup A N.meningitidis but not in serogroup B, so 
NGS31 protein and nucleic acid are useful for distinguishing between gonococcus and serogroup B 
N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 32 

A DNA sequence was identified in ^.gonorrhoeae <SEQ ID 63> which encodes amino acid sequence 

35 <SEQ ID 64; NGS32>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.49 

Possible cleavage site: 38 
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.80 threshold: 0.0
PERIPHERAL Likelihood = 7.80
modified ALOM score: -2.06
Rule: cytoplasmic protein



*** Reasoning Step: 2 

Final Results 

bacterial cytoplasm Certainty= 0.278 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

fastidiosa (strain 9a5c) 
gb|AAF83310.1|AE003899_2 (AE003899) phage-related repressor protein [Xylella
fastidiosa]

Length = 143

Score = 87.0 bits (214), Expect = 2e-16
Identities = 40/71 (56%), Positives = 54/71 (75%)

Query: 1 MFSGEQLGQAISEAIIOmNVSQKEVADHFGVKQPSVSGWIK^ 60 

M +GEQLG+AI +A++ K V+ ++A+HFGVK PSV GWIK GRI K+ L L YFSDV 
Sbjct: 1 MLTGEQLGRAIKQAMQIJCGVTPTKMAEHFGVKAPSVYQ 60 

Query: 61 VTPSHFGIETF 71 

V P+H+G+E + 
Sbjct: 61 VGPTHWGLEAW 71 

>sp|P18680|RPC1_BPHK0 26 KD REPRESSOR PROTEIN (REGULATORY PROTEIN CI)
emb|CAA34222.1| (X16093) cI gene product (AA 1-208) [Bacteriophage HK022]
Length = 235

Score = 80.5 bits (197), Expect = 2e-14
Identities = 60/200 (30%), Positives = 99/200 (49%), Gaps = 15/200 (7%)

Query: 22 QKEVADHFGVKQPSVSGWIKNGRIDKKHLDKLIDYFSDVVTPSHF GIETFRV 73 

^ . „ Q + +A V ++S W I +K DK+ S + T + + GI + 

Sbjct: 29 QADLAVRLKVTPKAI SKWFNGESIPRK — DKMESLASVXXSTTAAYIiHGYADDTOITVNHL 86 

Query: 74 I*SNEQSSIRFPRimEATCGAGT-INDH^ l31 

. , +SN+ R IH- +A+ G GT +++ +IE + + AR G + ++VIT 

Sbjct: 87 SRSNDY ^VDVLDVQASAGPGTMVSNBFIEKIRAIEYTTEQARILFNGRPQESVKVIT 144 

Query: 132 ARGDSMEPTIENGDVMPVOTAVEAFTCDGLYI^ 191 

RGDSME TI GD +PVD ++ FDGDG+Y+ Y + KRLQ L +ISDN++ 

Sbjct: 145 VRGDSMEGTINPGDEIFTOVSITCF^ 2 03 

Query: 192 YRTETVRGEDLNAVRI IGRI 211 

Y + + + 1+ ++ 
Sbjct: 204 YDRWYIEEGEEEQLHILAKV 223 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 33 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 65> which encodes amino acid sequence
<SEQ ID 66; NGS33>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): -4.87 

Possible cleavage site: 31 
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 

count: 0 value: 4.88 threshold: 0.0

PERIPHERAL Likelihood = 4.88 
modified ALOM score: -1.48 
Rule: cytoplasmic protein 

*** Reasoning Step: 2 

Final Results 

bacterial cytoplasm Certainty= 0.313 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

gb|AAF31132.1| (AF069529) Gp54 [Bacteriophage HK97]
Length = 273

Score = 47.4 bits (111), Expect = 3e-04
Identities = 33/123 (26%), Positives = 52/123 (41%), Gaps = 20/123 (16%)

Query: 221 NGGLSGKPKNANVPRRRKTHG^LQEIADLYNEVIjGGRLPS 280

NGG G+ K P RRK +■ + + YN +G RliP +N+ RKR + + 
Sbjct: 160 NGGGDGQVK — — PERRKAERIDYESFLNAYNTEVGDI^E'HAVAVNEKRKRRL-KK 214 

Query: 281 LGTAAPNGKVRFGDKETGLiAWFAGFFKKVA — MNPFWMGENQTGFAVGFDWIFKAGNFVK 338 
I* T +G F + R PF+ G+N TG+ FD++ + +

Sbjct: 215 LKTPNVDG FRAYVRAFVHQAKPFYFGDNDTGWTADFDYIiIiREDSIjTG 261 

Query: 339 ILE 341 
+ E 

Sbjct: 262 VRE 264

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 34 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 67> which encodes amino acid sequence
<SEQ ID 68; NGS34>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -5.65 

Possible cleavage site: 50
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.76 threshold: 0.0
PERIPHERAL Likelihood = 3.76
modified ALOM score: -1.25
Rule: cytoplasmic protein



*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.310 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>pir||H82649 hypothetical protein XF1674 XF1569 [imported] - Xylella fastidiosa 
(strain 9a5c)
gb|AAF84378.1|AE003986_8 (AE003986) hypothetical protein [Xylella fastidiosa]
gb|AAF84483.1|AE003993_2 (AE003993) hypothetical protein [Xylella fastidiosa] 
Length = 316 

Score = 167 bits (424), Expect = 2e-40
Identities = 108/308 (35%), Positives = 152/308 (49%), Gaps = 30/308 (9%)

Query: 10 ETSVIRSLSSASLYMFTRRMFYQRRGYWQRANHHA 69 
E +VI++ A FTR F QR+ ++ HH I ++ V G K ++IN+PP 
Sbjct: 10 EQAVIKARCEADHLFFTRYFFKQRQQLRFRVNWHHH^^ 69






Query: 70 YSKTEIAVVNFIAWA^RVPDCEFIHASYSAAIA 128 

SKTE+ +N +A + P F+H SYS LA+ NS R +VQ +EYRA++P ++A 
Sbjct: 70 SSKTELVAINVMARGLALNPYARFLHISYSDDLAI^ 129

+S W AGGV Y G+ G IIIDDP K ++A S+

Sbjct: 130 DAKSKKRWNVWIX5KKAGG^ 188 

Query: 184 VRRQNVIDWFQNTVESRKNSPOTPIILIMQRLHEKDIi^^ 243 
R +TV+SRK SPDTPII+IMQRIi + D G++ GG WE + +

Sbjct: 189 TGRSKANRKLVSTVKSRKASPDTPI IVIMQRIiAQDDPTGFIQSGGFPGAWECIEIPALID 248 

Query: 244 DG TALWPEKHDIETLRRMEQAAPYVFAGQYLQKPAPP 280 

D + WP K + L +E YVF+GQY Q+P+P 

Sbjct: 249 DAYVSRLPEHVQGQVVRDAQDQDGRYSYWF^ 308

Query: 281 DGGTFKPD 288 

GG K D 
Sbjct: 309 GGGIIKGD 316 


Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
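
The "Final Results" block of each localization printout lists one or more candidate compartments with a certainty value, as for NGS34 above. The following Python sketch, which is not part of the original analysis, reads such lines and picks the compartment with the highest certainty. The function names `parse_final_results` and `best_localization` are hypothetical, and the line layout is assumed to be the one shown in these examples.

```python
import re

# Hypothetical pattern for lines such as
#   "bacterial cytoplasm Certainty= 0.310 (Affirmative) < succ>"
_RESULT = re.compile(r"^\s*(.+?)\s+Certainty\s*=\s*([0-9.]+)\s*\(([^)]*)\)")


def parse_final_results(text):
    """Return a list of (compartment, certainty, verdict) tuples."""
    results = []
    for line in text.splitlines():
        match = _RESULT.match(line)
        if match:
            results.append((match.group(1), float(match.group(2)), match.group(3)))
    return results


def best_localization(text):
    """Return the entry with the highest certainty, or None if nothing parsed."""
    results = parse_final_results(text)
    return max(results, key=lambda entry: entry[1]) if results else None


if __name__ == "__main__":
    block = """\
bacterial cytoplasm Certainty= 0.310 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
"""
    print(best_localization(block))
```

Ties are resolved in favour of the first line listed, which is simply the behaviour of Python's `max`.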

Example 35 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 69> which encodes amino acid sequence
<SEQ ID 70; NGS35>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -1.07 
Possible cleavage site: 40 
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.64 threshold: 0.0
PERIPHERAL Likelihood = 1.64
modified ALOM score: -0.83
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.020 (Affirmative) < succ> 

The protein has homology with the following sequences in the databases: 

>ref|NP_047925.1| gp34 [Bacteriophage phi-C31]
emb|CAA07104.1| (AJ006589) gp34 [Bacteriophage phi-C31]
Length = 457

Score = 59.7 bits (143), Expect = 1e-07
Identities = 68/272 (25%), Positives = 117/272 (43%), Gaps = 49/272 (18%)

Query: 226 GYSPVT3QI IM3VNIALKRQVHALEVYTA 285 

G SP+ ++ +AL Q + +++ G++P A+V VPTS + + R+EW SG 

Sbjct: 192 GC S P I SYARES IGLALAAQKY GSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSG 251 


Query: 286 ETAQRRKMRFVPGELSRNFRETKQPPLKDVYDEW^^ 334 

E A+ K+ P E F +T+Q + ++ AR+ F V P 
Sbjct: 252 VDNAHRVALLTEGAKFSKVAMSPDEAQ — FLQTRQFQVPEI ARI FGVPPH- 299 

Query: 335 FVAQVNRSVAETS — REQSLSIXSMGSLKNWVKALIDDVLARYMDMAA — YEFVWKGEESL 390

++ S + S EQ+++ M SL+ W++ + A + FV + + 

Sbjct: 300 LISDATNSTSWSSGLAEQNIAFTMFSLRPWLERIE^ 359 

Query: 391 N PKEQAEIYAIYKNAGILTADEIRAELGKEPLP-GQG QPEPDK 432 

PKE+ E++++ GI + DE+RA PLP G G +PEP+

Sbjct: 360 KRGAPKERMELWSLGLQNGI YS IDEVRAAEDMTPL PDGLGEKYRVPLNLGEVGEEPEPEP 419

Query: 433 QDG RKPEEPPNQGAEKLGKSESPMSEDE 460

P E P-M* B GK 4- ¥ +E 
Sbjct: 420 APAPPAIEPPAEEPDEEPEPEGKPDDEGATEB 451 

A homolog (amino acids 641-700) was found in serogroup A N.meningitidis but not in serogroup B, so
NGS35 protein and nucleic acid are useful for distinguishing between gonococcus and serogroup B 
N.meningitidis. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 36 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 71> which encodes amino acid sequence 
<SEQ ID 72; NGS36>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): 4.3 

Possible cleavage site: 26 
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 27
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.70 threshold: 0.0
PERIPHERAL Likelihood = 1.70
modified ALOM score: -0.84
Score for OM-PP discrimination: 0.02
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 0.02
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2 

Outer membrane? Score: 0.00213559 
Outer membrane? Score: 0.00213559 

Final Results 

bacterial outer membrane Certainty= 0.232 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>pir||D82437 TonB receptor-related protein VCA0625 [imported] - Vibrio cholerae
(group O1 strain N16961)
gb|AAF96526.1| (AB004392) TonB receptor-related protein [Vibrio cholerae]
Length = 784

Score = 103 bits (256), Expect = 2e-20
Identities = 104/427 (24%), Positives = 162/427 (37%), Gaps = 100/427 (23%)

Query: 31 OTEQQKEIiNTinnraGKRS-A^ 89 

NTEQ + T+ VHG+ DQ+ D L++ R + D+ G+ 

Sbjct: 57 NTEQAVD-ETVTVHGQS ILTDQRTRSD LDKVRGIANADIFSGITS 100 

Query: 90 VYNMNTRTAGGAITPNIRGITGKGRI PVTIIXSTEQT^ I 149 

V + N GA+ IRG+ G+GR+P+ IDG+ Q+ GV DR Y+D L S+ 

Sbjct: 101 VQSNNMHNEAGALDIGIRGVQ^ 160 

Query: 150 AVEKSPALTRG — VKSGVGGAMSIRTIEPSDIIPEGRNWGIEVKTEFSGNTVAQKNDLRQ 207 

V K + VGG ++ T+ DII + + +G+ +K 
Sbjct: 161 TV^GATIESSPYASGAVGGVVNATTIXSIKDIIKDDQAFGVVIiK 204

Query: 208 FIGRDYRTLSPIGATT^ 267

V AA+ + PDV Y+ + LDE + F G 

Sbjct: 205 - ARANNHNRTPDVSGDYSEQGQ YALDERGEHSAFKHG 240 

Query: 268 DRQLML SAAFKTDITDGLAAYSHRQKGNYYAGKRGYQSYLNNPI YGADACYDQYPDKS 325
T.MTi ++ + + + AYS R KGN++AGK+GY+ Y P+ G + + S _ 

Sbjct: 241 — SIjMLGI/JYQAESFNTVLAYSKRSKGNHF AGKKGYEEY-QEFVVGQGQEVVNTSFESDS 297 

Query: 326 WREKDII^SSASLVPl^VLFRPGBBIMNSHT^ 385

W K S N +R H +L WY Y D K 

Sbjct: 298 WLFK IiASDTGTAHNADFNYR HHAQKAGEVLMAYWYKS SEDWEC^PYPIX^ 347 

Query: 386 IGFGEIOTLITAWILGFAEQSLNBFVQQAPGIG^ 445
W * + W LG A+ + TY Y ++P ++- W++L A+ 

Sbjct: 348 DRMPQWGIX3TAKVN- TYSANYYYQP-DHPWLNLNAN 381 

Query: 446 MWRVKTD 452 

W + D 
Sbjct: 382 FWYTEAD 388 

Score = 94.7 bits (234), Expect = 5e-18
Identities = 80/290 (27%), Positives = 126/290 (42%), Gaps = 37/290 (12%) 

Query: 929 SYDLADNHRLFARYARMSRFPSLYELTAATGSGGIiYGSETVAEYS LKPEKSTNWBV 984

+Y L + +LF + +R R PSliYB T S V Y+ +KFE++ N EV 

Sbjct: 514 TYALTPSTQLFLKSSRTYRMPSLYETTL SNEVF SYNPYNP IKPEQAWNNEV* 564 

Query: 985 GYNFOTAPHFAKI^QGDLRIjTYYSNKIKNQIOTSN EDGGMIQ YDKAVSK 1033

GF + +++L ++Y+ N IK+ I + GM + YDK 

Sbjct: 565 GVQFMASNSVMDDR^ 624

Query: 1034 GVEI^SIU^SGRFFASFGGraU.KHM^ 1093

GEli+ +FT++CA CGF + 

Sbjct: 625 GWELGAHYQYAWLYTHFAATLYSETKICSVQQA QYAESDTCNSLGFAWGLTPTRI 679 

Query: 1094 QPKYSLTLDVGTRFFNEKI^^ 1153 

PK +L L+VGT+FFN+ L+ G++ +HS + N +A A Y 
Sbjct: 680 P PKQNLYLNVGTKFFNDTLD SGVKV SYHSG — KSNP SDWL&GT AANP ILE I P SDY 732 

Query: 1154 TLLDAYARYRIGKHIDIOTSVTNIANRYYLDPMSSTPVPG 1203

+D Y++Y + + L F++ N+ +RY + P S +P PGRTIT G + 
Sbjct: 733 -TIDLYSQYELNMJTQIiFFAIKMVTEOT 781 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 37 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 73> which encodes amino acid sequence
<SEQ ID 74; NGS37>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): 4.47 

Possible cleavage site: 21
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 22 
ALOM: Finding transmembrane regions (Klein et al.) 

count: 0 value: 7.21 threshold: 0.0

PERIPHERAL Likelihood = 7.21 
modified ALOM score: -1.94 
Score for OM-PP discrimination: 16.42 
Rule: outer membrane or periplasmic protein 
Score for OM-PP discrimination: 16.42 
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Outer membrane? Score: 1.64214
Outer membrane? Score: 1.64214

Final Results

bacterial outer membrane Certainty= 0.938 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>sp|Q03155|AIDA_ECOLI ADHESIN AIDA-I PRECURSOR
pir||S28634 adhesin AIDA-I precursor - Escherichia coli plasmid pIB6
emb|CAA46156.1| (X65022) AIDA-I [Escherichia coli]
Length = 1286

Score = 35.8 bits (81), Expect = 0.67
Identities = 34/138 (24%), Positives = 62/138 (44%), Gaps = 16/138 (11%)

Query: 3 ASQLTLAVLLAAAFGSAYAVEVKGGDSSKGQLIQAAESDFLPFGSGAADIKV^ 62

A L + + + G+A+AV + G SS G + + E+ + G G V++G ++ 

Sbjct: 31 AKOTLLVIAWSTIGNAFAWISGTV^^ 87 

Query: 63 S INLEAGPAQRIRNKYGNAPINGGNQNTNVNGAANSK — GWFSKTRLA- 119 
+N N + G+QN +GA S + G I ++ G S T L+

Sbjct: 88 IVNNGGKTTATTVN S SGSQNVGTSGAT I STIVNSGG IQRVS SGGVASATNLSG 140 

Query: 120 QVWYEKRANNTEVFS 134 

++ A+NT +FS

Sbjct: 141 GAQNIYNLGHASNTVIFS 158

The protein was expressed in E.coli as an insoluble 32.45kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 38

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 75> which encodes amino acid sequence
<SEQ ID 76; NGS38>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): 0.34 
Possible cleavage site: 24
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 25
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.98 threshold: 0.0
PERIPHERAL Likelihood = 3.98
modified ALOM score: -1.30
Score for OM-PP discrimination: 2.87
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 2.87
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Outer membrane? Score: 0.287446
Outer membrane? Score: 0.287446

Final Results

bacterial outer membrane Certainty= 0.607 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>sp|Q03155|AIDA_ECOLI ADHESIN AIDA-I PRECURSOR
pir||S28634 adhesin AIDA-I precursor - Escherichia coli plasmid pIB6
emb|CAA46156.1| (X65022) AIDA-I [Escherichia coli]
Length = 1286

Score = 35.8 bits (81), Expect = 0.67
Identities = 34/138 (24%), Positives = 62/138 (44%), Gaps = 16/138 (11%)

Query: 3 ASQLTLAVLIAAAFGSAYAVEVKGGDSSKGQLIQAAESDFLP 62 

A L + + + G+A+AV + G SS G + + E+ + G G ++ V++G ++ 
Sbjct: 31 AKNTLLVLAWST IGNAFAVNI SGTVS S - GGTVS SGETQIVYSGRGNSNATVNSGG — TQ 87 



Query: 63 S INLEAGPAQRIRNKYGNAP INGGNQNTNVNGAANSRYLQPGDINPIA — GWFSKTRLA- 119
+N N + G+QN +GA S + GI ++ G STL+ 

Sbjct: 88 rVNNGGKTTATTVN SSGSQNVGTSGATISTIVNSGGIQRVSSGGVASATNLSG 140 

Query: 120 QVWYEKRANNTEVFS 134 

++ A+NT +FS

Sbjct: 141 GAQNIYNLGHASNTVIFS 158 

>pir||G81213 conserved hypothetical protein NMB0313 [imported] - Neisseria
meningitidis (group B strain MC58)
gb|AAF40758.1| (AE002388) conserved hypothetical protein [Neisseria meningitidis
MC58]
Length = 488

Score = 84.3 bits (207), Expect = 3e-15
Identities = 111/498 (22%), Positives = 185/498 (36%), Gaps = 35/498 (7%)

Query: 7 LLFLPLCWCLAAPSNDAADERRRLIiDEGSRQTQQYRBSGW — LDTEQARGEVEENDGYI 64 

+L LPL S A+E R D SR + E+ +D E+ G+V E + 

Sbjct: 19 MLLLPLLA SAAYAEETPREPDLRSRPEFRLHEAEVKP IDREKVPGQVREKGKVL 72 






Query: 65 S IGGEI YQVGDTAEELESAIYHALNARQWHKVRQFAARYAKLPRHKPALI 124 

I GE EL A+Y A+ + +R Y + + L A + + 

Sbjct: 73 QIDGETLL — KNPELLSRAMYSAWSNNIAGIRVILPIYLQQAQQDKMLALYAQGILAQA 130 



Query: 125 EGDFRAAGN SFQTALEAE PDNPRLLLEAGRFYAEDNQNKESAAAFEKVLKTD I PAETRPI 184

+G + A + ++ + A+PD P + + E+ QN+ +A F++4- ++P + 

Sbjct: 131 DGRVKEAI SHYREL IAAQPDAPAVRMRLAAALFENRQNEAAADQFDRLKAENLPPQLMEQ 190 

Query: 185 VENYLSELGKRRRWHGQI SLGYGYNSNVNQGNGINQCVWEI AGMCLMERTLPAPTDSTFS 244 
VE Y L +R W N+NQ Q + T P DT

Sbjct: 191 VELYRKALRERDAWKVNGGF SVTREHNINQAPKRQQ YGKWTFPKQVDGTAV 241 

Query: 245 SYSATAEKTVPLKGNHGVQVRG^YGNRYTEKDKDSAAMPDYGYRITC 304 
+Y AEK LK G + GYK + +G+ADR 

Sbjct: 242 NYRLGAEKKWSLKNGWYTTAGGDVSGRVYPGNKK FNDMTAGVSGGIGFADRRK 294

Query: 305 SFSLLPYFEYDFRNRHTHYRAWGADADWSRTLSPHWRINSHAG^ 364 

h + E + GA ++R +P W+ S A + G ++ +D 
Sbjct: 295 DAGLAVFHERRTYGNDAYSYTNGARLYFNRWQTPKWQTLSSA EWGRLKNTRRARSDN 351 


Query: 365 KQYELGAGAEFS ITLKSGLLVNFDAARKAYP-EKSSSSKEYTARLGAY 423 

++ F + + D R+ P ++ + Y R A+ GG+ L+++ 

Sbjct: 352 THLQISNSLVFTRNARQYWMGGLDFYRERNPADRGDNFNRYG 410 

Query: 424 LLY — RRSLYDAASFVSDNK — RRRDKQYIMMAAAGFPQWNIKGVYPELRFRRTIAHSNA 479

L + Y+ F S K RRRDK+ + + KG+ PL SN 

Sbjct: 411 LRLGAAKRHYEKPGFF SGFKGERRRDKELOTSLSLV?HRAIjHFKGITPRLTLSHRETRSND 470 

Query: 480 VYYRYRQNEWLLGFKYRF 497 
V+ Y +N + F F

Sbjct: 471 VFNEYEKNRAFVEFNKTF 488

>pir||C81790 conserved hypothetical protein NMA2174 [imported] - Neisseria
meningitidis (group A strain Z2491)
emb|CAB85386.1| (AL162758) conserved hypothetical protein [Neisseria meningitidis
Z2491]
Length = 490

Score = 84.0 bits (206), Expect = 4e-15
Identities = 111/498 (22%), Positives = 185/498 (36%), Gaps = 35/498 (7%)

Query: 7 IiLFLPLCTVCLAAP SNDAADERRRLIjDEGSRQTQQYRESGW — LDTEQARGEVEENDGYI 64 

+L LPL S A+E R D SR + E+ +D E+ G+V E + 

Sbjct: 21 MLIiLPLLA SAAYAEETPREPDLRSRPEFRIjHEABVKPIDREKVPGQWEKGKVIj 74 



Query: 65 SIGGBIYQVGDTAEELBSAIYHAItf^^ 124

I GE E It A+Y A++ +R Y+ + L A + + 

Sbjct: 75 QIDGETLIj — KNPELLSRAMYSAWSNNIAGIKVnijPIYIjQQAQQDKMLALYA 132 

Query: 125 EGDFRAAGNSFQTALEAE PDNPRLLLEAGRFYAEDNQNKE SAAAFEKVLKTDIPAETRPI 184 
+G + A + ++ + A+PD P + + E+ QN+ +A F+++ ++P +

Sbjct: 133 DGKVKEAI SHYKBLIVAQPDAPAVRMRLAAALFEim^ 192 

Query: 185 VENYLSELGKRRRWHGQISLGYGYNSNVN^ 244 
VE Y L +R W N+NQ Q + T P D T 

Sbjct: 193 VELYRKALRERDAWKVNGGFSVTREHNINQAPKRQQ YGKWTFPKQVDGTAV 243

Query: 245 SYSATAEKTVPIiKGNHGTVQVRGVM 304 

+Y AEK LK G + GYK + + G +AD R 

Sbjct: 244 NYRLGAEKKWSIiKNGWYTTAGGDVSGRVYPGNKK FNDMTAGVSGG IGFADRRK 296 



Query: 305 SFSLLPYFEYDFRNRHTHYRAWGADADWSRTO 364 

Xj E + GA ++R +P W+ S A 4- G ++ +D 
Sbjct: 297 mGIiAVFHERRTYGNDAYSYTNGARLYFNRWQTPKWQTLSSA EWGRLKNTRRARSDN 353 



Query: 365 KQYELGAGAEFS ITIiKSGLLVNFDAARKAYP-EKS S S SKEYTARLGAYRIiF SGGTYLNAV 423

++ F + + D R+ P -M* +■ Y R A+ GG+ 

Sbjct: 354 THLQISNSLVFYRNARQYWMGGLDFYRE^ 412 

Query: 424 IiLY- -RRSIiYDAASFVSDNK- -RRRDKQYIMMAAAGFPQWNIKGVYPELRFRRTIAHSNA 479 
L + Y+ F S K RRRDK+ + + KG+ P L SN

Sbjct: 413 IiRLGAAKRHYEKPGFFSGFKGERRRDKELOT 472 

Query: 480 VYYRYRQNEWIJjGFKYRF 497 
V+ Y +N + F F 
Sbjct: 473 VFNEYEKNRAFVEFNKTF 490

The protein was expressed in E.coli as an insoluble 52.03kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 39

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 77> which encodes amino acid sequence 

<SEQ ID 78; NGS39>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -5.38
Possible cleavage site: 18
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.16 threshold: 0.0
PERIPHERAL Likelihood = 7.16
modified ALOM score: -1.93
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.325 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>ref|NP_052685.1| serine protease EspP [Escherichia coli]
pir||T00317 probable serine proteinase espP, extracellular - Escherichia coli
plasmid pO157
pir||T42120 probable serine proteinase espP, extracellular - Escherichia coli
plasmid pO157
emb|CAA66144.1| (X97542) putative exoprotein-precursor [Escherichia coli]
dbj|BAA31836.1| (AB011549) serine protease EspP [Escherichia coli]
gb|AAC70088.1| (AF074613) putative exoprotein-precursor [Escherichia coli
O157:H7]
Length = 1300

Score = 58.9 bits (141), Expect = 2e-07
Identities = 153/687 (22%), Positives = 248/687 (35%), Gaps = 106/687 (15%)

Query: 194 DLTVENKNTLSDA EFGVYALNTSMVl^SSKDNNEVKSTQVGLYSQDGGS INVDR 247

D +NT +DA Y N ++ +LS DB++G + +VR 

Sbjct: 595 DYVAGMQNTEADAVKQNGNAYKTN^^ 653 

Query: 248 KDNIIEGDAVALVGKGGSQNIRAS RTNLISSKSLGIHAEQAAKIAITG 295

KNI GD A + +NI R N++ S G E IT

Sbjct: 654 VIGDIQASKSNITIGDTTAYIDIjHAGKNITGDGFGFRQNIVRGNSQG ETIiFTGGITA 710 

Query: 296 ASimHASNAAIRSLDKSEVKIO^^ LNYKDDTRITGATV 352 

+TI + A ++ + TI+ N A++ Q G ++ + +TG 

Sbjct: 711 EDSTOTIKDKAKALFSNYVYLIdS^ 769

Query: 353 SDKGLVAIKPLNNTNIVADT IHYKGDVIjAVNKGKVELDF TPNILLAGRLDNFSGLT 408

D LN+ + + + ++A NK V D + +1+ + S L+ 

Sbjct: 770 KDNKFEPSIYLNDASYLI/TDDSAR — LVAKNKASVVGDIHSTKSASIMFGHDESDIiSQLS 827 

Query: 409 DSKHKNLFENYVANLDSKSAGEINFNLAKDAL WTMTGQSWLDKLEGQGTIDFNNDA 464

DKL + D G+NA + W+TGSI*L++++D+ 
Sbjct: 828 DRTSKGIiAIiGLLGGFDVSYRGSVNAPSASATMNNTWWQ 887 

Query: 465 KTSGR — ALHIGELAGANK-FLMHLNKDG IHSDMLYVKKGTSTPQEWVKNLSEVLDSMN 521

+ + jj + ELA +N + M N SD L VKK S ++ L + L 
Sbjct: 888 ANNKKFHTLTVDELATSNSAYAMRTNLS — ESDKLEVKKHLSGENNIL LVDFLQKPT 942 

Query: 522 YGERLRFATVTNSKNEFVNGKKYIDDT^^ 581 
V+ K+ N K T D V

Sbjct: 943 PEKQLNI ELVS APKJXTNENVFKASKQTIGFSDVTPV 978 

Query: 582 TAEKAGDDYVNKTYTDNRQWYLVKQATGNPS -LDTYAKR 639 

+ DD + T++ N K+AT N + S + AF ++ KR 

Sbjct: 979 ITTRETDDKI- -TWSLTGYNTVANKEATRNAAA LFSVDYKAFLNBVNNLNKR 1028

Query: 640 EGERAFSTLDKKEGimRLTHTR\n:^ 699 

G+ ++ + G W R+ S F + ++G D+ K ' G+ I* 

Sbjct: 1029 MGD — LRDINGEAGAWARIMSGTGSASGGFSDNYTHVQVGVDK KHELDGLDLF 1079 


Query: 700 YGHGRTSLWNTFGKD KIRKYELALYNTTQYIDKEGDETG^ 755 

GT++D K+ LY++ D YID + K N A 

Sbjct: 1080 TGFTVTHTDSSASADVFSGKTKSVGAGLYASAMF DSGAYIDLIGKYVHHDNEYTA 1134 

Query: 756 RNHMGQLWGKGKYSNTLFS I STEYGRRKFLDDDKLWRITPQVQLQYSYI»RGTGYRI ~DNG 814

GGYS+ EGR++DWIPQ+LY+G+DG 
Sbjct: 1135 - TFAGL — GTRDYSTHSWYAGAEAGYRYKVTEDA- W- IEPQAELVYGSVSGKQF AWKDQG 1189






Query: 815 INVNLSHA— NSLIGRLGLDWRKFDG 839 

+++++ N LIGR G+DV + F G 

Sbjct: 1190 MHLSMKDKDYNPLIGRTGVDVGKSFSG 1216 

Score = 36.6 bits (83), Expect = 1.2
Identities = 97/412 (23%), Positives = 164/412 (39%), Gaps = 83/412 (20%)

Query: 63 DNIVTMKSGDADM>YVNNSKVLTETPYTC — SB 120 

D V G + ++ SK Y + +G + A+ S V + f N + +B

Sbjct: 163 DKFVVETRGATEGADISLSKQQALERYGVNTO 222 

Query: 121 I SEKSKALNGGFRHIGIYS-W QNAKVB LSAKSDN IVQGG 158 

+S K + LNG F HI +S W N + + ++++ D+ +V G 

Sbjct: 223 VSYKPEMLETCSFVHIDDWSGWLILTNNQFDE 282

Query: 159 IWGLYS— — NNSSISLKGKNNVI SNPKYNVFAYKKAKVDLTVENKNTLSDAEFGVYALN 214 

+WG+Y+ N + K I N K N ++Y VD++ T+ + + + 

Sbjct: 283 VWGI Y1TCANGKNHAAYSKWNQTT IDNLK-NKYSY NVDMSGAQVATIENGK — LTGTG 336 


Query: 215 TSMVNLSSKDNNBVKSTQVGLYSQ DGG S INVDRKDNI IEGDAVALVGKG GS 265 

+ ++ +KD + L S GG + D+K + GD G G GS 

Sbjct: 337 SOTTDIKNKDLIFTGGGDILLKSSFDNGAGGLW 396 

Query: 266 Q NIR-ASRTNL — ISSKSLGIHAEQAAK IAITGASNTIHASNAAIRSLDKS 313

HIR ++ML -I +L + Q + I GA T +N I ,S D 

Sbjct: 397 TVEWITCRYDNKDNLHK^ 

Query: 314 EVKIDGQITIDSNVAN LARQDGSIHLN-YKDDTRITGATVSDKGLV AIK 361 

V+++ + + N A+ G++ IjN Y AT D G V

Sbjct: 454 TVRLNAENALSGGEYNGIFFAKNGGTLDIiNGYNQSFNKIAAT — DSGAVITNTSTKKSIL 511

Query: 362 PLNNTNIVADTIHYKG DVLAVNKGKVELDFTPNILLAGRLDNFSGLT 408

LNNT AD I++ DVL ++ K B ++1- G +D + ++ 

Sbjct: 512 SLNNT ADYIYHGNINGNLDVLQHHETKKB NRRliILDGGVDTTNDIS 557

The protein was expressed in E.coli as an insoluble 95.92kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 



Example 40

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 79> which encodes amino acid sequence 

<SEQ ID 80; NGS40>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -4.18
Possible cleavage site: 17
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.05 threshold: 0.0
PERIPHERAL Likelihood = 7.05
modified ALOM score: -1.91
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.108 (Affirmative) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 41 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 81> which encodes amino acid sequence
<SEQ ID 82; NGS41>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -2.47 
Possible cleavage site: 17
>>> May be a lipoprotein
Amino Acid Composition of Predicted Mature Form:
calculated from 16
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.37 threshold: 0.0
PERIPHERAL Likelihood = 7.37
modified ALOM score: -1.97
Rule: inner or outer membrane protein
Rule: inner or outer membrane protein

*** Reasoning Step: 2

Lipoprotein?
Inner membrane?

Final Results

bacterial outer membrane Certainty= 0.790 (Affirmative) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 42 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 83> which encodes amino acid sequence 
<SEQ ID 84; NGS42>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.2
Possible cleavage site: 14
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.58 threshold: 0.0
PERIPHERAL Likelihood = 6.58
modified ALOM score: -1.82
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.514 (Affirmative) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 43 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 85> which encodes amino acid sequence 

<SEQ ID 86; NGS43>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -4.34 
Possible cleavage site: 39 
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 2 value: -4.78 threshold: 0.0
INTEGRAL Likelihood = -4.78 Transmembrane 1881 - 1897 (1876 - 1898)
INTEGRAL Likelihood = -1.01 Transmembrane 1966 - 1982 (1966 - 1982)
PERIPHERAL Likelihood = 1.91
modified ALOM score: 1.46
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.291 (Affirmative) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
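
In the ALOM printouts of these examples, each "modified ALOM score" appears to follow from the preceding "value" by a simple rule: the value is divided by 5, shifted by 0.5 away from zero, and negated. This relation is inferred only from the pairs printed in this document and is an assumption, not a documented formula. The Python sketch below, with the hypothetical function name `modified_alom`, checks it against several of those pairs.

```python
def modified_alom(value):
    """Assumed relation between the ALOM value and the modified ALOM score.

    Inferred from the printed pairs only: divide by 5, move 0.5 further
    from zero, then negate. Not taken from any program documentation.
    """
    offset = 0.5 if value > 0 else -0.5
    return -(value / 5 + offset)


# (value, printed modified ALOM score) pairs copied from the examples.
PRINTED_PAIRS = [
    (7.80, -2.06),   # NGS32
    (4.88, -1.48),   # NGS33
    (3.76, -1.25),   # NGS34
    (1.64, -0.83),   # NGS35
    (-4.78, 1.46),   # NGS43
    (-1.33, 0.77),   # NGS44
]

if __name__ == "__main__":
    for value, printed in PRINTED_PAIRS:
        computed = modified_alom(value)
        # The printouts show two decimals, so allow half a unit in the last place.
        agrees = abs(computed - printed) <= 0.005 + 1e-9
        print(f"value={value:6.2f} printed={printed:6.2f} computed={computed:7.3f} agrees={agrees}")
```

Under this assumed rule a positive value (no predicted transmembrane segment) gives a negative modified score, and a negative value gives a positive one, which matches the cytoplasmic and membrane rules reported alongside them.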

Example 44 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 87> which encodes amino acid sequence
<SEQ ID 88; NGS44>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -3.49 
Possible cleavage site: 58 
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.33 threshold: 0.0
INTEGRAL Likelihood = -1.33 Transmembrane 141 - 157 (140 - 157)
PERIPHERAL Likelihood = 2.54
modified ALOM score: 0.77
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.153 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 45

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 89> which encodes amino acid sequence
<SEQ ID 90; NGS45>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -4.07
Possible cleavage site: 46
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.07 threshold: 0.0
PERIPHERAL Likelihood = 2.07
modified ALOM score: -0.91
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.333 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>ref|NP_049512.1| putative portal protein [Bacteriophage 933W]
ref|NP_050550.1| hypothetical protein [Bacteriophage VT2-Sa]
gb|AADl545??l|AF125520_52 (AF125520) putative portal protein [Bacteriophage
933W]
dbj|BAA84334.1| (AP000363) hypothetical protein [Bacteriophage VT2-Sa]
dbj|BAA94158.1| (AP000422) portal protein [Escherichia coli O157:H7]
Length = 714

Score = 314 bits (805), Expect = 2e-84
Identities = 213/658 (32%), Positives = 327/658 (49%), Gaps = 22/658 (3%)

Query: 7 ETGVLPDKNGEPLTIG EYRLFVGEMMNQPAVJRAVADKEMDYADGRQLIJNEIiLQKQR 62

ET + KM T + + ++ +QP WR A+K Y DO QL E+LQ +

Sbjct: 4 ETNTMATKNDNGATPRFSQRQLQALC SDIDSQPKWRDAANKACAYYDGDQLP PEVI^VLK 63 

Query: 63 ELGLPPAVENLITPTLLSVQGYEATIRTDWRWATCBTGGRD-VADALNFKL^ 121
U + G P + NLI PT+ V G EA RTD V +D + +A+A+N + A R 

Sbjct: 64 DRGQ EMT IHNL iaPTVIXWIjGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRIiGK 123 

Query: 122 Al^CSI^GQIACGIGWvEVTRNPNPF^PYECGVI^ 181
^ +KA SDA+ QI G+ WVBV RN+PF ++ +RN + WDS + DLSDRWL 

Sbjct: 124 MNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVFWD^ 183

Query: 182 iFJOWniPEFIAQFFPEYAGHFKAMGRGGS^ 240

+RRRW+ + FPG++ EWR +G+LAW 

Sbjct: 184 MRRRWMDTDEAKATFP GMAQyiDYAIDDWRGFVDTTVTEGQPSPIMSAVnSEYQSWDR 240 

Query: 241 SFJSFWFNETTREIAVAEVWYRRWVTA^ 299

JU V + W R + + V+YR + + + GR V FD N A+G V + 

Sbjct: 241 QQNEWLQRERRRVIjLQyVYYRTFERLF 299 

Query: 300 ASVPPJIRRAFWGDLVVRDEPTPYPHQKFPYWFFGFREDNTGIPY 359
« U y * V R+R AH- VG + D P P FP VPF+G+R+D TG PYG + AQD +-H 

Sbjct: 300 GRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKIJKTGEPYGLISRAIPAQDEVN 359 

Ouerv- 360 STNSKLRWGLSJUIRTVRTKGIVDMSDEQFPJUJIARVDADrVIJJKIEAAQPGAR FDVSR 417 
» KL W L A R +■ + +SD IRDILH+ Q FV4- 

60 Sbjct: 360 FRRIKLTWLLQAKRVIMDEDATQLSDNDLMEQIERPDGI IKLNPVRKNQKSVADVFKVEQ 419 

Query- 418 DFELSAQHWQMLQDSRATIRQISGITPSFMGNRGNATSGRQBSIQVEQSNQSLGLVMDNF 477 

DF4-+++Q +Q++Q+S 1+ G+ 4-F+G ATSG S VEQ +L + DN+ 
Sbjct- 420 DF " "~'«"~'^™ ?tWT ' Tn >™^ 479 



Query: 478 RQSRSLVGELLL^IEDLGS-DBQT\TO 536

+ + VG LLLA +++DL WI D + +T+V+N E D L+ND+ 
Sbjct: 480 QFACQQVGRLLLML^ NGELTNDI 534 

Query- 537 QNIRLKVALEDVPSTNSYRSQ^^ 596 

+ +AL V T ++++Q MSB ++ LPP+ QA VL V+L+D+P K + +E+ 
Sbjct: 535 SRIjmilAIAPVQQTPAFI^ 594 

10 Query: 597 IK-EVRVQETPEQI — EARXAQAVQDALAKSGNDIKRRELALKEQRTASEIKEIEARA 651 

I + + ++P+++ E + A Q AL + ++* RE+A ♦ + ++ A A 

Sbjct: 595 nuUU^FKSPDEMTPBBQEVAAQQ 652 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 46 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 91> which encodes amino acid sequence

<SEQ ID 92; NGS46>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.25
Possible cleavage site: 37
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.77 threshold: 0.0
PERIPHERAL Likelihood = 4.77
modified ALOM score: -1.45
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.281 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>sp|P44184|YE10_HAEIN HYPOTHETICAL PROTEIN HI1410
pir||E64028 hypothetical protein HI1410 - Haemophilus influenzae (strain Rd KW20)
gb|AAC23058.1| (U32820) H. influenzae predicted coding region HI1410 [Haemophilus influenzae Rd]
Length = 394



Score = 150 bits (379), Expect = 3e-35
Identities = 75/168 (44%), Positives = 114/168 (67%), Gaps = 2/168 (1%)



Query: 57 REIQKSMRDSVHRLLKDKVAQLGLG 116 
50 ~ '" REIQKS+ DSV ++L D++ L L F+++ +1 G NG+ F F+GL+++ + SIRS 

Sbjct: 3 REIQKSISDSVTQMLAIX2IEMLSLQAFFDVQKTQIIGQNGSR 61 

Query: 117 GIDIVWVEEGHGVSKKSWDVLTPTIRKBGSE 176 
GID+VWVEEG VSK+SWD+L PTIR++GS+I ++ NP D+TY+RF+ P E 
55 Sbjct: 62 G IDWWVEEGENVSKESWDIL I PT IREDG SQ I IVSFNPKNILDDTYQRFVTHP PERCKSV 121 

Query: 177 E INWRDNPWF PEALNRERLKAQRSMNKEDYGNIWEGRPRMVS EG AVYR 224 

+NW+DNP+FP+ I* E R + E Y +++EG P S+ A+ + 

Sbjct: 122 LVlSWQDNPYFPKEL-MEIMEQMRERDYELYRHVYEGEPVADSDIiAI IK 168 



>ref|NP_050979.1| P18 [Bacteriophage APSE-1]
gb|AAF03961.1|AF157835_18 (AF157835) P18 [Bacteriophage APSE-1]
Length = 469

Score = 117 bits (294), Expect = 2e-25
Identities = 72/233 (30%), Positives = 110/233 (46%), Gaps = 13/233 (5%)

Ouery. 17 IiFK p CK YKVMYXX^ 76 

+FKP RKV + R LC RE S+ DS H +V 

Sbjct: 1 MFKFKRIKVYFGGRGGMKTVSFAKIALITASMHKRRFI^ 60 

Ouerv 77 QLGLGHFYEITDFEIRGANGTIiFVFSGIjQSHTVDS IKSFEGIDIVWVEEGHGVSKKSWDV 136 

LGL + + I + I G N ++F + L + + SIKS D+ WVEE VS+KS D 
Sbjct: 61 TLGLQNRFRXLNTTIBGINDSIFKYG 119 

Ouerv- 137 LTPTIRKEGSEIWITLNPDMETDETYRRFIA MPSEDTWLCEINWRDNPW 185 

L PTIRX GSE4-W + NP E Y+RF+ +D ++ ++++ DNPW 

Sbjct: 120 LIPTIRKPGSELWFSFNPAEEDGAVYKRFVKPYKEL^^ 179 

Query: 186 FPEALNRERLKAQRSMNKEDYGOTWEGR^ 238 
* P L + K+R N + + +++ G BA++.E +A + L 

Sbjct: 180 LPWSfcKNDAQKMK^^ 231 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
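
The alignment statistics reported with each database hit follow a fixed textual pattern, so the counts can be read programmatically. The following is a minimal sketch and not part of the original disclosure: the function name `parse_blast_stats` and the regular expression are illustrative assumptions, and the sample line copies the counts reported above for the HI1410 hit.

```python
import re

# Hypothetical pattern: matches "Identities = 75/168", "Positives = 114/168", "Gaps = 2/168".
STAT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_blast_stats(line):
    """Return {statistic name: (count, alignment length)} for one statistics line."""
    return {name: (int(count), int(length)) for name, count, length in STAT_RE.findall(line)}


# Counts taken from the HI1410 alignment in Example 46.
line = "Identities = 75/168 (44%), Positives = 114/168 (67%), Gaps = 2/168 (1%)"
stats = parse_blast_stats(line)

# Fraction of aligned positions that are identical (75 of 168).
identical, aligned = stats["Identities"]
identity_fraction = identical / aligned
```

Working from the raw counts rather than the printed percentages avoids depending on how the reporting program rounded them.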

Example 47 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 93> which encodes amino acid sequence
<SEQ ID 94; NGS47>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.87
Possible cleavage site: 31
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.88 threshold: 0.0
PERIPHERAL Likelihood = 4.88
modified ALOM score: -1.48
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.313 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>ref|NP_037739.1| Gp54 [Bacteriophage HK97]
gb|AAF31132.1| (AF069529) Gp54 [Bacteriophage HK97]
Length = 273 

Score = 47.4 bits (111), Expect = 3e-04 

Identities = 33/123 (26%), Positives = 52/123 (41%), Gaps = 20/123 (16%) 

Query- 242 NGGLSGKPKNANVPRRRKTHGWLQEIADLYNE^IJSGRLP SVQVLNDTRKRAI ANRWCEM 301 

NGG G+ K P RRK + + + YN +G RLP +N+ RKR + + 
Sbjct: 160 NGGGDGQVK PERRKAERIDYESFLNAYNTEVGDRLPHAVAVNEKRKRRL--KKIIPQ 214 

Ouerv 302 LGTAAPNGKVRFGDKETGLAWFAGFFRKVA — MNPFWMGENQTGFAVGFDWIFKAGNFVK 359 

L T +G F + R PF+ G+N TG+ FD++ + + 

Sbjct: 215 LKTPNVDG FRAYVBAFV1IQAKPFYFGDNDTGWTAD 261 
Query: 360 ILB 362
+ E 

Sbjct: 262 VRE 264 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 48 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 95> which encodes amino acid sequence 

<SEQ ID 96; NGS48>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.85
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.16 threshold: 0.0
PERIPHERAL Likelihood = 7.16
modified ALOM score: -1.93
Rule: cytoplasmic protein



*** Reasoning Step: 2
Final Results

bacterial cytoplasm Certainty= 0.379 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>dbj|BAA36059.1| (D90754) Outer membrane protein P.69 precursor [Escherichia coli]
Length = 762

Score = 64.7 bits (156), Expect = 1e-09
Identities = 79/292 (27%), Positives = 121/292 (41%), Gaps = 55/292 (18%)

Query: 3 NGARVmm5DSMLKELDLSEDAQVEFSDNNK FVKVSV SKLKGDGGVFKMYGDXV — 56 

N + W VT++S L L LS V+F+ + F ++V L G+ F M D+V 

Sbjct: 289 JWSVWNVTSNSNLOT^ 346 

40 Query: 57 KGESDKLITRKGSEGTHI IEYMDDAKAKTTGREYLKLVEN^ 112 

+ D L S G H++ + TTG E L +V+ D AS AS + 

Sbjct: 347 GNGVNNKGDLLNI SGS SAGNHVLAIRNQG SEATTGNEVLTWKTT DGAASFSASSQ 402 

Query: 113 LNVRCTEQGGWCFALGESG ASKKVNISTDGKRDF YLYPD 151 

45 " " + E GG+ + + ++G AS V T + PD 

Sbjct: 403 V ELGGYLYDVRKNGTNWELYASGTVPEPTPNPEPTPAPAQPPIVNPDPTPEPAPT 457 

Query: 152 TLTPGAS SSVLFGEALYQLNAVSDETLVQRMGEIHAIX^ 208 

T T A + L YLNV + TL+QRMG++ +D N+W++ GG 

50 Sbjct: 458 PKPTTTADAGGNYL- -NVGYLLNYVENRTLMQRMGDLRNQ SKDGNIWLRSYGGSLDS 512 



Query: 209 SRSDYRVGGYGNRYWGFAGGFNRTGFGDKWIHYKGLMLRHI^SSY 260 

S +* G+ Y G G ++ D Y GL ++• S++AS BY G 
Sbjct: 513 FASG-KLSGFDMGYSGIQFGGDKR-LSDVMPLYVGL YIGSTHASPDYSG 559 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
Example 49

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 97> which encodes amino acid sequence 

<SEQ ID 98; NGS49>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.37
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.93 threshold: 0.0
PERIPHERAL Likelihood = 4.93
modified ALOM score: -1.49
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.355 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11282647|pir||H81959 patch repair protein (EC 3.1.-.-) NMA0429 [imported] - Neisseria meningitidis (group A strain Z2491)
gi|7379179|emb|CAB83728.1| (AL162753) patch repair protein [Neisseria meningitidis Z2491]
Length = 140

Score = 256 bits (628), Expect = 8e-68
Identities = 131/140 (93%), Positives = 132/140 (93%)

35 Query: 1 MTDIFTPSKRSFVMSKIHSKETKPEVLVRKFI^ 60 

MTDIFT SKRSFVM KIHSKETKPEVLVRKFLF QGFRYKRNDKRY GKFDIVL KYKTV 
Sbjct: 1 MTDIFTTSKRSFVMLKIHSKETKPEVLVRKFLFFQGFRYRKND^ 60 

Query: 61 VFIHGCFWHGHSCn^GHIPKSNMDFWLEKlTKNRERDlKNETELEKI 120 
40 VFIHGCFW+GHSCNKGHIPKSN DFWLEKITKN ERDIKNETELEKIGFKVIWWECELK 

Sbjct: 61 WIHGCFWYGHSCl^GHIPKSNTDFWLEKI 120 

Query: 121 NKAICRERLNRLVEEIKDAV 140 
NKAICRERLNRLV EIKDAV 
45 Sbjct: 121 NKAICKERLNRLVRBIKDAV 140 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
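
The "Final Results" block of each localization report lists one candidate compartment per line with a certainty value, and the predicted location is the line with the highest certainty. The following is a minimal sketch of reading such a block, not part of the original disclosure: the pattern `RESULT_RE` and the function `parse_final_results` are hypothetical names, and the sample text is a normalized transcription of the values reported for NGS49.

```python
import re

# Hypothetical pattern for one "Final Results" line; optional dashes before "Certainty=".
RESULT_RE = re.compile(
    r"^(?P<site>.+?)\s+-*\s*Certainty=\s*(?P<score>\d+\.\d+)\s*\((?P<call>[^)]+)\)"
)


def parse_final_results(report):
    """Return a list of (site, certainty, call) tuples, one per matching line."""
    rows = []
    for line in report.splitlines():
        match = RESULT_RE.match(line.strip())
        if match:
            rows.append(
                (match.group("site"), float(match.group("score")), match.group("call"))
            )
    return rows


# Values as reported for NGS49 (Example 49).
report = """\
bacterial cytoplasm Certainty= 0.355 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
"""

rows = parse_final_results(report)
# The predicted compartment is the entry with the highest certainty.
best_site = max(rows, key=lambda row: row[1])[0]
```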

Example 50 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 99> which encodes amino acid sequence

<SEQ ID 100; NGS50>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.6
Possible cleavage site: 50
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.80 threshold: 0.0
PERIPHERAL Likelihood = 7.80
modified ALOM score: -2.06
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.398 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11352963|pir||G81959 conserved hypothetical protein NMA0428 [imported] - Neisseria meningitidis (group A strain Z2491)
gi|7379178|emb|CAB83727.1| (AL162753) conserved hypothetical protein [Neisseria meningitidis Z2491]
Length = 548

Score = 371 bits (954), Expect = e-102
Identities = 189/197 (95%), Positives = 194/197 (97%)

Query: 1 VKGESGVDIENWKNKLPEKEIUSFVEV^ SALHSI IPEYPYYHW 60 

VKGESGVDIE+WKNKLPEKEREPVEVT^ 
Sbjct: 350 VKGESGVDIEDWKNKLPEKEREPVEVIIjNRIiEDSELTNKEQAEVISAIiHS IIPBYPYYHW 409 

30 Query: 61 RHLHQDLHTACNDFYNEKKDYLSAAIEAVKVFEDKVQK^ 120 

RHLHQDLHTACNDFYNEKKDYL SAAIEAVKVFEDKVQKQTGLHS IDGRELIE+AFGSK S 
Sbjct: 410 RHLHQDI^ACNDFYNEKKDYLSAA^ 469 

Query: 121 MLLLTNNKTQAEQNLEDGLEQLACGTWTGF^ IFNDKDALDLISLVS 180 

35 +LLLTNNKT+ AEQNLEDGLEQIJ^GTWTGFRNPV^ IFNDKDALDLI SLVS 

Sbjct: 470 II*LLTNNKTKAEQNLEIX3LEQIA^^ 529 

Query: 181 YLLRKVEQTKKRAKPTS 197 
YLLRKVEQTKKR+K S 
40 Sbjct: 530 YXjLRKVEQTKKRSKWS 546 



>gi|10955124|ref|NP_059780.1| ymh [Agrobacterium tumefaciens]
gi|5738274|gb|AAB91582.2| (AF242881) ymh [Agrobacterium tumefaciens]
Length = 266

Score = 58.7 bits (141), Expect = 5e-08
Identities = 40/127 (31%), Positives = 69/127 (53%), Gaps = 5/127 (3%)



Query: 61 RHLHQDLHTACNDFYNEKKDYLSAAIEAVKVFEDKVQKQTCLH 120 
50 R +H D+ C + +Y A +EAVK DK++++TGL + DG L+++AF 

Sbjct: 137 RGVHPDVLRFCREEL-LVDNYFHAVLEAVKST^ 194 

Query: 121 MLIJjTiraKTQAEQNLETC IFNDKDALDLISLVS 180 
ML + ++++E+ + G L GT++ FRN H R + S +DA DL S+ S 
55 Sbjct: 195 MLATNELQSESEKGEQRGFSNLVKGTFSMF^ KEDAEDLFSMFS 251 

Query: 181 YLLRKVE 187 

+ R+++ 
Sbjct: 252 LMHRRID 258 



As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS50 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
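
The reasoning used for NGS50 (a protein with no homolog in serogroup B meningococcus can distinguish gonococcus from serogroup B) can be written as a simple decision rule. The following is a minimal sketch under stated assumptions, not the procedure used in the source: the E-value cutoff `EVALUE_CUTOFF`, the function names, and the dictionary keys are all hypothetical, since the text does not state a numerical threshold for calling a homolog.

```python
from typing import Mapping, Optional

# Hypothetical significance cutoff; the source does not state one.
EVALUE_CUTOFF = 1e-5


def has_homolog(best_evalue: Optional[float], cutoff: float = EVALUE_CUTOFF) -> bool:
    """A hit counts as a homolog only if an E-value exists and is at or below the cutoff."""
    return best_evalue is not None and best_evalue <= cutoff


def distinguishes_from_menb(best_evalues: Mapping[str, Optional[float]]) -> bool:
    """True when no serogroup B meningococcal homolog is found for the protein."""
    return not has_homolog(best_evalues.get("serogroup_B"))


# NGS50: Expect = e-102 against serogroup A (from the text); no serogroup B hit is reported.
ngs50 = {"serogroup_A": 1e-102, "serogroup_B": None}
ngs50_is_discriminating = distinguishes_from_menb(ngs50)
```

Representing a missing hit as `None` rather than as a large E-value keeps "no hit reported" distinct from "weak hit reported".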
Example 51



A DNA sequence was identified in N.gonorrhoeae <SEQ ID 101> which encodes amino acid sequence
<SEQ ID 102; NGS51>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 0.14
Possible cleavage site: 42
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.67 threshold: 0.0
PERIPHERAL Likelihood = 5.67
modified ALOM score: -1.63
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.145 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>pir||G81959 conserved hypothetical protein NMA0428 [imported] - Neisseria meningitidis (group A strain Z2491)
emb|CAB83727.1| (AL162753) conserved hypothetical protein [Neisseria meningitidis Z2491]
Length = 548

Score = 532 bits (1371), Expect = e-150
Identities = 272/285 (95%), Positives = 280/285 (97%)

Query: 1 MSEEKLKMSFEPTVTEHLGVKKYSHTV^ 60 

MSEEKLKMSFEPWIEHLGVKMYSHTvTAIAELIANA 
Sbjct: 1 MSEEKLKMSFEPTOIEHLGVKMYSHT^ 60 



40 Query: 61 DNGIGMSFDEINDFYLRIGRNRRBEKQASPCGRI PTGKKGLGKLiALFRLGNKIEI STIQG 120 

DNG IGMSFDE INDFYLRIGRNRREEKQASPCGRI PTGKKGLGKIiALF LGNKIEISTIQG 
Sbjct: 61 I^ICMSFDEINDFYLRIGRNRREEKQASPCGRIPTGKKGLGKI^ 120 

Query: 121 NERVTFTLDYAEIXKSERIYQPEFQiaSSVK 180 
45 NERVTFTLDYAEI++S+ IYQPEF+KESV+ N E+GTTITLTELTKKQGYPLDNYV HLS 

Sbjct: 121 NERVTFTLDYAEIRRSKGIYQPEFRKESVESNIESGTTITLTELTKKQGYPLDNYVEHL 180 

Query: 181 RLFDFPAQDFKIKVSLNGSEPRI IDGNLKYNLVT PQFEWEYQDLATNI S SLS SKFEQYEY 240 
RLFDFPAQDFKIKVSLNGSEP+IIIX3NLKY+LWPQFEWEYQDLATNI SS 
50 Sbjct: 181 RIiFDFPAQDFKIKVSLNGSEPKI IDGNIiKYDLVT PQFEWEYQDLATNISSLS SKFEQYEY 240 

Query: 241 SGLIQGKFITTEKPLKNNMKGITLFANGRMVNMPEFFTDSESSHF 285 

SGLIQGKFITTEKPLKNNMKGITLFANGRMVNMPEFFTDSESSHF 
Sbjct: 241 SGL IQGKFITTEKPLKNNMKGITLFANGRMVNMPEFFTDSESSHF 285 



>emb|CAC22276.1| (AJ302030) putative heat shock protein [Listeria monocytogenes]
Length = 181

Score = 70.2 bits (171), Expect = 2e-11
Identities = 57/173 (32%), Positives = 90/173 (51%), Gaps = 10/173 (5%)

Query: 1 MSEEKLKMSFEPTVTIEHUSVKMYSHTVPAI^ 60 

HSB++ + +P ++E LG +Y++ + BLIANAYDA A V V E+K++++ 
Sbjct: 1 MSEKEYNLDIDPRILELLGPHLYTNIYYILGELIANAYDM^ 60 
Query: 61 DNGIGMSFD--EINDFYLRIGRNRREEKQASPC GRIPTGKKGLGKLALFRLGNKIEI 115

D+G GMS++ ++ +F L + + R S R G+KG+GKLA + + I 

Sbjct: 61 DTCSGMSYENKDVKOT-LSVAKESRTNAINSX^ 119 


Query: 116 STIQGNEKVTFTLOTXBI-KKSERr^^ 167 

TI+ E+ F Ii I KK E I + +K +GT I +T T K 
Sbjct: 120 KTIKDGEKSGFVIjSRKVINKKLEAINEDTISFIKIK NHGTAIEMTNPTYK 169 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS51 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 52 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 103> which encodes amino acid sequence

<SEQ ID 104; NGS52>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.51
Possible cleavage site: 49
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.64 threshold: 0.0
PERIPHERAL Likelihood = 7.64
modified ALOM score: -2.03
Rule: cytoplasmic protein



*** Reasoning Step: 2
Final Results

bacterial cytoplasm Certainty= 0.213 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

2.1.1.73) NMA0427 [imported] - Neisseria meningitidis (group A strain Z2491)
gi|7379177|emb|CAB83726.1| (AL162753) modification methylase (cytosine-specific DNA methylase) [Neisseria meningitidis Z2491]
Length = 351

Score = 310 bits (794), Expect = 8e-84
Identities = 152/154 (98%), Positives = 153/154 (98%)

Query: 1 LGMENGFPKIMAGMQDETDFMHSCAGIjSD 60 
IjGMENGFPKI+AGHQDETDFMHSCAGLSDIN^ 
50 Sbjct: 198 LGMENGF PKI lAGHQDETDFMHSCAGIiSD INLKRLAL I PKNGGNRLAF AHI PEIjQIjECF I 257 

Query: 61 GKDNSFKDTFGRLWWDKPAPTITTKFFS I SNGRFAHPEEDRALSLREGATLQSFPRNYVF 120 

GKDNSFKOTFGRLWVTOKPAPTITTKFFSISNGRFAHPEEDRALSLR^ 
Sbjct: 258 GKDNSFKDTFGRLWWDKPAPTITTKFFSI SNGRFAHPEEDRALSUREGATLQSFPRNYVF 317 



Query: 121 KAGSRDKIARLIGNAVPPMYTEKIGRAIVDNIEC 154 

KAGSRDKIARLIGNAVPPMY EKIGRAIVDN1EC 
Sbjct: 318 KAGSRPKIARLIGNAVPPMYAEKIGRATVDNIEC 351 
>gi|127441|sp|P25265|MTD2_HERAU MODIFICATION METHYLASE HGIDII (CYTOSINE-SPECIFIC METHYLTRANSFERASE HGIDII) (M.HGIDII)

Length = 354 

Query: 128 - IARLIGNAVPPMYTEKIGRAI 148 

++R IGNAVP I ++X 

Sbjct: 316 TVSRQIGNAVPVAIiGRVIAKSI 337 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS52 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 53

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 105> which encodes amino acid sequence
<SEQ ID 106; NGS53>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -7.56
Possible cleavage site: 31
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.36 threshold: 0.0
PERIPHERAL Likelihood = 5.36
modified ALOM score: -1.57
Rule: cytoplasmic protein

*** Reasoning Step: 2 

Final Results 

bacteria! cytoplasm - Certainty* MJg«»a»J < ™*> 

The protein has homology with the following sequences in the databases: 

>*i|11256915|pir| (181959 ^^L^^^^^^^^^ 

jm "-thylase^^^^ meningitidis „„„ 
Length = 351

Score = 247 bits (606), Expect = 5e-65
Identities = 124/149 (83%), Positives = 127/149 (85%)

5 Query: 1 I^PETLEKELGLKKNDDDLILI 60 

LQPETLEKEIASLKKNDDDLILIGCSPC^^ 
Sbjct: 59 LQPETLEKELGIiKKNDD^ 118 

Query: 61 YVVVENVPGILSRMKBSGLDNPIKIiLEEKGFTVHFGIHNTADYGI PQSRKRFTLIANRIT 120 
10 YWVENVPGILSRMKES LDNFIKLTjREKGFTVHFGIHl^ADYGIPQSRKRFTLIANRIT 

Sbjct: 119 YVVVEmnPGILSRMKESRIiD^IKLLEEK 178 

Query: 121 KKSWNQSSIRANGLRYAMFWEWKTAPPKL 149 
K+ L + FPK+ 

15 Sbjct: 179 KEKliEPVKYSGKRLTVRDVLGMENGFPKI 207 

>gi|127441|sp|P25265|MTD2_HERAU MODIFICATION METHYLASE HGIDII (CYTOSINE-SPECIFIC METHYLTRANSFERASE HGIDII) (M.HGIDII)
gi|538661|pir||JT0594 site-specific DNA-methyltransferase (cytosine-specific) (EC 2.1.1.73) - Herpetosiphon aurantiacus
gi|48773|emb|CAA38941.1| (X55141) methyltransferase [Herpetosiphon aurantiacus]
Length = 354

Score = 71.9 bits (169), Expect = 4e-12
Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 1/105 (0%)

Query: 12 LKKNDDDLIL IGC 5PCQYWSVI QTDKRKSEKSKSLLI iEFQRFVEYFNPGYVVVENVPGIL 71 
30 L N+ IL+GC+PCQ +S TK++ LLEFR+ P + +ENVP + . j 

Sbjct: 64 LYPIWQHKIIjVGCAPCQDFSQY-TKKSRTGTKWQLLTEFSRLIRE IEPDI I SMENVPEVR 122 J 

Query: 72 SHMKESGLDNPIKTiT .T^KGFTVHFGIHNTADYGIPQSRKRFTLIA 116 1 
+ + +NFI+ LE+ G+ V + + DYGIPQ R R LA 
35 Sbjct: 123 TFNRGEVFNNF IQ SLEQIjGYHVSHSWHCPDYG I PQQRDRLVLF A 167 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS53 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 54 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 107> which encodes amino acid sequence 

<SEQ ID 108; NGS54>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.82
Possible cleavage site: 50
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.89 threshold: 0.0
PERIPHERAL Likelihood = 6.89
modified ALOM score: -1.88
Rule: cytoplasmic protein

*** Reasoning Step: 2
Final Results

bacterial cytoplasm Certainty= 0.253 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|1074456|pir||D64155 hypothetical protein HI0597 - Haemophilus influenzae (strain Rd KW20)
gi|1573586|gb|AAC22254.1| (U32741) conserved hypothetical protein [Haemophilus influenzae Rd]
Length = 272

Score = 188 bits (459), Expect = 2e-47
Identities = 95/100 (95%), Positives = 97/100 (97%)

15 Query: 1 MNLPFRAMVSDLGGTLLTPEHLVGDLT IDTLRVLBQKGVDI ILATGRNHTDMS SILGKIG 60 

MNLPFRAMVSDL GTLLT PEHLVGDLT IDTLR IiEQKGVDIILATGRNHTD+SSILGKIG 
Sbjct: 1 MmjPFRAMVSDLIXSTLLTPEHLVGDLTIDTLRALEQKGVDI ILATGRNHTDVSSILGKIG 60 

Query: 61 AERAVMITSNGARVRDLQGJJLLYSNSLPEELVLELYKTSY 100 
20 AERAVMITSNGARVRDLQGNLLYSNSLPEELVLELYKT + 

Sbjct: 61 AERAVMITSNGARVRDLQGNLIjYSNSLPEELVIjELyKTPF 100 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS54 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 55 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 109> which encodes amino acid sequence 

<SEQ ID 110; NGS55>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.46
Possible cleavage site: 37
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.02 threshold: 0.0
PERIPHERAL Likelihood = 3.02
modified ALOM score: -1.10
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.311 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

aeruginosa (strain PAO1)
gi|9948791|gb|AAG06104.1|AE004699_9 (AE004699) probable FMN oxidoreductase [Pseudomonas aeruginosa]
Length = 411

Score = 279 bits (686), Expect = 2e-74
Identities = 157/375 (41%), Positives = 228/375 (59%), Gaps = 10/375 (2%)
Query: 1 MEEQLAQNDQ - P SEKLVRLYGAWAEGG AGVLVTGNVMVAESGKG S INDVLI SDDRALEML 59

MEE +A Q PSE+L+RLY AWA+GGAG+L++GNVMV V++ DD LE 

Sbjct: 24 MEENMM)AAQAPSERLMRLYQAWADGGAGLIi I SGNVMVDSRAMTGPGGWLEDDAQIiEKF 83 

5 

Query: 60 KKWAKTaiTQNIOTiLIMQINHAG^ 117 

♦+WA+ +QINH G+Q A + + APSAVPL GM+ P+ + 

Sbjct: 84 ERWARIGRSAGAQFWLQINHPGRQMQANLG^ 143 

10 Query: 118 INGL IQQFVQTAKIAEQAGFSGVQIYAVHGYL I SQFLS PHHNRRQDQWGGSLENRMRFLL 177 

I +IQ+F ++A +AE+AGFSGV+I+A HGYL+SQFLSP NRR D WGGSLENR R LL 
Sbjct: 144 iD^EVTQRFARSAGLAERAGFSGVEIHAAHGYIiLSQ^ 203 

Query: 178 ETYTAIRAAAGKDFLVGVKLNSADFQKGGro 237 
15 E A+RA F V VKLNSADFQ+GGF ++ +W+ L +G+D +E+SGG+YB+P 

Sbjct: 204 EIVRAVRAEVAPGFAVAVKLNSMFQRGGFSAD 263 

Query: 238 QMLA- AKDS -TRKREAFF IDYAEKARAASQAPLI I TGGFRSQTAMEDALSSGHLDLVGIA 295 
M A+D T REA+F+++A RAA++ P+++TGG R + E L+SG +D+VGI 
20 Sbjct: 264 AMQGRARTCRTLAREAYFVEFARDIRAAARMPVMVTGGIRRRFV 322 

Query: 296 RPFALVPDLANKMQNRTYQTVQADRIQTGVAFVDKK^ 355 

A+ p+L + Q I + +K ++ K' Q+ + +G+ +N 

Sbjct: 323 TALAIEPNLPRDWRAGKDSAPQLRPI TWRNKFLASLANMAAVKFQLRKLSRGRATN 378 






Query: 356 PKIVGVES IAENFAG 370 

P++ + ++ AG 
Sbjct: 379 PKVSPLCALLAQQAG 393 



Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 56 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 111> which encodes amino acid sequence

<SEQ ID 112; NGS56>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.69
Possible cleavage site: 54
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.29 threshold: 0.0
PERIPHERAL Likelihood = 4.29
modified ALOM score: -1.36



*** Reasoning Step: 2
Final Results

bacterial inner membrane Certainty= 0.042 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|7444004|pir||D70029 transcription regulator ArsR family homolog yvhA - Bacillus subtilis
gi|2635892|emb|CAB15384.1| (Z99121) similar to transcriptional regulator (ArsR family) [Bacillus subtilis]
Length = 90

Score = 51.3 bits (118), Expect = 3e-06
Identities = 24/65 (36%), Positives = 42/65 (63%), Gaps = 1/65 (1%)

Query: 15 IFTVLSDENRHQILHVLWKHGRMNVl^LTEHLHL 74 

+P +SD R +IL +L K G M ++ EH ++S+P++SHHL 1+ QA ++ + G+ 
Sbjct: 4 VFKAISDPTRKKILDLL-KGGDMTAGD 62 

Query: 75 RFYSI 79

+YS+ 

Sbjct: 63 IYYSL 67 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.



Example 57 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 113> which encodes amino acid sequence
<SEQ ID 114; NGS57>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.57
Possible cleavage site: 55
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.67 threshold: 0.0
PERIPHERAL Likelihood = 5.67
modified ALOM score: -1.63
Rule: cytoplasmic protein



*** Reasoning Step: 2
Final Results

bacterial cytoplasm Certainty= 0.160 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|10444407|gb|AAG17897.1|AF297971_1 (AF297971) restriction endonuclease R.NgoMIII [Neisseria gonorrhoeae]
Length = 213

Score = 319 bits (818), Expect = 1e-86
Identities = 156/156 (100%), Positives = 156/156 (100%)

Query: 1 LYKQYADWNRLSYNAPIYVGKAVYKG^ 60 
LYKQ YADWNRLSYNAP I YVGKAVPKGWRQARNSDNALNQSTELFHRLKEHSRS IAAVSDL 
50 Sbjct: 58 LYKQYADWNRLSYNAPIW^ 117 

Query: 61 DPSDFMCRFVTFEGAGSDMIGTIEAALIKLHKPLW 120 

DPSDFMCRFVIFEGAGSI^IGTIEAALIKLHKPLWNSCVIXSFGNHDPGKGRYEQAKS 
Sbjct: 118 DPSDFMCRFVIFEGAG SDMIGT I EAAL IKLHKPLWNSCVIX3FGNHDPGKGRYEQAKSDWD 177 



Query: 121 VLHSGRVWADRLNGIPNSYESILENINTHLEIIKRK 156 

VLHSGKVWADRLNGI PNSYES ILENINTHLEI IKRK 
Sbjct: 178 VLHSGRVWADRLNGI PNSYES ILENINTHLE I IKRK 213 
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 58 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 115> which encodes amino acid sequence
<SEQ ID 116; NGS58>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.92
Possible cleavage site: 16
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.41 threshold: 0.0
PERIPHERAL Likelihood = 5.41
modified ALOM score: -1.58
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.107 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|10444408|gb|AAG17898.1|AF297971_2 (AF297971) DNA cytosine methyltransferase M.NgoMIII [Neisseria gonorrhoeae]
Length = 377

Score = 759 bits (1960), Expect = 0.0
Identities = 377/377 (100%), Positives = 377/377 (100%)

Query- 1 MKSLEIFSGAGGLAKGLE1AGFQHASFIELNK 60 

MKSLEIFSGAGGLAKGLELAGFQHASFI 
Sbjct: 1 MKSI^IFSGAGGLAKGLELAGFQHASFIELNKDAC^S 60 

Query- 61 EGIEV1AGGPPCQPF SLGGKHLAHEDRRDMFPHAWYVEYYR 120 

EGIEVXAGGPPCQPFSLGGKHLAHEDRRDMFPHAW 
Sbjct: 61 EGIEVXAGGPPCQPFSLGGKHLAHEDRRDMFPHAVRYV^^ 120 

Query: 121 DYFEYHjLRLTYPNLGILQNEDWKGHLTRLKE IEFNLYKG IKYKVS YQLLNAADYGVPQK 180 
* ~ DYFEYILLRLTYPNX^ILQl^DVmGHLTRLKEI 

Sbjct: 121 DYFEYILLRLTYPNIXSILQNEDVTCGH^^ 180 

Query: 181 RERWIVGIRADIJ3IDWKFPKRTHSEDRIiNWEKYVTC 240 

RERVVIVGIRADIjDIDWKFPKRTHSEDRLNWEKV^ 
Sbjct: 181 RERvVIVTSIRADLDIDWKFPKRTHSED 240 

Query: 241 YGIFEPEKKPWQTVROTLSDIPHPIXjNHKITGHEYRDGARI^ 300 

YGIFEPEKKPWQTVRDTLSDIPHPLGNHKITGHEYRDGARIYre 
Sbjct: 241 YGIFEPEKKFWQTVRDTLSDIPHPI^NHKITGHEYR 300 

Query: 301 GHGVPGGENMIRYDIX3TVRYFTSYEAKLIjQTFPEEFVT SGAWGEAMRQIGNAVPVKLSEI 360 

GHGVPGGENMIRYDIXjTVRYFTSYEAKIjLQTFPEEFVI SGAWGEAMRQIGNAVPVKLSEI 
Sbjct: 301 GHGWGGENMIRYDDGTVRYFTSYEAKIJ 360 

Query: 361 LGKHLMGVLSEKSSLHN 377 

IjGKHLMGVLSEKSSLHN 
Sbjct: 361 LGKHLMGVLSEKSSLHN 377 
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 59 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 117> which encodes amino acid sequence
<SEQ ID 118; NGS59>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.82
Possible cleavage site: 60
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.86 threshold: 0.0
PERIPHERAL Likelihood = 2.86
modified ALOM score: -1.07
Rule: cytoplasmic protein

*** Reasoning Step: 2 

Final Results 

bacterial cytoplasm Certainty= 0.197 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11353338|pir||F81882 hypothetical protein NMA1155 [imported] - Neisseria
meningitidis (group A strain Z2491)
gi|7379848|emb|CAB84417.1| (AL162755) hypothetical protein NMA1155 [Neisseria
meningitidis Z2491]
Length = 120

Score = 131 bits (329), Expect = 2e-30
Identities = 64/68 (94%), Positives = 67/68 (98%)

Query: 1 LSDI SASRAAYMOTQKQYPFEWAVCVLPNHIHAIWTLPP 60

+S+ISASRAAYMOTQKQYPFETVAVCVLPNHIHAI\^ 
Sbjct: 1 MSNISASRAAYMDVQKQYPFETVAVC^ 60 

Query: 61 PHTKNLGA 68 

P+TKNL A 
Sbjct: 61 PYTKNLSA 68 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
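
The BLAST summary lines quoted in these examples (Score, Expect, Identities, Positives) can be read programmatically. The following is a minimal Python sketch and is not part of the original analysis; the function names `parse_blast_summary` and `percent` and the dictionary keys are assumptions chosen for illustration. It recomputes the identity percentage from the raw counts instead of relying on the printed, rounded figure.

```python
import re

# Hypothetical helpers (not from the source document): they extract the
# numeric fields from the two BLAST summary lines shown in the examples.
_SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\)\s*,\s*Expect\s*=\s*([\d.eE+-]+)"
)
_FRAC_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)")


def parse_blast_summary(text):
    """Return bit score, raw score, E-value and match fractions as a dict."""
    result = {}
    m = _SCORE_RE.search(text)
    if m:
        result["bits"] = float(m.group(1))
        result["raw_score"] = int(m.group(2))
        result["expect"] = float(m.group(3))
    # Each fraction is stored as a (matches, aligned_length) tuple.
    for name, num, den in _FRAC_RE.findall(text):
        result[name.lower()] = (int(num), int(den))
    return result


def percent(fraction):
    """Convert a (matches, aligned_length) tuple to a percentage."""
    num, den = fraction
    return 100.0 * num / den


# Summary lines copied from the NMA1155 hit in Example 59.
summary = parse_blast_summary(
    "Score = 131 bits (329), Expect = 2e-30\n"
    "Identities = 64/68 (94%), Positives = 67/68 (98%)"
)
```

Under these assumptions, `percent(summary["identities"])` gives the unrounded identity over the aligned region only (68 residues here), not over the full 120-residue database entry.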

Example 60 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 119> which encodes amino acid sequence
<SEQ ID 120; NGS60>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -3.14 
Possible cleavage site: 16
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.76 threshold: 0.0
PERIPHERAL Likelihood = 2.76
modified ALOM score: -1.05
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.330 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11281269|pir||D81804 hypothetical protein NMA1789 [imported] - Neisseria
meningitidis (group A strain Z2491)
gi|7380430|emb|CAB85016.1| (AL162757) hypothetical protein [Neisseria
meningitidis Z2491]
Length = 243

Score = 154 bits (389), Expect = 5e-37
Identities = 82/85 (96%), Positives = 82/85 (96%)

Query: 12 MOTKTELQKLLEEDISTLKETLIRVDALPPR^ 71
MNTKTELrQKLLEEDISTL ETLI DALPPRYVRS I ATP IVRRWL IDKQLNILAKEIGLT
Sbjct: 1 MNTKTELQKLLEEDI STLTETLICADALPPRYVRSIATP rVRRVJLXDKQLNItiAKEIGLT 60

Query: 72 IELPILOTSLVFEKLSTLENKVNFY 96
IELPILDTSLVFEKLSTLENKVNFY
Sbjct: 61 lEIiPILDTSIiVFEjabSTLENKVNFY 85

As a homolog was found in serogroup A N. meningitidis but not in serogroup B, NGS60 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
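
The reasoning used in this and several later examples (a homolog in serogroup A N.meningitidis but none in serogroup B) can be expressed as a simple filter over BLAST hits. The sketch below is illustrative only. The hit lists are hand-copied from the locus tags and Expect values shown in Examples 60, 63 and 68, the NMA/NMB prefix test relies on the locus-tag naming seen in those hits, and the E-value cutoff is an assumed parameter, since this section does not state the threshold that was actually applied.

```python
# Illustrative data: (locus tag, Expect value) pairs taken from the BLAST
# hits printed in Examples 60, 63 and 68 of this section.
HITS = {
    "NGS60": [("NMA1789", 5e-37)],
    "NGS63": [("NMB0912", 3e-06)],
    "NGS68": [("NMA2214", 3e-69), ("NMB0273", 5e-55)],
}


def has_homolog(hits, prefix, max_expect):
    """True if any hit with the given locus-tag prefix passes the cutoff."""
    return any(
        locus.startswith(prefix) and expect <= max_expect
        for locus, expect in hits
    )


def distinguishes_from_menb(hits, max_expect=1e-5):
    """Serogroup A homolog present and serogroup B homolog absent.

    max_expect is an assumed cutoff, not a value given in the source.
    """
    return has_homolog(hits, "NMA", max_expect) and not has_homolog(
        hits, "NMB", max_expect
    )


flags = {name: distinguishes_from_menb(hits) for name, hits in HITS.items()}
```

With the assumed cutoff, only NGS60 is flagged among these three; changing `max_expect` changes which weak hits (such as the 3e-06 hit for NGS63) count as homologs.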

Example 61 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 121> which encodes amino acid sequence 

<SEQ ID 122; NGS61>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -6.88 
Possible cleavage site: 32
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 5.52 threshold: 0.0 
PERIPHERAL Likelihood =5.52 
modified ALOM score: -1.60 
Rule: cytoplasmic protein 

*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.300 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11281269|pir||D81804 hypothetical protein NMA1789 [imported] - Neisseria
meningitidis (group A strain Z2491)
gi|7380430|emb|CAB85016.1| (AL162757) hypothetical protein [Neisseria
meningitidis Z2491]
Length = 243

Score = 193 bits (491), Expect = 5e-49
Identities = 96/101 (95%), Positives = 97/101 (95%)

Query: 1 MAGGVYLGGKI ISPIYHS SQEFSGEPIIYABTNI ILCPAEKFLTLKRVFHNGNIFNMNQI 60 

MAGGVYLGG+ IS IYHSSQEFSGEPIIYAE NIIUTPAEKFLTLKRVFHNGNIFNMNQI 
Sbjct: 86 MAGGVYIiGGEFISSIYHSSQEFSGEPIIYAEPNIILCPAEKFLTL^ 145 



Query: 61 I TFL SITCQGGVRFDKNYDKYKTW^ 101

ITFLSNKQGGV FDKNYDKYKTWQVAIEKAANFLKLGNPYN
Sbjct: 146 ITFLSNKQGGVHFDKNYDKVKTWQVAIEKAAOT 186

As a homolog (amino acids 1-96) was found in serogroup A N.meningitidis but not in serogroup B,
NGS61 protein and nucleic acid are useful for distinguishing between gonococcus and serogroup B
N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 62 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 123> which encodes amino acid sequence
<SEQ ID 124; NGS62>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.43
Possible cleavage site: 44
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 4.72 threshold: 0.0 
PERIPHERAL Likelihood = 4.72

modified ALOM score: -1.44 
Rule: cytoplasmic protein 



*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.324 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS62 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 63 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 125> which encodes amino acid sequence
<SEQ ID 126; NGS63>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): 0.74 
Possible cleavage site: 24
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 25
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 10.72 threshold: 0.0
PERIPHERAL Likelihood = 10.72
modified ALOM score: -2.64
Score for OM-PP discrimination: -22.14
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -22.14
Rule: outer membrane or periplasmic protein



*** Reasoning Step: 2

Periplasmic space? Score: 2.21378
Periplasmic space? Score: 2.21378

Final Results
bacterial periplasmic space Certainty= 0.931 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.237 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11284146|pir||B81142 hypothetical protein NMB0912 [imported] - Neisseria
meningitidis (group B strain MC58)
gi|7226150|gb|AAF41320.1| (AE002443) hypothetical protein [Neisseria
meningitidis MC58]
Length = 208

Score = 51.7 bits (119), Expect = 3e-06
Identities = 30/72 (41%), Positives = 40/72 (54%)



Query: 5 LLKNWKPLLILSAIAFFAVSWQLDRAAQYRR 64
llk WKP+ +L I +W DRA +YR G AA +E+S RLK +E A+ AR +
Sbjct: 43 LLKYVTCWGVLLLrVLI^ 102

Query: 65 AYQAQXAAREEK 76
A A R+ K
Sbjct: 103 KAAAAFAERQTK 114

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
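
The "Final Results" block of the localization output lists one certainty value per compartment, and the highest value gives the predicted location. The following is a minimal Python sketch of reading such a block. It is not the program that produced the output; the names `parse_final_results` and `predicted_site` are hypothetical, and the pattern deliberately accepts `*` or `^` in place of `=` on the assumption that those characters are scanning damage.

```python
import re

# Hypothetical parser for "Final Results" lines such as
#   bacterial periplasmic space Certainty= 0.931 (Affirmative) < succ>
# The value group tolerates stray spaces inside the number (e.g. "0 .054").
_LINE_RE = re.compile(
    r"bacterial\s+(?P<site>[a-z ]+?)\W*Certainty\s*[=*^]\s*(?P<value>[\d. ]+?)\s*\("
)


def parse_final_results(text):
    """Map each compartment name to its certainty value."""
    scores = {}
    for m in _LINE_RE.finditer(text):
        value = float(m.group("value").replace(" ", ""))
        scores[m.group("site").strip()] = value
    return scores


def predicted_site(scores):
    """Return the compartment with the highest certainty."""
    return max(scores, key=scores.get)


# Values copied from Example 63.
example_63 = """\
bacterial periplasmic space Certainty= 0.931 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.237 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>
"""
scores_63 = parse_final_results(example_63)
site_63 = predicted_site(scores_63)
```

For Example 63 this selects the periplasmic space, in line with the "outer membrane or periplasmic protein" rule reported above.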

Example 64 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 127> which encodes amino acid sequence
<SEQ ID 128; NGS64>. Analysis of this protein sequence reveals the following:

McG: Examining signal sequence (McGeoch)
Length of UR: 0
Peak Value of UR: 2.99
Net Charge of CR: 4
Discriminant Score: 5.35

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -2.53 
Possible cleavage site: 33
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.23 threshold: 0.0
PERIPHERAL Likelihood = 3.23
modified ALOM score: -1.15

*** Reasoning Step: 2 

Final Results
bacterial inner membrane Certainty= 0.054 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 65 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 129> which encodes amino acid sequence
<SEQ ID 130; NGS65>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.61
Possible cleavage site: 61
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 3.34 threshold: 0.0 
PERIPHERAL Likelihood = 3.34

modified ALOM score: -1.17 
Rule: cytoplasmic protein 



*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.236 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|1175791|sp|P44189|YE18_HAEIN HYPOTHETICAL PROTEIN HI1418
gi|1074769|pir||A64029 hypothetical protein HI1418 - Haemophilus influenzae
(strain Rd KW20)
gi|1574254|gb|AAC23068.1| (U32821) H. influenzae predicted coding region HI1418
[Haemophilus influenzae Rd]
Length = 201

Score = 104 bits (251), Expect = 1e-21
Identities = 58/119 (48%), Positives = 72/119 (59%), Gaps = 4/119 (3%)


Query: 51 LKMQOTISVFSFKSQNVRTQI-IiGAEPWFCLGDVAEILQIQNAR QLPLKDQGIQKSS 106 

+K Q S F+FK VR + B WFC DV Hi N+R Q K G+ K 
Sbjct: 14 MKNQIQFSTFNFKDLPVRVILDPKGEFWFCGTDVCHIIiGYTO 73 

Query: 107 VATKKGNQELLF INEPNLYRVIFRSRKAEAVKFQDWIFEEVX PQIRKTGGYQ ITPKTTA 165

TK +QE+ FXNEPNLYR+I +SRK EA F+ W+FEEV+PQIRKTG YQ+ P+ A 
Sbjct: 74 TPTKSADQEMTFINEPNIiYRM^ 132 

>gi|11281012|pir||A81144 hypothetical protein NMB0900 [imported] - Neisseria
meningitidis (group B strain MC58)
gi|7226137|gb|AAF41308.1| (AE002442) hypothetical protein [Neisseria
meningitidis MC58]
Length = 305

Score = 104 bits (249), Expect = 2e-21
Identities = 73/137 (53%), Positives = 93/137 (67%), Gaps = 2/137 (1%)



Query: 190 YSMIHQRFNVEAVEGI PADKLPEAVAYVHALTLHTC ISGN 248 

+S + +F B +PA++ PE ++ «► + + G L GEY DREPLPAPQPALPISGN

Sbjct: 164 WSAVKSKFGC SYKE- VPAEQFPEVLSVMGRVAVENGVLYGEVLDREPLPAPQPALPI SGN 222 

Query: 249 AliADIAAMVTY^GTRMIELGKDVSAPLKQl^^ 308 
AL D+A V YG I++G+DVS PLKQLGCKQAVTMWTVW ETRS LK + ALE L 
Sbjct: 223 ALYDLAVAVRYGAWAIQMGRDVSLPIiKQLGCKQAVTMWW 282



Query: 309 YADKDASGRIAACLEGI 325 

+AD + + +1 L I I 
Sbjct: 283 HADAEHAAKIRPMLPEI 299 

>gi|7460273|pir||T13267 hypothetical protein - Lactococcus lactis phage BK5-T
gi|928839|gb|AAA985?0.1| (L44593) ORF266? putative [Lactococcus phage BK5-T]
Length = 266

Score = 75.9 bits (179), Expect = 6e-13
Identities = 42/111 (37%), Positives = 63/111 (55%), Gaps = 3/111 (2%)

Query: 55 NTISVFSFKSQNVRTQII^AEPWFCI^DVAEILQIQNAR QLPLKDQGIQRS SVATKK 111 

N + F+F + VRT ++ EPWP DVA + +N R + +KD+ ++S + T 
Sbjct: 2 NELQNFNFNmiPVRTVLINDEPWFVGK^ 61

Query: 112 GNQELLFINEPNLYRVIFRSRKAEAVKFQDWIFEEvT 162 

G Q + I+EP LY++ S+ A FQDW++EEV+P IRK G Y K 
Sbjct: 62 GVQSVTVJSEPGLYQIaAGESKLPSAEPFQDWVYEEVLPTIRKHGAYMTDAK 112 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 66 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 131> which encodes amino acid sequence
<SEQ ID 132; NGS66>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.12
Possible cleavage site: 53
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.96 threshold: 0.0
PERIPHERAL Likelihood = 8.96
modified ALOM score: -2.29
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results



bacterial cytoplasm Certainty= 0.402 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 67 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 133> which encodes amino acid sequence 
<SEQ ID 134; NGS67>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): 5.71 

Possible cleavage site: 22
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 23 
ALOM: Finding transmembrane regions (Klein et al.) 

count: 0 value: 2.81 threshold: 0.0 

PERIPHERAL Likelihood = 2.81 
modified ALOM score: -1.06 
Score for OM-PP discrimination: -32.34 
Rule: outer membrane or periplasmic protein 
Score for OM-PP discrimination: -32.34 
Rule: outer membrane or periplasmic protein 

*** Reasoning Step: 2 

Periplasmic space? Score: 3.23391 
Periplasmic space? Score: 3.23391 



Final Results 



bacterial periplasmic space Certainty= 0.928 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.199 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>gi|7475078|pir||H69834 hypothetical protein yhjQ - Bacillus subtilis
gi|2226189|emb|CAA74479.1| (Y14081) hypothetical protein [Bacillus subtilis]
gi|2633396|emb|CAB12900.1| (Z99109) yhjQ [Bacillus subtilis]
Length = 108

Score = 32.9 bits (74), Expect = 2.1
Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 4/98 (4%)

Query: 54 CLDAGQVCLTHCL SLLTQGDTSMSIXIAVAVRQMLAIiCGAVHDtiAAQNS PLTRDAAKVCLE 113 

C+ A C T d, Q +S C R+ +C +SP ++ +C + 

Sbjct: 15 CMKAC^CFTKCLEESVQ~h^ 72

Query: 114 ACKQCAKACKEHSAHHAECKACYESCLDC IKECEKLAA 151

C+ C C +H H C+AC ++C C ++C +AA
Sbjct: 73 ICEACGTECGKHD--HDHCQACAKACFTCAEQCRSMAA 108



>gi|7479923|pir||T36241 hypothetical protein SCE39.31c - Streptomyces coelicolor
gi|4582392|emb|CAB40339.1| (AL049573) hypothetical protein [Streptomyces
coelicolor A3(2)]
Length = 136

Score = 30.9 bits (69), Expect = 7.7
Identities = 27/102 (26%), Positives = 43/102 (41%), Gaps = 6/102 (5%)

Query: 54 CLDAGQVCLTHCLSLLTQGOTSM^^ -NSPLTRDAAK 109 

C A C CLS TD++C +C A . + ++ ++ +TR + 

Sbjct: 34 C^ACTACADACLSBPTVAD--IiTKCIRTDM1X^ 91

Query: 110 VCLEACKQCAKACKEHS AHHAECKACYESCLDC IKECEKLAA 151 

C C C C . H+ H C+ C E+C C + C++L A 
Sbjct: 92 ACATVCAACGDECARHAGMHEHCRVCAEACRSCEQACQEI^ 133 

The protein was expressed in E.coli as a soluble 14.19 kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
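
A mass such as the 14.19 kDa figure quoted for the His-fusion product can be estimated from an amino acid sequence by summing average residue masses and adding one water molecule for the free termini. The Python sketch below illustrates that calculation under stated assumptions. The function name `average_mass_da` is hypothetical, and the input used is a stand-in hexahistidine peptide, not the NGS67 construct, whose full sequence and tag layout are not reproduced in this section.

```python
# Average residue masses in daltons for the 20 common amino acids
# (standard textbook values; each is the free amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one H2O accounts for the N- and C-terminal groups


def average_mass_da(sequence):
    """Estimate the average molecular mass of a peptide in daltons.

    Raises KeyError for characters outside the 20 standard residues,
    so OCR-damaged sequences are rejected rather than silently summed.
    """
    return sum(RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER


# Stand-in input: a bare hexahistidine peptide (an assumption for
# illustration, not the tag arrangement used in the source).
his6_kda = average_mass_da("HHHHHH") / 1000.0
```

Applied to a complete fusion sequence, the same function would give a predicted mass that can be compared with the value observed on a gel; the sequences printed in the alignments above are too damaged by scanning to be used as input.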

Example 68 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 135> which encodes amino acid sequence
<SEQ ID 136; NGS68>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.05
Possible cleavage site: 38
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.25 threshold: 0.0
PERIPHERAL Likelihood = 5.25

modified ALOM score: -1.55 
Rule: cytoplasmic protein 



*** Reasoning Step: 2

Final Results
bacterial cytoplasm Certainty= 0.220 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11353493|pir||A81795 hypothetical protein NMA2214 [imported] - Neisseria
meningitidis (group A strain Z2491)
gi|7380833|emb|CAB85425.1| (AL162758) hypothetical protein [Neisseria
meningitidis Z2491]
Length = 208

Score = 263 bits (673), Expect = 3e-69
Identities = 140/145 (96%), Positives = 143/145 (98%)






Query: 1 LDWRGNKPLGAAELADLKPLYKDFM^ 60 
LIW+GNKPLGAAELADLKPLYKDFMW GEDTQR 



Sbjct: 44 LDWQGNKPLGAAEL/UDIjXPL 103 

Query: 61 YASFWGDGYDVYRQLRWRQ I PEKQRKAFKKAAKSKNTVNFAGREYGI SKQNLSDVWDDFE 120 

YASFWGDGYDVYRQLRW+Q I PBKQRKAFKKAAKSK TVMFAGRKYGI SKQNLSDVWDDFE 
Sbjct: 104 YASFWGDGYDVYRQLRWQQI PEKQRKAFKKAAKSKKTVMFAGREYGISKQNLSDVWDDFE 163 

Query: 121 DAMELKAFPCL SSLFLTKWHKNLYE 145 

DAMELKAFPCLSSLFLTKWHKNLY+ 
Sbjct: 164 DAMELKAFPCL SSLFLTKWHKNLYD 188 

>gi|11280955|pir||B81219 hypothetical protein NMB0273 [imported] - Neisseria
meningitidis (group B strain MC58)
gi|7225497|gb|AAF40727.1| (AE002383) hypothetical protein [Neisseria
meningitidis MC58]
Length = 141

Score = 216 bits (550), Expect = 5e-55
Identities = 117/121 (96%), Positives = 119/121 (97%)

Query: 25 MYWERGLHMYKASAVVPTGYVRVGNTAPLC^ 84 

MYWERGLHMYKASAVVPTGYVRVGNTAPL GEDTQRYASFWGDGYDVYRQLRW+QIPEKQ 
Sbjct: 1 MYWERGLHMYKASAVVPTGYVKVGOTAPLVGEDTQK 60 

Query: 85 RKAFKKAAKSKNTVMFAGREYGISKQNLSDVWDDFEDA^ 144 

RKAFKKAAKSK TVMFAGREYGISKQl^SDVWDDFEDAMELKAFPCLSSLFLTKWHKNLY 
Sbjct: 61 RKAFKKAAKSKKTVMF AGRKYGI SKQI^SDVVTOD 120 

Query: 145 E 145 
+ 

Sbjct: 121 D 121

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 69 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 137> which encodes amino acid sequence
<SEQ ID 138; NGS69>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): -5.63 

Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 

count: 0 value: 6.84 threshold: 0.0 

PERIPHERAL Likelihood =6.84 
modified ALOM score: -1.87 
Rule: cytoplasmic protein 

*** Reasoning Step: 2 

Final Results 

bacterial cytoplasm Certainty= 0.361 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|7464552|pir||C64707 hypothetical protein HP1499 - Helicobacter pylori (strain
26695)
gi|2314683|gb|AAD08545.1| (AE000648) H. pylori predicted coding region HP1499
[Helicobacter pylori 26695]
Length = 272

Score = 38.2 bits (88), Expect = 0.13
Identities = 34/123 (27%), Positives = 58/123 (46%), Gaps = 10/123 (8%)

Query: 3 EFKFIFGQDFGLSKKEAIRKVLKWLPSHLKFTLMV^ FHPKAVFWKNDKN 56 

EF+ I 6 DF + ++IR +Ii ++ K + FHPK +NK 

Sbjct: 54 EFEIIVGLDFKTTDSKSIRFLIiDIiNKTYKKLRFYCY^ 112

Query: 57 EYYALIGSSNLTHAAFNSNYEAN- ILTKISEQDFIKVKSWADEI — AMKS IPVSEDWLEE 113 

E ++IGS+NLT +N+E NIT+ +++ + +1 A +E++L+ 

Sbjct: 113 EKTSIIGSTmjTKGGLENNFEVNTIFTE^ 172 


Query: 114 YQB 116 
YE 

Sbjct: 173 YNE 175 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS69 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 70 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 139> which encodes amino acid sequence
<SEQ ID 140; NGS70>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.18
Possible cleavage site: 22
>>> Seems to have a cleavable N-term signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 23 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 4.19 threshold: 0.0 
PERIPHERAL Likelihood = 4.19
modified ALOM score: -1.34
Score for OM-PP discrimination: -5.89
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -5.89
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Periplasmic space? Score: 0.588927
Periplasmic space? Score: 0.588927

Final Results
bacterial periplasmic space Certainty= 0.849 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.106 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>gi|11353344|pir||A81886 hypothetical protein NMA1183 [imported] - Neisseria
meningitidis (group A strain Z2491)
gi|7379875|emb|CAB84445.1| (AL162755) hypothetical protein NMA1183 [Neisseria
meningitidis Z2491]
Length = 74

Score = 58.7 bits (141), Expect = 2e-08
Identities = 30/43 (69%), Positives = 32/43 (73%)

Query: 62 IGGFGGVGGFGGLKPALVYRNFRI I ATNRPAATRARPRQTTVA 104

+G G+ G GGLKP LVY N IIATNRPAATRA PR TTVA 
Sbjct: 32 IMHIWIMSOGI^^ 74 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 71 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 141> which encodes amino acid sequence
<SEQ ID 142; NGS71>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.98
Possible cleavage site: 28
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 29
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.04 threshold: 0.0
PERIPHERAL Likelihood = 5.04
modified ALOM score: -1.51
Score for OM-PP discrimination: -9.17
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -9.17
Rule: outer membrane or periplasmic protein



*** Reasoning Step: 2

Periplasmic space? Score: 0.916744
Periplasmic space? Score: 0.916744

Final Results
bacterial periplasmic space Certainty= 0.923 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.146 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 72 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 143> which encodes amino acid sequence
<SEQ ID 144; NGS72>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.18
Possible cleavage site: 42
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 4 value: -8.76 threshold: 0.0
INTEGRAL Likelihood = -8.76 Transmembrane 11 - 27 ( 8 - 37)
INTEGRAL Likelihood = -6.90 Transmembrane 80 - 96 ( 75 - 102)
INTEGRAL Likelihood = -2.39 Transmembrane 98 - 114 ( 98 - 114)
INTEGRAL Likelihood = -1.12 Transmembrane 47 - 63 ( 47 - 64)
PERIPHERAL Likelihood = 3.55
modified ALOM score: 2.25
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results
bacterial inner membrane Certainty= 0.450 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>gi|11354095|pir||H81995 probable transmembrane transport protein NMA0047
[imported] - Neisseria meningitidis (group A strain Z2491)
gi|7378822|emb|CAB83364.1| (AL162752) putative transmembrane transport protein
[Neisseria meningitidis Z2491]
Length = 405

Score = 257 bits (658), Expect = 5e-68
Identities = 152/162 (93%), Positives = 154/162 (94%)



Query: 1 MTHTASKTPKLWAVIAAAAF ILLIT IGMRMTLGLFVQPVVNTTELNIAQF SLI ITVFQLM 60
MTHTASKTPK W I AAAFILLITIGMROTLGLFVQFVVNTTEL+IAQFSLII VFQLM
Sbjct: 1 MTHTASKTPKFWLTITAAAFILLITIGMR^ 60

Query: 61 WGVLQPLSGALADRFGAFRVLSGGAIiLLVCACLIASNI PTYWGLMI AVGLLLAFGTGSGG 120
WGV QPLSGALADRFGAFRVLSGGA+LLVCACLIA NIPTYWGIMIAVGLLLAFGTGSGG
Sbjct: 61 WGVSQPLSGALADRFGAFRVLSGGAVLLVCAC 120

Query: 121 FSI IMGQVAAQVPTHKRGIiASGLVNAGGSAGQFLFAPLVQGL 162
F S I IMGQVAAQ VP HKRGLASGLVNAGGSAGQFLFAPLVQGL
Sbjct: 121 FSI IMGQVAAQVPAHKRGLASGLVNAGGSAGQFLFAPLVQGL 162



>gi|11351469|pir||F83484 probable MFS transporter PA1286 [imported] - Pseudomonas
aeruginosa (strain PAO1)
gi|9947221|gb|AAG04675.1|AE004558_4 (AE004558) probable MFS transporter
[Pseudomonas aeruginosa]
Length = 399

Score = 72.5 bits (177), Expect = 3e-12
Identities = 53/149 (35%), Positives = 81/149 (53%)

Query: 14 VTCAAAAFILLITIGMRMTLGLFVQPV^^ 73 

++ + A IL +++G+R GLF+ P+ F+ I + L+WG+ QP +GALAD 

Sbjct: 8 ILLSGALILALSLGVRHGFGLFLAPMSADFGWGREOT 67 

Query: 74 RFGAFRVLSGGALLLVCACLIASNIPTY^ 133

R+GA R + G LL ++ + GL ++ GLL+ G FS+I+G V VP 

Sbjct: 68 RYGAARAVLVGGIiLYALGLVLMGLSQSASGL SLSAGLLIGLGLSGT SF SVILGAVGRAVP 127 

Query: 134 THKRGLASGLVNAGGSAGQFLFAPLVQGL 162 
+R +A G+ +A GS GQF P GL

Sbjct: 128 AEQRSMAMGI SSAAGSFGQFAMLPGTLGL 156 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS72 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
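
The ALOM section of the output lists each predicted membrane-spanning segment on an "INTEGRAL Likelihood" line. A short Python sketch of collecting those segments is given below. It is an illustration only and not the ALOM program itself; the name `parse_transmembrane` and the dictionary keys are assumptions, and the input lines are copied from Example 72.

```python
import re

# Matches lines such as
#   INTEGRAL Likelihood = -8.76 Transmembrane 11 - 27 ( 8 - 37)
# and captures the likelihood plus the core segment boundaries.
_TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?[\d.]+)\s+Transmembrane\s+(\d+)\s*-\s*(\d+)"
)


def parse_transmembrane(text):
    """Return predicted segments ordered by start position."""
    segments = [
        {"likelihood": float(score), "start": int(start), "end": int(end)}
        for score, start, end in _TM_RE.findall(text)
    ]
    return sorted(segments, key=lambda seg: seg["start"])


# Lines copied from the ALOM output in Example 72.
alom_72 = """\
INTEGRAL Likelihood = -8.76 Transmembrane 11 - 27 ( 8 - 37)
INTEGRAL Likelihood = -6.90 Transmembrane 80 - 96 ( 75 - 102)
INTEGRAL Likelihood = -2.39 Transmembrane 98 - 114 ( 98 - 114)
INTEGRAL Likelihood = -1.12 Transmembrane 47 - 63 ( 47 - 64)
"""
segments_72 = parse_transmembrane(alom_72)
lowest_72 = min(seg["likelihood"] for seg in segments_72)
```

In Example 72 the number of parsed segments and the lowest likelihood agree with the "count: 4 value: -8.76" line of the same output, which offers a quick consistency check on scanned text.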

Example 73 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 145> which encodes amino acid sequence
<SEQ ID 146; NGS73>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne) 

Signal Score (-7.5): -4.26
Possible cleavage site: 52
>>> Seems to have no N-terminal signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 3 value: -3.72 threshold: 0.0
[INTEGRAL Likelihood lines for the three transmembrane regions are illegible in the source]
modified ALOM score: 1.24
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2 

Final Results 

bacterial inner membrane Certainty= 0.249 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

Length = 405
Score = 222 bits (567), Expect = 3e-57
Identities = 116/123 (94%), Positives = 117/123 (94%), Gaps = 4/123 (3%)

Sbjct: 160 QGLVVLPEVGWTGTFYVWGAIALLTLPVSWWIA G^SLqSSS 215

Query: 163 KTAPKTPCTILLHLSFPACGFHIAFLVTHLPTEVALCGLPATVASTSIAI IGLANIAGOT 222 

KTAFKTPSYILI^SFFACGPHIAFLVTHLPTEVAI^GLPAT^TH^Tr^T^ 
Sbjct: 216 VX^Bimtmsn^^ 275 

Query: 223 FSG 225 
FSG 

Sbjct: 276 FSG 278 

>gi|11351469|pir||F83484 probable MFS transporter PA1286 [imported] - Pseudomonas
aeruginosa (strain PAO1)
gi|9947221|gb|AAG04675.1|AE004558_4 (AE004558) probable MFS transporter
[Pseudomonas aeruginosa]
Length = 399

Score = 54.4 bits (130), Expect = 1e-06
Identities = 37/115 (32%), Positives = 56/115 (48%), Gaps = 10/115 (8%)

Query: 111 VGWTGTFYWGAIALLIL^ 170

+<3W+ G + LI+P++ + H QSLGEA++ A 

Sbjct: 160 IjGWS S ALLALGLLVAL IVPLAGLM- — KDRPLPPQGHEQSLGEALREACAHSG 209 

Query: 171 YILLHL SFFACGFHIAFLVTHLPTEVALCX3LPATVASTSIAI IGLANIAGCVFSG 225

+ LL L FF CGF + P+ HLP + LPA V +T +A+4GL N+ G 4G 
Sbjct: 210 FWLLALGFFVCGFQVVFIGVHLPAYLVD 264 

As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS73 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 74 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 147> which encodes amino acid sequence
<SEQ ID 148; NGS74>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 0.41
Possible cleavage site: 30
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 2 value: -1.49 threshold: 0.0
INTEGRAL Likelihood = -1.49 Transmembrane 15 - 31 ( 15 - 31)
INTEGRAL Likelihood = -1.28 Transmembrane 83 - 99 ( 83 - 99)
PERIPHERAL Likelihood = 1.59
modified ALOM score: 0.80
Rule: cytoplasmic membrane protein

30 *** Reasoning Step: 2 

Final Results ■ „, 



45 



bacterial inner membrane Certainty= 0.160 (Affirmative) < suco 

35 bacterial periplasmic space Certainty^ 0.000 (Not Clear) < suco 

bacterial outer membrane Certainty* 0.000 (Not Clear) < suco 

bacterial cytoplasm Certainty* 0.000 (Not Clear) < succ> 

The protein has homology with the following sequences in the databases: 

>pir||H81995 probable transmembrane transport protein NMA0047 [imported] -
Neisseria meningitidis (group A strain Z2491)
emb|CAB83364.1| (AL162752) putative transmembrane transport protein [Neisseria
meningitidis Z2491]
Length = 405

Score = 148 bits (374), Expect = 2e-35

Identities = 97/106 (91%), Positives = 103/106 (96%)



Query: 1 MVLIYIFSPKTDLNFYIFAAALGFTWIA^ 60 
M+LIYIFSPKTDLNFYIFAAALGFTWLATV PTA++TGKLFGTRYLATLFGL ML+HQIG

Sbjct: 300 MILIYIFSPKTDLNFYIFAAALGFTWLATWPTASI 359 

Query: 61 GFLGSYIGGIVITQFGDYGWMOTADAVIAGTAALLVLPVREPRTAA 106 
GFLGSYIGGIVITQFGDyGWMWyADA+LAGTAALL LP+REPRTAA 
Sbjct: 360 GFLGSYIGG IVITQFGDYGWMOTADALLAGTAALLNL P IREPRT AA 405

>pir||F83484 probable MFS transporter PA1286 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG04675.1|AE004558_4 (AE004558) probable MFS transporter [Pseudomonas
aeruginosa]

Length = 399

Score = 59.0 bits (142), Expect = 2e-08

Identities = 40/101 (39%), Positives = 61/101 (59%)

Query: 1 MVLryiFSPKTDLNFYTFAAALGFT^ g 0 

+++++++ P + + Y F A+G WL+TV T LF6 R L+ L G+V L HQ+G 

Sbjct: 286 VTVLFLWLPLSVYSAYAFGVAMGLLWLSWPLT^ IVFLFHQLG 345 



Query: 61 GFLGSYIGGIVITQFGDYGWMWYADAVL 101 

FLG ++GG+V + G Y +W +L+ AALL PVRB 
Sbjct: 346 AFLGGWLGGVVYDRTGSYDLVWQ^^ 386 

As a homolog was found in serogroup A N. meningitidis but not in serogroup B, NGS74 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
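
The score, expectation value and identity counts reported with each database hit can be read mechanically. The following is a minimal Python sketch, assuming the classic BLAST text layout reproduced in these examples; the helper names (`parse_blast_summary`, `parse_expect`, `fraction`) are hypothetical and are not part of the disclosed analysis.

```python
import re

# Hypothetical helper names; the regular expressions assume the classic
# BLAST text layout shown in the alignments of these examples.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(token):
    """Convert an Expect token to float; some reports abbreviate 1e-117 as e-117."""
    return float("1" + token if token.startswith("e") else token)


def parse_blast_summary(text):
    """Collect score, E-value and match counts from one hit summary."""
    summary = {}
    score = SCORE_RE.search(text)
    if score:
        summary["bits"] = float(score.group(1))
        summary["raw_score"] = int(score.group(2))
        summary["expect"] = parse_expect(score.group(3))
    for name, matched, aligned in COUNT_RE.findall(text):
        summary[name.lower()] = (int(matched), int(aligned))
    return summary


def fraction(counts):
    """Matched positions divided by alignment length."""
    matched, aligned = counts
    return matched / aligned


# Summary lines of the Pseudomonas aeruginosa hit in Example 74.
example = """Score = 59.0 bits (142), Expect = 2e-08
Identities = 40/101 (39%), Positives = 61/101 (59%)"""
hit = parse_blast_summary(example)
print(hit["expect"], round(fraction(hit["identities"]), 3))
```

The sketch returns the raw counts rather than the printed percentages, so that the fraction can be recomputed independently of how the report rounded it.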

Example 75

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 149> which encodes amino acid sequence 
<SEQ ID 150; NGS75>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.2
Possible cleavage site: 22
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.45 threshold: 0.0
PERIPHERAL Likelihood = 4.45
modified ALOM score: -1.39
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.237 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

pir||S04419 type II site-specific deoxyribonuclease (EC 3.1.21.4) NgoPII -
Neisseria gonorrhoeae
emb|CAA36887.1| (X52661) NgoPII restriction and modification [N.gonorrhoeae]
prf||1613419A NgoPII endonuclease [Neisseria gonorrhoeae]
Length = 278

Score = 240 bits (614), Expect = 4e-63

Identities = 128/129 (99%), Positives = 128/129 (99%)

Query: 1 MNI IDAI INLANNPWG^SHSQSNNRANQ 60

MNI IDAI INLANNP WGV SHSQSNNRANQAGDALEEYVKDLFSGSFNLNETQRIARHAK 
Sbjct: 1 MNIIDAIINLANNPW<^SHSQSNNRAN^ 60 

Query: 61 VFSYIXSNNSNPPDAMLIWGI^r 120 
WSYLGNNSNPPDAMLRNGnAlEVKKIESKDSALALNSSHPKSKLSTO 
Sbjct: 61 VFSYLGNNSNPPDAMLRNGDAIEVKKIBSKDSALALNS 120

Query: 121 KWEEKDIIY 129 
KWEEKDIIY 
Sbjct: 121 KWEEKDIIY 129

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 76 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 151> which encodes amino acid sequence
<SEQ ID 152; NGS76>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.73
Possible cleavage site: 13
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 9.39 threshold: 0.0
PERIPHERAL Likelihood = 9.39
modified ALOM score: -2.38
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.272 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

pir||S04419 type II site-specific deoxyribonuclease (EC 3.1.21.4) NgoPII -
Neisseria gonorrhoeae
emb|CAA36887.1| (X52661) NgoPII restriction and modification [N.gonorrhoeae]
prf||1613419A NgoPII endonuclease [Neisseria gonorrhoeae]
Length = 278

Score = 268 bits (687), Expect = 2e-71

Identities = 136/136 (100%), Positives = 136/136 (100%)

Query: 1 LAMVYGIDYC^AECTLKIKNQIKEXSI^ 60 
LAMVYG IDYCADAECYLK IKNQ IKEG I GNIGG IQF AETKELGRVNRIDPLN ITYLRVRGM 
Sbjct: 143 LAMVYGIDYCADAECYLKIKNQIKEGIGNIGGIQFAETKEIXSR^ 202

Query: 61 WGIENFWFVFNYI YQRNMEKSFNFMAI INEDKWNSFNNTDKLLAIQDSKLAI SDIKIKNP 120 

WGIENPWFVFNYIYQRNMEKSFNFMAI INEDKWNSFNNTDKLLAIQDSKLAI SDIKIKNP 
Sbjct: 203 WGIENPWFVFNYIYQRNMEKSFNFMAI INEDKWNSFNNTDKLLAIQDSKLAI SDIKIKNP 262 



Query: 121 NNPARLRNAKLITYHL 136 

NNPARLRNAKL ITYHL 
Sbjct: 263 NNPARLRNAKLITYHL 278 



Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 77

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 153> which encodes amino acid sequence 

<SEQ ID 154; NGS77>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.51
Possible cleavage site: 58
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.18 threshold: 0.0
PERIPHERAL Likelihood = 3.18
modified ALOM score: -1.14
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.127 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases:

>pir||CTNHP2 site-specific DNA-methyltransferase (cytosine-specific) (EC
2.1.1.73) NgoPII - Neisseria gonorrhoeae
emb|CAA30038.1| (X06965) NgoPII methylase (AA 1-341) [Neisseria gonorrhoeae]
emb|CAA36888.1| (X52661) NgoPII restriction and modification [Neisseria
gonorrhoeae]
gb|AAA17019.1| (L14564) cytosine methylase [Neisseria gonorrhoeae]
prf||1613419B NgoPII methylase [Neisseria gonorrhoeae]
Length = 341

Score = 682 bits (1761), Expect = 0.0

Identities = 341/341 (100%), Positives = 341/341 (100%)

Query: 1 MQNSSPTTYNPMKI I SLFSGCGGLDLGFEKAGFEI PAANEYDKTIWATFKANHPKTHLIE 60 
MQNSSPTTYNPMKI I SLFSGCGGLDLGFEKAGFEIPAANEYDKTIWATFKANHPKTHLIE 
Sbjct: 1 MQNSSPTTYNPMKI I SLFSGCGGLDLGFEKAGFEI PAANEYDKTIWATFKANHPKTHLIE 60



Query: 61 GDIRKIKEEDFPEEIDGIIGGPPCQSWSEAGALRGIDDARGQLFFDYIRILKSKQPKFFL 120 

GDIRKIKEEDFPEEIDGIIGGPPCQSWSEAGALRGIDDARGQLFFDYIRILKSKQPKFFL 
Sbjct: 61 GDIRKIKEEDFPEEIDGIIGGPPCQSWSEAGALRGIDDARGQLFFDYIRILKSKQPKFFL 120 

Query: 121 AENVSCMJVNRHNGAVQNLLKMFDGCGYD^ 180 

AENVSGMLANRHNGAVQNLLKMFDGCGYDV^ 
Sb j Ct : 121 AE1WSGMLANRHNGAVQNLLKMFDGCGYDVTLTM I 180 



Query: 181 KFSFPKGSTVEDKDKITLKDVIVroLQDTAVPSAPQNKTNPDAV^ 240

KF SFPKGSTVEDKDKITLKDVIVroLQDTAV^SAPQNKTOT 
Sbjct: 181 KF SF PKGSTVEDKDKI TLKDVIWDLQDT AVP SAPQNKTNPDAVNNNEYFTGSFS PIFMSR 240 

Query: 241 NRVKAWDEQGFWQASGRQCQLHPQAPKMEKHGANDYRFAA 300 
NRVKAVTOEQGFTVQASGRQCQLHPQAPKMEKHGAND*^

Sbjct: 241 NKVKAWDEQGFWQASGRQCQLHPQAPKMEKHGANDYRFAA^^ 300 

Query: 301 FPDNFKF ryQNVNDAYKMIGNAVPVNLAYE IAAA1KKTLER 341 
FPDNFKF I YQNVNDAYKMIGNAVPVNLAYE IAAAIKKTLER 
Sbjct: 301 FPDNFKF IYQNVNDAYKMIGNAVPVNLAYE IAAAIKKTLER 341

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 78 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 155> which encodes amino acid sequence 
<SEQ ID 156; NGS78>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.33
Possible cleavage site: 24
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.17 threshold: 0.0
PERIPHERAL Likelihood = 2.17
modified ALOM score: -0.93
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.220 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>



The protein has homology with the following sequences in the databases: 

>pir||H82000 hypothetical protein NMA0089 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB83405.1| (AL162752) hypothetical protein NMA0089 [Neisseria meningitidis
Z2491]
Length = 226

Score = 422 bits (1085), Expect = e-117

Identities = 217/228 (95%), Positives = 220/228 (96%), Gaps = 2/228 (0%)

Query: 1 MERYKNAVRKDKAAELYLLNLSL 60 

MERYKNAV KDKAAELYLLNLSLSRELFHVVSIFEIVIjRNKIDIC +Q FKD N LY+SI 
Sbjct: 1 MERYKNAVGKDKAAELYLLN^^ 60 

Query: 61 QPQTNPALKYQGCFLRNGTKE SAELIKVALSKI QNNSGGKFDHNQLVAGLGFGFWRYIiFA 120 

QPQTNPALKYQGCFLRNGTKESABLIKVALSKIQNNSGGK^ 
Sbjct: 61 QPQTNPALKYQGCFLRNGTKESAELIKVALSKIQ 120 

Query: 121 GGKBAQFDAAGKVLMKvPPKKFKSTPSV^^ 180 

GGKDAQFDA GKVIiMKVFPKKPKSTPSVQ+NQKWIFRELSNINNFHNRLAHHE FK 
Sbjct: 121 GGKDAQFDATGKVLMKVFPKKPKSTPSVQHNQKWIFR — FK 178 

Query: 181 GAIKDTCYARNIHQSIFELLNYMDVDTASW 228 

GAIKDTGYARNIHQ S IFELLNYMDVDTASVF SHF SDQVTAVCDEIDKIi 
Sbjct: 179 GAIKDTGYARNIHQ S IFELLNYMD^OTTASVF SHFSDQVIAVCDE IDKL 226 

>ref|NP_053274.1| Hypothetical gene [Agrobacterium tumefaciens]
dbj|BAA87659.1| (AB016260) Hypothetical gene [Agrobacterium tumefaciens]
Length = 380

Score = 43.6 bits (102), Expect = 0.002

Identities = 53/215 (24%), Positives = 86/215 (39%), Gaps = 42/215 (19%)

Query: 5 KNAVRKDKAAELYLLNLSLSREIJ^ 64 
K ++ A LYL N +++ + +++ E+ LRN +D F 
Sbjct: 55 KGGNHEEYTmLYLYNARVAKAFLYPLNVAEVT^ 101

Query: 65 NPALKYQGCFLRNGTKES AEL IKVALSKIQIJNSGGKFDHNQLVAGLGFGFWRYLFAGGKD 124 

A +Q R+ T 1» L K +G +Q+VA L F FW LF 
Sbjct: 102 — ANWHQDATFRDQTLTGNGL — ATLDKAIQRAGAGAARDQIVATLTFDFWSNLFR 153 

Query: 125 AQFDAAGKVLMKVFPKKPKSTPSVQYNQKWIFRELSN INNFRNKLAHHEPICFSFK 180 

++ + + + + P +Q+ + +B+ N IN FRNR+AHHEP+ 
Sbjct: 154 PEYGGL WRTTVNI - — AFPHLQHGESR — QEIQNLVKPINVFRNRVAHHEPVL 201 

Query: 181 GAJKDTGYARNIHQSIFELLNYM^ 215 

D +IH I L+ +TA+ H S 

Sbjct: 202 DLNVT-DIHAKIVRLIEIiRCAETATWMKHHS 231 



As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS78 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 79 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 157> which encodes amino acid sequence
<SEQ ID 158; NGS79>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 2.07
Possible cleavage site: 17
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 18
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 9.49 threshold: 0.0
PERIPHERAL Likelihood = 9.49
modified ALOM score: -2.40
Score for OM-PP discrimination: -11.72
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -11.72
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Periplasmic space? Score: 1.17242
Periplasmic space? Score: 1.17242

----- Final Results -----
bacterial periplasmic space Certainty= 0.932 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.240 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>



A homolog (amino acids 23-85) was found in serogroup A N. meningitidis but not in serogroup B, so 
NGS79 protein and nucleic acid are useful for distinguishing between gonococcus and serogroup B 
N.meningitidis. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
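
The "Rule:" lines in the localization output of Examples 74, 75 and 79 follow a simple pattern: a transmembrane call when the ALOM value falls below the threshold, otherwise a split on whether a cleavable N-terminal signal sequence was found. The Python sketch below is only a simplified reading of that pattern from the outputs reproduced here, with a hypothetical function name; it is not the localization program itself, and it does not cover proteins that have both a cleavable signal and a transmembrane segment.

```python
def localization_rule(has_cleavable_signal, alom_value, threshold=0.0):
    """Simplified reading of the 'Rule:' lines shown in these examples.

    Assumption: only the three cases seen in Examples 74, 75 and 79
    are modelled; the real program applies further reasoning steps.
    """
    # A value below the threshold is reported as an INTEGRAL (membrane-spanning) segment.
    if alom_value < threshold:
        return "cytoplasmic membrane protein"
    # No transmembrane segment, but a cleavable signal: the protein is exported.
    if has_cleavable_signal:
        return "outer membrane or periplasmic protein"
    # Neither a signal sequence nor a transmembrane segment.
    return "cytoplasmic protein"


# Values transcribed from Examples 74, 75 and 79.
print(localization_rule(False, -1.49))
print(localization_rule(False, 4.45))
print(localization_rule(True, 9.49))
```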

Example 80

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 159> which encodes amino acid sequence 
<SEQ ID 160; NGS80>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.49
Possible cleavage site: 57
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.98 threshold: 0.0
PERIPHERAL Likelihood = 4.98
modified ALOM score: -1.50
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.428 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||E81080 conserved hypothetical protein NMB1466 [imported] - Neisseria
meningitidis (group B strain MC58)
gb|AAF41823.1| (AE002496) conserved hypothetical protein [Neisseria meningitidis
MC58]
Length = 243

Score = 148 bits (375), Expect = 7e-35

Identities = 82/189 (43%), Positives = 109/189 (57%)

Query: 120 VDRMFNMAGNHFARLGI SGSGVHYWNARDF SEQAFHAEVGYRYRNSRLEWGFRPFVKQNR 179

+ R N +HF GISG GVHYW+ +DFSEQ+ GY+ *+ +^ PFV+QN 

Sbjct: 1 MSREINAGRHHFLYGGI SGGGVHyVTONKDFSEQSLRLSFGYKNRSvTRSFGIVPFVEQNL 60 

Query: 180 LGNNKCTANTGXVliOT SVSGTFSYY 239

LG +RY G D+S+RL+E+WR T + K Y -M- A RY+S T Y 

Sbjct: 61 I^SRYNFVGGFNADFSQRLSERWRLTLNAGNMWKHYQEDRTAARYDS 120 

Query: 240 AMSAWQLYGGISGMFDNTVEKEQASRRYGVSU^^ 299

AW LYGG + T E EQAS R G+ +G VK DGGLGL+ YT+R+F AP T 

Sbjct: 121 APKDWLLYGGADWSHNITKEAEQASIRKGL 180 

Query: 300 LIYNFTRRD 308 

++Y F R+D 
Sbjct: 181 XVYRFPRKD 189 

>gb|AAD11779.1| (AF118122) putative outer membrane protein OmpU [Neisseria
meningitidis]
Length = 488

Score = 72.1 bits (176), Expect = 7e-12

Identities = 71/300 (23%), Positives = 128/300 (42%), Gaps = 17/300 (5%)

Query: 3 EAADLYRELLSERPDLWPRFDLGVMLFEDKQYREALVQLHRAE -EVLPPDMRQLAREYI 61 

EA YREL++ +PD R L LF+++Q A Q R + E LPP + + Y 
Sbjct: 136 EAI SHYRELI2^QPDAPAVRMRLAAALFDNRQNEAAAD^ 195 

Query: 62 RQAEAVQAWHPSFNMNYEQTDNVNNASLSRDIVINGRKOT 119

+ AW++ + N+NA+ KW +PK++G+YLG 
Sbjct: 196 KALRERDAWKVNGGFSVTREHNINQAPKRQQY GKW TFPKQVDGTAVNYRLG 246 
Query: 120 VDRMFNMAGiraFARL^ 179

++ +++ ' + 6 SG Y + F++ Q+ + R + G F ++ 

Sbjct: 247 AEKKWSLKNGWTOTATCWSGRVYPGNKKF 306 

Query: 180 LGND^YTANTGIVLDYSRRLNEKWRSTQSFQYGRKQYH^ SVSGTFSYY 239 

GN+ Y+ G L ++R KW++ S ++GR + R ++ + +S + +Y 

Sbjct: 307 YGNDAYSYTNGARLYFNRWQTPKWQTLSSAEWGRLK NTRRARSDNTHLQISNSLVFY 363 

Query: 240 AMSAWQLYGG I SGMFD- NTVEKEQ ASRRYGV SLGTVKILDG-GLGLKLGAGYTKRIFKAP 297

+ GG+ + N ++ RYG+ + G GL L G XR ++ P 

Sbjct: 364 RNARQYVMGGLDFYRERNPADRGDNFNRYGLRF 423 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 81 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 161> which encodes amino acid sequence 

<SEQ ID 162; NGS81>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.25
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.05 threshold: 0.0
PERIPHERAL Likelihood = 7.05
modified ALOM score: -1.91
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.232 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

gi|10803654|ref|NP_046052.1| putative ISH4 transposase [Halobacterium sp. NRC-1]
gi|7484109|pir||T08324 probable transposase H1306 - Halobacterium sp. (strain
NRC-1)
insertion sequence ISH4 plasmid pNRC100
gi|2822385|gb|AAC82891.1| (AF016485) putative ISH4 transposase [Halobacterium
sp. NRC-1]
gi|10580476|gb|AAG19350.1| (AE005029) Vng0918h [Halobacterium sp. NRC-1]
Length = 294

Score = 52.1 bits (124), Expect = 4e-06

Identities = 36/139 (25%), Positives = 63/139 (44%), Gaps = 7/139 (5%)

Query: 31 CPHCQSTHFVKNGKDCGNQRFLCRIX^KKSFV^ 89

CP C++ ++ G QR+LC+DC ++F +QTGT+ ++ + W ++ I 

Sbjct: 28 CPSCRAESVIRYGSYRVFQRYLCKDCDRTFNDQTC 87 

Query: 90 PLRKCAEICKINLATAFTWRHKILDALQ^ 149

+R+ ++ T + + L AL L+G V+ DE Y KG ++

Sbjct: 88 SIRQLDAE IDVS YKTVYRRVQRFLRALD- - APRPHLEGPVE IDEFYVKAGLKGRERD 142 

Query: 150 PRPAHKRGTRATKRGISKE 168 
+P+ RG RG E 

Sbjct: 143 -QP SRSRGL STRGRGTYAE 160

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 82 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 163> which encodes amino acid sequence
<SEQ ID 164; NGS82>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.66
Possible cleavage site: 57
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -0.85 threshold: 0.0
INTEGRAL Likelihood = -0.85 Transmembrane 76 - 92 ( 76 - 92)
PERIPHERAL Likelihood = 1.75
modified ALOM score: 0.67
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

----- Final Results -----
bacterial inner membrane Certainty= 0.134 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>gi|586070|sp|Q07605|T4BA_BACCO RESTRICTION ENZYME BGCI ALPHA SUBUNIT [INCLUDES:
ADENINE-SPECIFIC METHYLTRANSFERASE ACTIVITY]
gi|1075788|pir||A53125 restriction enzyme BcgI alpha chain - Bacillus coagulans
gi|304140|gb|AAA16626.1| (L17341) restriction endonuclease alpha subunit
[Bacillus coagulans]
Length = 637

Score = 91.4 bits (226), Expect = 1e-17

Identities = 78/256 (30%), Positives = 123/256 (47%), Gaps = 42/256 (16%)

Query: 1 MFALAASNMILRGTCKANLHQSSCFMTDFQDLIKNPKPETC 60 
+F +A +NMILRGDGK+NL + +C F+IN G+ N +NPPY+Q+K+D 
Sbjct: 394 LFTIATTNMILRGDGKSNLIRDNCLT -- FDNTIMN GYGI NKILMNPFYSQAKND 445

Query: 61 AELH — ELYFVKEMLDMLAEGGTG IAI I PVSCVIAPSK AKSEIVKYHRLKAVMSMP 114 

H EL F+++ L+ML GG AI+P S ++ ++ K +I+K H L+ V+++ 

Sbjct: 446 QTQHLSELSFIQQALEMLVVGGKLCAX^QSTMVGKNRHTO 505 

Query: 115 SELFYPVGTVTCrVWEAHKPHFQTVVIDPDTQEE I STKKACRKTWFGYWRDDGFEKTKH 174 

+ F+ VG CIV+F+A H + ++ F + DDG KH 

Sbjct: 506 KDTFHGVGVNPCIVIFKAGIKHPEN KRVSFVNFEDDGHWRKH 548 

Query: 175 LGRIDLYDRWQGIKARWLEHYL NNEVHTGESVTAFVTDNDEWVAEAYLETDYSKI 229

+G+ G+ EH L ++TV+D DEW+ Y D 

Sbjct: 549 VGLVG IXSTEKGKREHLLAVIiAGDEDDGTDL IVKTAIKDTDEWLHSFYYFND-GI P 602 

Query: 230 TRADFEQWREFALFQ 245 
+ DF + V + FQ

Sbjct: 603 SEDDFYKTVANYLTFQ 618 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 83

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 165> which encodes amino acid sequence 

<SEQ ID 166; NGS83>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.04
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.44 threshold: 0.0
INTEGRAL Likelihood = -1.44 Transmembrane 55 - 71 ( 55 - 71)
PERIPHERAL Likelihood = 4.03
modified ALOM score: 0.79
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

----- Final Results -----
bacterial inner membrane Certainty= 0.157 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>gi|586071|sp|Q07606|T4BB_BACCO RESTRICTION ENZYME BGCI BETA SUBUNIT
gi|1075789|pir||B53125 restriction enzyme BcgI beta chain - Bacillus coagulans
gi|304141|gb|AAA16627.1| (L17341) restriction endonuclease beta subunit
[Bacillus coagulans]

Length = 341

Score = 44.0 bits (103), Expect = 0.002

Identities = 46/195 (23%), Positives = 79/195 (39%), Gaps = 23/195 (11%)

Query: 4 LQE IFDVSYGSKLDLNKMSSFNPT INFVGRSGKNNGVTASVDIiLKNTKP YPAGIiLTVALG 63 
+ ++FDV G +D NK ++ R NG +D K K Y L + +G 

Sbjct: 12 ISDLFDWIGOTIDGNKAQRNENGTPYITRKATRNGFE 70

Query: 64 GSVLSTFLQNKPFYTAQNVAVLNPKTEMTEQQKLFYCAAIFANAYRFSACG 122 

F+Q F+T V + PK ++ L Y + NA + + NT L+ 

Sbjct: 71 NETSKPFVQEFHFFTGTKVNICIPKLDLNRM 129 






Query: 123 QL — FVPSLDEIPSW VESVNLNPSAGVTEPKLKESLDLFVVRQSKR 166 

L +P E P W ++ ++ + GV++ + + L + 
Sbjct: 130 SLKILLPIKGEEPDWDYMNTYI SKILSNMEKNFDVQQNDGVSDLRSLKDLSW SQFK 185 



Query: 167 LDEIFTIQNGIAATK 181

+DEIF+I +G+ TK 
Sbjct: 186 MDEIFS INSGVRLTK 200 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 84 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 167> which encodes amino acid sequence
<SEQ ID 168; NGS84>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): 3.15
Possible cleavage site: 33
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.22 threshold: 0.0
PERIPHERAL Likelihood = 1.22
modified ALOM score: -0.74
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.072 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

gi|2495432|sp|P55409|Y4DJ_RHISN HYPOTHETICAL TRANSCRIPTIONAL REGULATOR Y4DJ
gi|7465604|pir||T02773 y4dJ protein - Rhizobium sp. plasmid pNGR234a
gi|2182353|gb|AAB91639.1| (AE000069) Y4dJ [Rhizobium sp. NGR234]

Length = 77

Score = 44.4 bits (104), Expect = 7e-04
Identities = 25/61 (40%), Positives = 36/61 (58%)



Query: 92 KAGGETFVSLRMKKGFTQSELATAAGLPQPYLSRIENSKQSLQDKWQKIiANALGVSP^ 151 

K G F LR +KG TQ E+ +G Q YLS +E +++ T+ +LA ALGVS +E 
Sbjct: 5 KLVGSNFARLRREKGLTQEEVEARSGFSQQYLS SLERGRRNPTVTTLYELAQALGVSHVE 64 

Query: 152 V 152 
+ 

Sbjct: 65 L 65 



Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 85 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 169> which encodes amino acid sequence
<SEQ ID 170; NGS85>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -6.09
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.92 threshold: 0.0
PERIPHERAL Likelihood = 2.92
modified ALOM score: -1.08
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.480 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
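
The "Final Results" lines can be tabulated to pick out the highest-certainty compartment for each protein. The Python sketch below shows one way this might be done; the function name `parse_final_results` and the regular expression are assumptions made for illustration, and the sample text uses the certainty values reported for NGS79 in Example 79.

```python
import re

# Hypothetical pattern for lines such as
# "bacterial cytoplasm Certainty= 0.480 (Affirmative) < succ>".
RESULT_RE = re.compile(
    r"^bacterial\s+(.+?)\s+(?:-+\s+)?Certainty\s*=\s*([\d.]+)\s*"
    r"\((Affirmative|Not Clear)\)",
    re.MULTILINE,
)


def parse_final_results(text):
    """Return (compartment, certainty, verdict) tuples, highest certainty first."""
    results = [
        (site, float(certainty), verdict)
        for site, certainty, verdict in RESULT_RE.findall(text)
    ]
    # sorted() is stable, so equal certainties keep their reported order.
    return sorted(results, key=lambda item: item[1], reverse=True)


# Certainty values reported for NGS79 (Example 79).
ngs79 = """bacterial periplasmic space Certainty= 0.932 (Affirmative) < succ>
bacterial outer membrane Certainty= 0.240 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>"""

ranked = parse_final_results(ngs79)
print(ranked[0])
```

Sorting by certainty rather than relying on line order keeps the helper usable when the compartments are listed in a different sequence, as they are across these examples.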

Example 86

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 171> which encodes amino acid sequence 
<SEQ ID 172; NGS86>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -2.92
Possible cleavage site: 21
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -2.76 threshold: 0.0
INTEGRAL Likelihood = -2.76 Transmembrane 179 - 195 ( 179 - 195)
PERIPHERAL Likelihood = 2.17
modified ALOM score: 1.05
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

----- Final Results -----
bacterial inner membrane Certainty= 0.210 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>sp|Q05205|PPB_LYSEN ALKALINE PHOSPHATASE PRECURSOR (APASE)
pir||A42467 alkaline phosphatase (EC 3.1.3.1) phoA precursor - Lysobacter
enzymogenes
emb|CAA39978.1| (X56656) alkaline phosphatase [Lysobacter enzymogenes]
Length = 539

Score = 37.5 bits (86), Expect = 0.40
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 8/82 (9%)

Query: 189 VALGLQAYWDVAGANNGAT^ 248 

V GL A W+V+ A + AQV +++ R+ GT D + G+A A AS S 

Sbjct: 458 VLRGLMA-WNVSSA AGKTLTGAQVKLQV SDRST-GTYDLYFAGAAWTEANASYS 509 

Query: 249 GIEAGKKVTAVIPAVRGAVAYA 270 

G+ G K+ +V+P+ GA + A 
Sbjct: 510 GVSLGSKIGSWPSATGAQSIA 531 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 87 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 173> which encodes amino acid sequence 
<SEQ ID 174; NGS87>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): 0.18
Possible cleavage site: 35
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.70 threshold: 0.0
PERIPHERAL Likelihood = 1.70
modified ALOM score: -0.84
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.138 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>gi|12514207|gb|AAG55499.1|AE005289_17 (AE005289) unknown protein encoded by
cryptic prophage CP-933M
[Escherichia coli O157:H7]
gi|12514720|gb|AAG55907.1|AE005324_10 (AE005324) unknown protein encoded by
prophage CP-933N
[Escherichia coli O157:H7]

Length = 108

Score = 30.9 bits (69), Expect = 9.1

Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 1 MMWSLEEFKQRIGVEHDRRDDFF^ 55
MA ++LEE K + V+HD 0D + + +A AYI S D V R 1X3 
Sbjct: 1 MTALLTIiEEIKAHLRVDHDADDDMLMDKVRQATAVLLAYI RDKVIREDG 52 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 88 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 175> which encodes amino acid sequence 
<SEQ ID 176; NGS88>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.69
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.05 threshold: 0.0
PERIPHERAL Likelihood = 6.05
modified ALOM score: -1.71
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.227 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 89

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 177> which encodes amino acid sequence
<SEQ ID 178; NGS89>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -4.77
Possible cleavage site: 26
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.38 threshold: 0.0
PERIPHERAL Likelihood = 1.38
modified ALOM score: -0.78
Rule: cytoplasmic protein

*** Reasoning Step: 2

----- Final Results -----
bacterial cytoplasm Certainty= 0.284 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>gi|9634142|ref|NP_037684.1| gp24 [Enterobacteria phage HK022]
gi|6863134|gb|AAF30375.1|AF069308_23 (AF069308) gp24 [Enterobacteria phage
HK022]

Length = 1183

Score = 44.9 bits (102), Expect = 0.006

Identities = 38/127 (29%), Positives = 64/127 (49%), Gaps = 11/127 (8%)

Query: 851 NKALRDKINLIIXSNGAGSVNERV^ I NG- LEAQYTVK -- 904

NKA + +N + + ++ + +T +GN +A+ T+A++I NG L A Y +K 
Sbjct: 989 NKAS INSLNQTF SDYQQATATQINGITATVNG^SAITTNAQAIANVNGDLSAMYN31KVG 1048 

Query: 905 VDANGK — VAGFGLATTPKNGTPE SKF IVNADRFGI - GAAGKADVFPFVVDTQKNRVGIN 961 
V +NG+ AG G+ +S+ I ADRF + AAG + PFV+ Q + I

Sbjct: 1049 VS SNGQYYAAGMGIGVEOTPSGMQSQVIFIiADRFAVTTAAGNSVALPF\^ — QNGQTFIR 1106 

Query: 962 GELWNG 968

+ +G

Sbjct: 1107 ASFIQDG 1113

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 90 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 179> which encodes amino acid sequence
<SEQ ID 180; NGS90>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -2.82
Possible cleavage site: 24
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 4 value: -9.66 threshold: 0.0
INTEGRAL Likelihood = -9.66 Transmembrane 321 - 337 ( 317 - 349)
INTEGRAL Likelihood = -6.48 Transmembrane 351 - 367 ( 340 - 371)
INTEGRAL Likelihood = -5.73 Transmembrane 907 - 923 ( 903 - 926)
INTEGRAL Likelihood = -0.00 Transmembrane 430 - 446 ( 430 - 446)
PERIPHERAL Likelihood = 2.17
modified ALOM score: 2.43
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

----- Final Results -----
bacterial inner membrane Certainty= 0.486 (Affirmative) < succ>



The protein has homology with the following sequences in the databases:

>gi|12514839|gb|AAG56002.1|AE005332_9 (AE005332) putative tail component of
prophage CP-933X [Escherichia
coli O157:H7]
Length = 1026

Score = 111 bits (279), Expect = 3e-23

Identities = 78/274 (28%), Positives = 146/274 (52%), Gaps = 10/274 (3%)

Query: 69 AAGNQAQQASEKVRAEVGKIGSGLSGLTKIJ^ 128 

AA + ++A ++ +++ +1 + G+T AG A ++ AD S+N++++Q 

Sbjct: 45 AAAREQRRALAELHSQLTEIRASAVGMTGAFAG AFATGHLI SLADEWS SVNARLKQA 101 

Query: 129 TSSBTEYLAVQQQI^DTANR^^ 188 

+ S E+ + Q+ L+D + RT + A L+ ++ ++++YGY+ ++LK TEA + + 
Sbjct: 102 SQSSDEFAS SQKVLMDI SQRTGTAF SDNAALFARSAASMREYGYSADDVLKVTEAI STGL 161 



Query: 189 TIGGVGAQQQAAALMQLSQALGSGVLQGDEFKS I SEAAPILLOTIAEYMGKSRDEIKKLG 248

I G + + + Q SQAL GVL+G+EF S++E+ ++ +A MG +R ++K + 
Sbjct: 162 KI SGAS TAEAG SVITQF SQALAQGVLRGEEFNSVNESGDR 221 

Query: 249 SEGKLTADVIFKAISGASEKFGEQAAKMPV^ 308 
+GKLTAD + A+ ++ A MP T+ ++T N + + V G +

Sbjct: 222 DDGKLTADKWPAL I SQLGILRDEYAAMPEWS SS ITKVENAFMAWV GGANEA 274 

Query: 309 AAVTKL IADNLNLVVPIVAGFAVAVAAAVAPTIiA 342 
+ VK++LNV + AAVAVA+A 
Sbjct: 275 SGVTKTLSGMLNGVAGQIDNVATAVGALVAVGVA 308

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
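
The summary lines in these database reports follow a fixed "field = count/length (percent)" pattern, so the counts can be pulled out mechanically. The following Python sketch is illustrative only and is not part of the analysis described in the examples. The names `parse_blast_summary` and `percent` are hypothetical, and rounding the percentage down is an assumption that is checked here only against the identity figure.

```python
import re

# Hypothetical helper: matches "Identities = 78/274", "Gaps = 10/274", etc.
_FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_blast_summary(line):
    """Return {field: (count, alignment_length)} for each field in the line."""
    return {name: (int(n), int(m)) for name, n, m in _FIELD.findall(line)}


def percent(count, length):
    """Whole-number percentage, rounded down (an assumed convention)."""
    return 100 * count // length


summary = parse_blast_summary(
    "Identities = 78/274 (28%), Positives = 146/274 (52%), Gaps = 10/274 (3%)"
)
identical, length = summary["Identities"]
print(identical, length, percent(identical, length))
```

Working from the raw counts rather than the printed percentages avoids depending on however the reporting program rounded them.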

Example 91 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 181> which encodes amino acid sequence

<SEQ ID 182; NGS91>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -0.63
Possible cleavage site: 36
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.16 threshold: 0.0
PERIPHERAL Likelihood = 0.16
modified ALOM score: -0.53
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.250 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 
(AF237934) putative integrase/recombinase
[Pasteurella multocida]
Length = 329

Score = 449 (206.9 bits), Expect = 4.4e-91, Sum P(2) = 4.4e-91

Identities = 93/196 (47%), Positives = 129/196 (65%)

Query: 56 IFADLIRRYLSEVTPSKRGAREESYRIGRAIjKTPLAKVRLADLRPQDF 115 
IF D+I RY +EV+ +K+GAR E R+ R L+ ++ + + DLR +DF +W RL EVS 
Sbjct: 55 IFRDVIERYQNEVSITKKGARNEI IRLNRFLRYDI SNLYIRDLRKEDFEEWIRIELTEVS 114
Query: 116
SV RBL T+S+V A+ +WG + +P+ I KFR S R R +EQ+I I
Sbjct: 115

Query: 176
++ P QRVA+A+LFAIETAMRAGEI +KW +V + +RI HLP TKNG SRDVPL
Sbjct: 175

Query: 236
S RA LI +++ +++
Sbjct: 235

Score = 248

Identities = 48/76 (63%), Positives = 57/76 (75%)

Query: 254 VFSLDAKSLEVLFRRARDNCXSIQGIiHFHOT 313 

VF +SL FR + CG++ LHFHDTRREALTRLSKKV V LAKISGHRDLRIL 
Sbjct: 254 WQTTFESLSTTFRVLKKECXSLEHLHFHDTRREALTI^ SGHRDLRILQ 313 



Query: 314 NVYYRPDMADIAKMLD 329

N YY P+M+++A +LD 
Sbjct: 314 NTYYAPNMSEVANLLD 329 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
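
Each localization report above ends with a "Final Results" line giving a predicted site, a certainty value and a call. As a hedged illustration (not part of the disclosed method), such a line could be read programmatically as sketched below. The function name `parse_final_result` and the tolerance for optional dashes before "Certainty" are assumptions made for this example.

```python
import re

# Hypothetical pattern for lines such as
# "bacterial cytoplasm Certainty= 0.250 (Affirmative) < succ>"
_RESULT = re.compile(
    r"^bacterial\s+(?P<site>.+?)\s+(?:-+\s+)?Certainty\s*=\s*"
    r"(?P<score>\d*\.\d+)\s*\((?P<call>[^)]+)\)"
)


def parse_final_result(line):
    """Return (site, certainty, call), or None if the line does not match."""
    m = _RESULT.match(line.strip())
    if m is None:
        return None
    return m.group("site"), float(m.group("score")), m.group("call")


print(parse_final_result(
    "bacterial cytoplasm Certainty= 0.250 (Affirmative) < succ>"
))
```

Returning `None` for non-matching lines lets a caller scan a whole report and keep only the result lines.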

Example 92 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 183> which encodes amino acid sequence
<SEQ ID 184; NGS92>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -7.85
Possible cleavage site: 25
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -8.33 threshold: 0.0
INTEGRAL Likelihood = -8.33 Transmembrane 6 - 22 ( 1 - 25)
PERIPHERAL Likelihood = 5.99
modified ALOM score: 2.17
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.433 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 



>gi|9632051|ref|NP_048840.1| A484L [Paramecium bursaria Chlorella virus 1]
gi|7461623|pir||T17986 hypothetical protein A484L - Chlorella virus PBCV-1
gi|1620155|gb|AAC96851.1| (U42580) A484L [Paramecium bursaria Chlorella virus 1]
Length = 155

Score = 31.6 bits (70), Expect = 3.5
Identities = 20/72 (27%), Positives = 36/72 (49%)
Frame = +1



Query: 52 LQINLKMLEKRIDFLVENIDKYYQQYGSYPNNFDFK 231

+ +NLKM I F +DKY +QY +Y N F T+ + + +4- + +1 N 
Sbjct: 23 IAVNLKMNGV S I PF VDKY SKQYPTYTKNALFHVTRFNNAYQKTFEYKNI S IDT INN 78 

Query: 232 CYFVKNDKDYTI 267 
+ +++D Y I

Sbjct: 79 LFSIRDDVLYNI 90 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 93 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 185> which encodes amino acid sequence

<SEQ ID 186; NGS93>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.08
Possible cleavage site: 14
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 0.79 threshold: 0.0
PERIPHERAL Likelihood = 0.79
modified ALOM score: -0.66
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.320 (Affirmative) < succ>






The protein has homology with the following sequences in the databases: 

>gb|AAG22017.1|AF288038_2 (AF288038) putative HsdR [Streptococcus thermophilus]
Length = 740

Score = 674 bits (1738), Expect = 0.0

Identities = 364/746 (48%), Positives = 489/746 (64%), Gaps = 40/746 (5%)



Query: 14 NENSRVKI PAVLHLMRLGYDYLSLKNANW DRQTNIFPEIFVDSLCRINPDLPPDDAR 70 

+E +RV+IPA HLMRLGY YL D +TNI IF + + N D

Sbjct: 8 SELTRVQIPAAFHLMRLGYTYLPHNGKE 67 

Query: 71 RLLADIRLELDNEDLGQKFYERLTNQSGGKKLIDPQNTO 130 
R L +I++ELD DLG+ FY R+ + SG +D++N +NN+FH+ E+ C NG + FRP 
Sbjct: 68 RELNNIKIELDQNDLGRAFYNRTVSDSG- PTYVTJWENPEN^ 126

Query: 131 DI ALLVNGMPLVF IEVKKPN NKGGIGEERERMGKRAKNPKFRRFINITQFMIFSNN 186 

DI + +NG+PL +IEVK+PN K I E+ R R +N +FRRF NITQ + FS+N 

Sbjct: 127 DIVTFINGLPLSYIEVKQPNAIRTCKTAIQSEQSRTAVl^FENRRFKRFNNITQLISFSDN 186 



Query: 187 MEYDDGATEPAQGAFYASSACGKPVFNYFREEHKXXXXXXXXXXXXXXH 246 

+ Y G + QG+FY S+A K FN F+EE + VL+D N 

Sbjct: 187 LPYISGQGQQKQGSFYCSNAFSKTKFNAFKEEREEELIY^ 246 



Query: 247 IKHSPEF I SNKS PDTPTNRILT SLLCRERL SFLLQHGLTYVK- - ASQGLVQ — KHIMRYP 302

+K PEF +N+ P TP N ++SL ++RL FLL++GL YV+ + G +Q KH+MRYP 
Sbjct: 247 LKSQPEFKTNQDPSTPOJTFISSLYQKKRLLFLLRYGLAYVEEHSI^^ 306 

Query: 303 QLFATIjAIEKHLANGGKKGVIWHTQGSGKTALAYY 362 
Q FAT AIE + G +KGVIWHTQGSGKTAL+Y+N RYLT+Y+ +KQGIVP+FYF -f-VDRL

Sbjct: 307 QFFATKAIEDAIGKGVRKGVT^raTQGSGKT^ 366 

Query: 363 DLLKQAQREFTARDLWHT IDSEEAFAADIKSAQTLHNHAGKAE ITWNIQKFQDDPDW 422 
DL QA REFT R L V I+S Q L+ ++ WNIQKF+Df D+ 

Sbjct: 367 DLADQATREFTKRGLKVKRINS — - PQELNEKHDAYQVAWNIQKFKDNSDLT 416
Query: 423 ARNDYDLAIQRVYI^DEVHRSYl^ GVTA- 476

+ YDL Q +YF+DE HRSYN KGS+L NL +D NA+KI LTGTPLI G T 

Sbjct: 417 DHSGYDLNRQNIYFIDEAHRSYiraKGSYLPNLY^^ 476 

Query: 477 GNVNTOELFGDYIHKYYYNAS IAIXSYTLRLIREEIGSRYKAQLQEAIiAQLEIEKGSFDRK 536

+ TR++FGDYIHKYYYN SI DG+TLRL+RB+I + YK LQ EI +G + 

Sbjct: 477 SHATTRDIFGDYIHKYYYNQSIDDGFTXiRLMREDIETSYKETIjQTI — NEEILRGDLSKD 534 

Query: 537 E IYAHPHFVHPMLDYILDDFAKFRKTN-QDE SLGAMWCDS AEQARQL FEHFQTASD 592 

+I+AHP +V FMLD+IL+DF + R D+S+G M+VCDS++QAR++ E ++ +

Sbjct: 535 DIFAHPRYVS PMLDFILEDFNRARDWFDDDS IGGMIVCDS SKQAREIEKQIjEERRSRGE 594 

Query: 593 HNFTAALILHDVGTKEERDQWVl^ 652 
N T+ALILHD G KE + V+ ++ GKID++ VY+MLLTGFEAPRLK+LYLGR IKAH 
Sbjct: 595 TNITSALILHDEGDKEYKKDRVESYREGKIDLV^ 654

Query: 653 mJ^TLTRVNRTYKSYRYGYVVDFADIEREFDKTNRAYWDEL^ -LGDEIGS-YS 706 

NLLQTLTRVNR YK Y++GYV +DFADI +EFDKTNRAY +EL+ E G+++ + + 

Sbjct: 655 NLLQTLTRVNRPYKDYQFGYVTDFADI SKEFDKTl^YLEEIiNQEYDPKOTGEDVENVFG 714 






Query: 707 QLFKTAEE IEQEIADIKNALFDFDTE 732 

LF +A+EI +++ + L ++ TE 
Sbjct: 715 SLFVSADEI SKQLEKSET ILMNYPTE 740 



Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 



Example 94 






A DNA sequence was identified in N.gonorrhoeae <SEQ ID 187> which encodes amino acid sequence

<SEQ ID 188; NGS94>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.19
Possible cleavage site: 35
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.73 threshold: 0.0
PERIPHERAL Likelihood = 5.73
modified ALOM score: -1.65
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.302 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>gi|10717100|gb|AAG22014.1|AF288037_3 (AF288037) putative HsdS [Streptococcus
thermophilus]
Length = 402

Score = 154 bits (389), Expect = 2e-36 

Identities = 123/348 (35%), Positives = 168/348 (47%), Gaps = 48/348 (13%) 

Query: 73 GKT AFVDILDDGEVAFGSTEF IVLRAKNET — NPEFLYYFAI SPDFRKRAIECMEGT SGR 130
GKT ++ DGE ++ IV +E+ +FLYYF + F G++ + 

Sbjct: 74 GKT VTFIX3EDSYFQDSNIWIENDESKVTNQFLYYFLQTNPFIT TNGSTIK 124 

Query: 131 QRVNENALKTLELP IPEPQIQQS IAAVLS ALDKKIALI^QINARIiEEMAKTLYDYWFVQF 190 
+ N+N T +P Q Q I +L LDKKI +N QIN LE MAKTL YDYWFVQF

Sbjct: 125 RLYNDNLRDTKI PNVPS IQQQNQITDILGTLDKKIQINNQINQELEAMAKTLYDYWFVQF 184 






Query: 191 DFPDANGKPYKSSGGDMVFDETLKREIPKGWGSIELQSCL AKIPNTTKXLNKDIKDF 247 

DFPD NGKPYKSSGG LKREIP+GWG+ +L S L + N K N++ K + 

Sbjct: 185 DFPDQNGKPYKSSGGKKnnTNPEL^ 244 


Query: 248 GKYPWD QSQDFICXSFTNDEKSILNPQl^IIFGDHTRIVKLVNFQYA 295 

G Y + +S F + S LNP +1+ + F 
Sbjct: 245 SIPEFDTTGTYSLERGESIKSNKFKVEKNDLLVSKLNPWFN^ 301 

Query: 296 RGADGTQVILSNNERMPNYLFYQI INQIDLSSY GYARHFK FLKEFKIIL 344

++ R YQ+ + Y G + K + F+I 

Sbjct: 302 rWKTFNRFEKOTLYQVATGKEF IEYCTRFATGTSNSHKRVS PDIMVGFQIPF 354 

Query: 345 PSKDISQKYNEIANTFFVKVRNNLKQNHHLTQIj^ 392 
* I QK+ EI ++ +V N +QN LTQLRD++LPMLMNGQV V

Sbjct: 355 EKTHI- QKFGEIIDSIRTQVLQNNEQNQELTO 401 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 95

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 189> which encodes amino acid sequence

<SEQ ID 190; NGS95>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.12
Possible cleavage site: 19
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 3 value: -10.51 threshold: 0.0
INTEGRAL Likelihood = -10.51 Transmembrane 112 - 128 ( 109 - 132)
INTEGRAL Likelihood = -4.46 Transmembrane 50 - 66 ( 46 - 70)
INTEGRAL Likelihood = -2.23 Transmembrane 7 - 23 ( 7 - 23)
PERIPHERAL Likelihood = 4.19
modified ALOM score: 2.60
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.520 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>pir||G69096 hypothetical protein MTH1717 - Methanobacterium thermoautotrophicum
(strain Delta H)
gb|AAB86189.1| (AE000928) unknown [Methanothermobacter thermautotrophicus]
Length = 557

Score = 35.4 bits (80), Expect = 0.50

Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 5/80 (6%)

Query: 52 L LFYFL I PF I ATATVLWL SKYLGKDEFKQGEVKELEYVNDNFL S I PDNN 111 

L+F+F+ P + TATVL + K + +4- P++ EV L + +PS++ ++ IP++ 

Sbjct: 92 LVFFFI SPLLGTATVLVIYK-VARETFEREEVALLSAFLFSM7PSFVAR — TSVFIPESM 148






Query: 112 LFLLFVMYGI IFLLVSCSKS 131 

LL GI+++LV K+ 
Sbjct: 149 GLLL— TSGIL7MLVKYLKT 166 
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 96 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 191> which encodes amino acid sequence 

<SEQ ID 192; NGS96>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -7.76
Possible cleavage site: 28
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.15 threshold: 0.0
PERIPHERAL Likelihood = 6.15
modified ALOM score: -1.73
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.362 (Affirmative) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 97 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 193> which encodes amino acid sequence 

<SEQ ID 194; NGS97>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.48
Possible cleavage site: 13
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.86 threshold: 0.0
PERIPHERAL Likelihood = 8.86
modified ALOM score: -2.27
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.127 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>ref|NP_052265.1| P2 J homolog; baseplate or base of tail fibre [Enterobacteria phage 186]
gb|AAC34162.1| (U32222) P2 J homolog; baseplate or base of tail fibre
[Enterobacteria phage 186]
Length = 302

Score = 112 bits (280), Expect = 3e-24

Identities = 65/151 (43%), Positives = 85/151 (56%), Gaps = 1/151 (0%)

Query: 1 MGNSRLSQLPAPAAIEETDFEGIFARKKAALTALCPESIREWAQTIiEl>ESEPLTIDLQQ 60

M LS LP P +EE DFE I A + A L +L PB +B VA+TL LESBP+ LQ+ 
Sbjct: 1 MATVDLSLLFVPDWEELDFETILAER^ 60 

Query: 61 QAYQELLVRNRINEAVKANLLAYAQGSDLDHIAAQYGLSMCT 120 
AY+E++ R R+NEA +A +LAYA+ SDLD++ A + + R +R EE

Sbjct: 61 NAYREVTWRQRVNEAARAGMLAYARDSDI^ 120 

Query: 121 DDAFRARV-QAHPEKYAAGPRTAYEAHAIDA 150 
D FRR+QA AG AYE H A 

Sbjct: 121 DADFRLRIQQAFEGMSVAGSTGAYEFHGRSA 151

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 98 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 195> which encodes amino acid sequence

<SEQ ID 196; NGS98>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.68
Possible cleavage site: 33
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.61 threshold: 0.0
PERIPHERAL Likelihood = 4.61
modified ALOM score: -1.42
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.182 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 99 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 197> which encodes amino acid sequence 

<SEQ ID 198; NGS99>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -4.87
Possible cleavage site: 19
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.93 threshold: 0.0
PERIPHERAL Likelihood = 4.93
modified ALOM score: -1.49
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.189 (Affirmative) < succ>

The protein has homology with the following sequences in the databases:

>gi|10172952|dbj|BAB04058.1| (AP001508) BH0339~unknown conserved protein in
others [Bacillus halodurans]
Length = 283

Score = 83.7 bits (206), Expect = 1e-15

Identities = 59/156 (37%), Positives = 87/156 (54%), Gaps = 8/156 (5%)

Query: 10 VRGFVQLAFAQSIDPWPPEVSITRMAVTNEKD^ 69 

VRGPV + A SIDPI IT+ + D TMG K+ V + VY G 1+ 

Sbjct: 129 VRGPVS IHTATS IDPIDIVSTQITKSVN SVTGDKRS SOTMGMKHRVDPGVYVFKGS INTQ 188 

Query: 70 LAAKTGFSDDDLAKLWQALTLMFEHDRSAAR — GEMAARKLWFKHDSALGSQPAHKLFD 127

LA KTGF+++D K+ +AL +FE+D S+AR G M K+ ++H S LG + K+ 
Sbjct: 189 LAEKTGFTNEDAEKIKRALITLFENDSSSARPDGSME 248 

Query: 128 AVKVERVNGESGTPASGFGDYKI SWSDGLNGVSVE 163 
4-+K+E H TP S F BY + + IH-G+ VE

Sbjct: 249 SLKIE SKTDTPKS -FDDYAVELYE — LDGLGVB 278 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 100

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 199> which encodes amino acid sequence

<SEQ ID 200; NGS100>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.03
Possible cleavage site: 18
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.63 threshold: 0.0
PERIPHERAL Likelihood = 6.63
modified ALOM score: -1.83
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.185 (Affirmative) < succ>



The protein has homology with the following sequences in the databases:

>gi|1175791|sp|P44189|YE18_HAEIN HYPOTHETICAL PROTEIN HI1418
gi|1074769|pir||A64029 hypothetical protein HI1418 - Haemophilus influenzae
(strain Rd KW20)
gi|1574254|gb|AAC23068.1| (U32821) H. influenzae predicted coding region HI1418
[Haemophilus influenzae Rd]
Length = 201

Score = 144 bits (364), Expect = 1e-33
Identities = 71/109 (65%), Positives = 79/109 (72%)

Query: 8 NFQQNSVRWADNKGELWFLANDVCEI 67 

NF+ VR + D KGE WF DVC ILGYTN R+ + HOC GVTKRYTPT S +QEM 
Sbjct: 24 NFKDLFVRVILDPKGEFWFCGTW &3 

Query: 68 TYINEPI^YRLXIKSRKPAAEAFEEWVMETvXPAIR 116 

T+ INEPNLYRLI IKSRKP AE FE WV E VLP IRKTG Q+ P+ A 
Sbjct: 84 TF INEPNLYRLI IKSRKPEAEPFEAWFEEVLPQIRKTGKYQLQPQQIiA 132 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 101 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 201> which encodes amino acid sequence
<SEQ ID 202; NGS101>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -1.23
Possible cleavage site: 47
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.55 threshold: 0.0
PERIPHERAL Likelihood = 3.55
modified ALOM score: -1.21
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.126 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>gi|9632520|ref|NP_049514.1| hypothetical protein [Bacteriophage 933W]
gi|9633449|ref|NP_050552.1| hypothetical protein [Bacteriophage VT2-Sa]
gi|4585431|gb|AAD25459.1|AF125520_54 (AF125520) hypothetical protein
[Bacteriophage 933W]
gi|5881645|dbj|BAA84336.1| (AP000363) hypothetical protein [Bacteriophage VT2-Sa]
gi|7649882|dbj|BAA94160.1| (AP000422) hypothetical protein [Escherichia coli
O157:H7]
Length = 404

Score = 177 bits (449), Expect = 3e-43

Identities = 130/425 (30%), Positives = 204/425 (47%), Gaps = 27/425 (6%)

Query: 7 TAYGDPQAMMKQAAGLFAMHMQRNSTLNRIAGKMPAGT 65 

T QA LF +S+NL + AA + KQT+ PWR 

Sbjct: 2 TTVT SAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVS PDKKSTKQTSAGAPWRIT 61 

Query: 66 DLTRGMGDE I RFNLVNP VSAL P IMGDNTAEGRGVGMSL SEAGLRVNQARF PVDGGGTtfTN 125 

DL + GDE+ F++++ +S P MGD EGRG +S ++ L++NQ R VD GG M+ 
Sbjct: 62 DLNKQAGDEVTFS IMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQ 121 



Query: 126 QRS PADYRAL IRPAAQSLMDRYADQTLLVHMAGARGFHDNIEWGTO 185 

QR+ + + R + + DQ +VH+AGARG + 4-P A P+F +N 

Sbjct: 122 QRTKFNLASSARTIJ^TYFNDI^DQCAIVHLAGARG 181 

Query: 186 VKAPSKNRHFTASGDAVTGVGDNGGELKI^ 245 

V P+ +RHF GD +1 + D+F++ VD++ +D++ P V+ 

Sbjct: 182 VLPPTHDRHFFG GDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 
Query: 246 GDKAAGDS PLRVWLLSPAQYNRF AADPKFRQLQASAIARASQANQNPLFLGDAGLVW 302

GD+ G+ P V ++P Q+N + + Q+ A+ RA N +PLF G+ +W

Sbjct: 234 GDELHGEDPYYVLYWPRQWNDWYTSTSGKDWNQMMVRAVN^ 292 


Query: 303 GFIXjVKMP-RPIRFYMDEMKYCADKFSEM 361 

++ K PIRFY G ++ + + A +DR+++LG QA+ A 

Sbjct: 293 NILVRKYAGMPIRFYQGSKVLVSENNLTATTK EVAAATNIDRAMLLGAQALANA 346 

Query: 362 FANTGKHGGMPFFWSEKELDHGNKVETLVGTIRGV^ ITDYGVTWD 421

+ G+ G F EK+ DNRB + IG+KRF G ++ D+GV VD 
Sbjct: 347 Y GQKAGGHFNMVEKKTDMDNRTEIAI SWINGLKKIRFPEKSG KMQDHGVIAVD 399 

Query: 422 TWPL 426 
T V L

Sbjct: 400 TAVKL 404 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 102

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 203> which encodes amino acid sequence 

<SEQ ID 204; NGS102>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -6.09
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.92 threshold: 0.0
PERIPHERAL Likelihood = 2.92
modified ALOM score: -1.08
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.480 (Affirmative) < succ>



The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 103 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 205> which encodes amino acid sequence 
<SEQ ID 206; NGS103>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -1.29
Possible cleavage site: 34
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -0.00 threshold: 0.0
INTEGRAL Likelihood = -0.00 Transmembrane 22 - 38 ( 22 - 38)
PERIPHERAL Likelihood = 4.88
modified ALOM score: 0.50
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.100 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>gi|11277848|pir||E81145 replicative DNA helicase NMB0885 [imported] - Neisseria
meningitidis (group B strain MD58)
gi|7226124|gb|AAF41296.1| (AE002441) replicative DNA helicase [Neisseria
meningitidis MC58]
Length = 468

Score = 233 bits (594), Expect = 5e-60

Identities = 158/456 (34%), Positives = 245/456 (53%), Gaps = 34/456 (7%)



Query: 15 SVGAEQNILGGILIEPTAIARCA-ILTPEKFYQAQHRIIFRALIjDMAAANEPIDIITIiN^ 73
S+ AEQ++LGG+++E A R A +++ E FY+ +HR+IFR++ + + P D+IT+ +
Sbjct: 23 SMEAEQSVLGGLI^ENPAWDRIACWSGEDFYRHEH^ IFRS IAKLINESRPADVTTVQE 82

Query: 74 KLEARGEAENAGGLAYL IDLNQNTPSAKNI SRYVGIVNDRFVERGLLKASAAIEKIAVSK 133
L+ EE AGG YLI L QNTPSA NIRY IV +R + R L + I + A +
Sbjct: 83 DLQRNEELEAAGGFEYL ITLAQNTPSAANIRRYAEIVRERSIMRQIiAEVGTEIARSAYNP 142

Query: 134 STVAEKLSKAADELAAVGKDAVKRETK^ FG 185
+ I> +A +++ + + K +K + DL+ + +R+D + 6
Sbjct: 143

Query: 186
+PTG + LD T GL G+LI++A RPSMGKT + NIA +G+ V S EM
Sbjct: 201

Query: 245 ELARRGMAAECNI PMQNLKTGNLTQSDYANM PIYVSQAKEWKFDVNCDLL 294
+L R + + + LKTG L + + P+Y+ +
Sbjct: 261

Query: 295
F K L L+V+D+L +M +GR + +ELG ISR LK LA EL P+
Sbjct: 321 ^ARQFNNK LGLIVTDYLQLMAGSGRSDNRASELGElSRSLKAIiAKELQVPI 373

Query: 353 VLVAQLNRGNTKQADKRPNMADIRGSGAIEQDANI I IMPHRESYYDGNENP — SIAELXI 410
+ ++QL+R + DKRP M+D+R SGAIEQDA++I+ +R+ YY+ ++P +AB II
Sbjct: 374

Query: 411
K+R+G -K3 + W GQF KF+ + +A
Sbjct: 433

>gi|11277846|pir||E81876 probable replicative DNA helicase (EC 3.6.1.-) NMA1105
[imported] - Neisseria meningitidis (group A strain Z2491)
gi|7379799|emb|CAB84367.1| (AL162755) putative replicative DNA helicase
[Neisseria meningitidis Z2491]
Length = 468

Score = 230 bits (588), Expect = 2e-59
Identities = 158/456 (34%), Positives = 244/456 (52%), Gaps = 34/456 (7%)

Query: 15 SVGAEQNILGGILIEPTAIARCA- ILTPEKFYQAQHRI IFRALLDMAAANEPIDI ITLND 73 

S+ AEQ++LGG+++E A R A +++ E FY+ +HR+IFR++ + + P D+IT+ + 
Sbjct: 23 SMEAEQSVLGGLMLENPAVTORIADVVSGEDFYRHEHRLIFR 82 



Query: 74 KLEARGEAENAGGLAYL IDLNQNTPSAKNI SRYVGrVNDRFVERGLLKASAAIEKIAVSK 133 
L+ EE AGG YLI L QNTPSA NIRY IV+R + RL+ I + A + 
Sbjct: 83 DLQRNEELEAAGGFEYLITIAQOTPSAANIRRyAElVRBRS IMRQLAEVGTEIARSAYNP 142

Query: 134 IXSGTVAEKLSKAADELAAVGKIIAVKRETK^ FG 185

G + L +A +++ + + K +K + DL+ + +R+D + G 

Sbjct: 143 QGRDAGQLLDEAENKVFQIAESTAK — SKQGFLEMPDLLKEVVQRIDMLYSRDNPDEVTG 200 

Query: 186 LPTGLMKLIX*MTGGLPIX3NLIVIAARPSMGKTv^ 244 

+ TG + LD T GL G+LI++A RPSMGKT + NIA +GK V S EM 

Sbjct: 201 VSTGFIDLDKKTSGLQPGDLI IVAGRPS^^TAFSINIAEHVAVEGKLPVAVFSMBMGGA 260 

Query: 245 ELARRGMAAECNI PMQNLKTGNLTQSDYANM PIYVSQAKEWKFDVNCDLL 294

+L R + + + LKTG L + + + 
Sbjct: 261 QLVMRMLGSVGRIiDQ SVLKTGRLEDEHWGRLNEAVVKLSDAFvYIDET PGLTALELRARA 320 

Query: 295 NVDELCFLAKEKKLTTGIiDLLVVDHLH — - VAEIjGNI SRRLKNLAAELNTPV 352

y K I* L+V+D+L +M +GR + +ELG ISR LK LA EL P+ 

Sbjct: 321 RRLARQFNNK LGLIVIDYLQLMAGSGRSDNRASELGEI SRSLKALAKELQVPI 373 

Query: 353 VLVAQLNRGNTKQADKRPNMADIRGSGAIEQDANI I IMPHRESYYDGNENP SIAELI I 410

+ ++QL+R + DKRP M+D+R SGAIEQDA++I+ +R+ YY+ ++P +AB II 
Sbjct: 374 ^SQLSROVESRTDKRPMMSDLRESGAIEQDAD^ 432 

Query: 411 AKNRIX3EMGTWCGWKGQFMKFEEEPDLAWQAPKHD 446 

K+R+G +G + W GQF KF+ + +A D 
Sbjct: 433 GKHRNGPVGKIFLTWTGQFTKFDNAAYIPEEAKIED 468 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 104

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 207> which encodes amino acid sequence 

<SEQ ID 208; NGS104>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -2.11
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.04 threshold: 0.0
PERIPHERAL Likelihood = 5.04
modified ALOM score: -1.51
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.220 (Affirmative) < succ>

The protein has homology with the following sequences in the databases: 

>gi|7515458|pir||T13296 hypothetical protein 8 - Streptococcus phage phi-O1205
gi|2444088|gb|AAC79524.1| (U88974) ORF8 [Streptococcus thermophilus temperate
bacteriophage O1205]
Length = 157

Score = 62.1 bits (150), Expect = 3e-09

Identities = 53/161 (32%), Positives = 86/161 (52%), Gaps = 8/161 (4%)

Query: 5 TLYRCAADVQAGLDYYFDSETEREDTLEAV- - IGQFEVKAQSVIAYIKNQEITEKMLEGH 62 

TLY + + D ET + DTLEA+ +E K + + IK+ E + +

Sbjct: 3 TLYELTDQLLEIYNMEWDDET-KL^ 61 
Query: 63 IRQMTGKLKAAKARNQSLKDYLARNMQAAGITBIKA^ 122

+++ G K+ +++ LK LA +M GT+D FK FRKSEAW+ +E ++P 
Sbjct: 62 KKIUjIX5LNKSDQSKIDKLKTAIjAVSMAETGQTRV- -DTTI*FKVGFRKSEAVW-NEEKLP 118 


Query: 123 AEFMREAVKTEPDKTAIRKAIESGRQVAGAKIEGilKNIiQIR 163 

E+ K PDK +++ ++SG+ + GA +E R+NL IR 

Sbjct: 119 KEYQIATYK- - PDKKTLKELLKSGKHIEGATIiEERRNLNIR 157 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 105 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 209> which encodes amino acid sequence 

<SEQ ID 210; NGS105>. Analysis of this protein sequence reveals the following:

Signal Score (-7.5): -5.52
Possible cleavage site: 31
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.60 threshold: 0.0
PERIPHERAL Likelihood = 2.60
modified ALOM score: -1.02
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.135 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 106

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 211> which encodes amino acid sequence

<SEQ ID 212; NGS106>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): 4.8
Possible cleavage site: 26
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 27
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.80 threshold: 0.0
PERIPHERAL Likelihood = 7.80
modified ALOM score: -2.06
Score for OM-PP discrimination: 4.38
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 4.38
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Outer membrane? Score: 0.437687
Outer membrane? Score: 0.437687

Final Results

bacterial outer membrane Certainty= 0.768 (Affirmative) < succ>

The protein has no homology with sequences in the databases, although it is similar to HMW1 from
Haemophilus influenzae.

The protein was expressed in E.coli as an insoluble 43.56kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
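
In the reports reproduced in these examples, the "Rule:" line tracks two earlier fields: whether a cleavable N-terminal signal sequence was found, and how many transmembrane segments ALOM counted. The Python sketch below only summarizes that observed pattern. It is an assumption made for illustration, not the decision procedure of the prediction program itself; the function name `localization_rule` is hypothetical, and the case of a cleavable signal together with one or more transmembrane segments does not occur in these examples, so its handling here is a guess.

```python
def localization_rule(cleavable_signal, tm_count):
    """Map (cleavable signal?, ALOM segment count) to the observed Rule text."""
    if tm_count > 0:
        # Seen in Examples 90, 92, 95 and 103 (none had a cleavable signal).
        return "cytoplasmic membrane protein"
    if cleavable_signal:
        # Seen in Example 106.
        return "outer membrane or periplasmic protein"
    # Seen in the remaining examples (no cleavable signal, count: 0).
    return "cytoplasmic protein"


print(localization_rule(cleavable_signal=True, tm_count=0))
print(localization_rule(cleavable_signal=False, tm_count=4))
print(localization_rule(cleavable_signal=False, tm_count=0))
```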

Example 107 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 213> which encodes amino acid sequence

<SEQ ID 214; NGS107>. Analysis of this protein sequence reveals the following: 

Signal Score (-7.5): -3.83
Possible cleavage site: 51
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.61 threshold: 0.0
PERIPHERAL Likelihood = 4.61
modified ALOM score: -1.42
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.146 (Affirmative) < succ>

The protein has no homology with sequences in the databases. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
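
The numbering in these examples appears to follow a fixed pattern: Example n pairs an odd-numbered DNA sequence (SEQ ID 2n-1) with an even-numbered protein sequence (SEQ ID 2n) named NGSn. The Python sketch below encodes that observed pattern. The function name is hypothetical, and the rule is an assumption inferred from the examples rather than a definition stated in the text.

```python
def seq_ids_for_example(n: int) -> dict:
    """Return the SEQ ID numbers implied by the numbering pattern of these examples.

    Assumption (inferred, not stated in the text): Example n uses
    DNA SEQ ID 2n-1, protein SEQ ID 2n, and the protein name NGSn.
    """
    if n < 1:
        raise ValueError("example number must be positive")
    return {"dna_seq_id": 2 * n - 1, "protein_seq_id": 2 * n, "name": f"NGS{n}"}


# Example 107 is described as <SEQ ID 213> encoding <SEQ ID 214; NGS107>.
print(seq_ids_for_example(107))
```

Such a helper is only a cross-checking aid, for instance to spot an OCR-damaged SEQ ID in a heading.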

Example 108 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 215> which encodes amino acid sequence

<SEQ ID 216; NGS108>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-7.5): -6.14 
Possible cleavage site: 19 
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 8.43 threshold: 0.0 
PERIPHERAL Likelihood = 8.43

modified ALOM score: -2.19 
Rule: cytoplasmic protein 

*** Reasoning Step: 2 

Final Results

bacterial cytoplasm Certainty= 0.574 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||G81977 probable lipoprotein NMA0586 [imported] - Neisseria meningitidis 
(group A strain Z2491) 
emb|CAB83877.1| (AL162753) putative lipoprotein [Neisseria meningitidis Z2491]
Length = 280

Score = 52.9 bits (126), Expect = 5e-06

Identities = 43/134 (32%), Positives = 63/134 (46%), Gaps = 23/134 (17%) 

Query: 174 LGDIRGVATDEDKLPKAGSFQYEGRAFGGNGV^ I 233

+GDI G T DKLP+ G Y G AFG D+ +G YTIDF ++G G I 

Sbjct: 156 IGDIAGEHTSFDKLPEGGRATYRGTAFGS DDASGKLTYTIDFAAKQGHGKI 206 

Query: 234 EGMEQYGKIKLEEAAIERIPYRESGSSIjGLKDRVSYFGfVNEGVAMIjEKD 293 
E ++ ++ ++ AA + P ++ + + ++L B Y LGIF

Sbjct: 207 EHLKS-PELNVDIiAASDIKPDKKRHAVT SGSVLYNQAEKGSYSLGIF 252 

Query: 294 GEAANEVAGAVSQE 307 
G A EVAG+ E
Sbjct: 253 GGQAQEVAGSAEVE 266

>pir||D81032 hypothetical protein NMB1870 [imported] - Neisseria meningitidis 
(group B strain MC58)
gb|AAF42204.1| (AE002537) hypothetical protein [Neisseria meningitidis MC58]
Length = 320

Score = 50.6 bits (120), Expect = 3e-05

Identities = 50/168 (29%), Positives = 76/168 (44%), Gaps = 28/168 (16%) 

Query: 136 VYEQFYSVVRGYFGYSRKIXSNPIEGIX^NPEEIPFDLYI^ 195
VY+Q +S + + +D E G+ + F +GDI G T DKLP+ G Y 
Sbjct: 163 VYKQSHSALTAFQTEQIQDS EHSGKMVAKRQFR — ■ IGDIAGEHTSFDKLPEGGRATY 217 

Query: 196 EGRAFGGNGVLSKESLDI^GWRYTIDFDRRKGSGSIEGMQYGKIKLEEAAIERI^ 255 
G AFG D+ G YTIDF ++G+G IE ++ ++ ++ AA + P

Sbjct: 218 RGTAFGS DDAGGKLTYTIDFAAKQGNGKIEHLKS-PELNVDLAAADIKPDG 267 

Query: 256 ESGSSLGLKDRVSYFGVNEGVAMLEKDNEIKKYHLGIFGEA 303 
+ + + ++L E Y LGIFG A EVAG+ 

Sbjct: 268 KRHAVI SGSVLYNQAEKGSYSLGIFGGKAQEVAGS 302

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.



Example 109 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 217> which encodes amino acid sequence

<SEQ ID 218; NGS109>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.39
Possible cleavage site: 25
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 7.00 threshold: 0.0 
PERIPHERAL Likelihood = 7.00

modified ALOM score: -1.90
Rule: cytoplasmic protein

*** Reasoning Step: 2 

Final Results

bacterial cytoplasm Certainty= 0.353 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>pir||A82012 hypothetical protein NMA0179 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB83494.1| (AL162752) hypothetical protein NMA0179 [Neisseria meningitidis
Z2491]
Length = 97

Score = 183 bits (464), Expect = 1e-45
Identities = 92/97 (94%), Positives = 95/97 (97%)






Query: 44 MKANDKLNRQ IDVLQKQ SAAIHNEAYIEMNTLLYIUIREWS IHNRKADYAEKGKERIAI^ 103 

MK NDKLNRQIDVLQKQ SAAIHNEAYIEMNTLLYRHREWS +HNRKADYAEKGKE+IALF 
Sbjct: 1 MKTNDKLNRQIimjQKQSAAIHNEAYIEMNTLLYIU^ 60 

Query: 104 PRGLNGITKLPAAVLLPERPYHFDMKEVLYIFSRIPR 140

PRGLNGITKLPAAVLLPERPYHFDMKEVL+IFS IPR
Sbjct: 61 PRGLNGITKLPAAVLLPERPYHFDMKEVLHIFSWIPR 97



As a homolog was found in serogroup A N.meningitidis but not in serogroup B, NGS109 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
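
The summary line of each alignment gives identities and positives as a fraction and a percentage. The Python sketch below parses such a line and recomputes the percentages. It assumes the percentages are truncated to whole numbers, which matches the Example 109 line (92/97 is about 94.8% and is printed as 94%); that truncation rule and the function name are assumptions, not something the text states.

```python
import re

# Matches "Identities = 92/97", "Positives = 95/97", "Gaps = 7/282".
_PAIR = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_alignment_stats(line: str) -> dict:
    """Map each statistic name to (count, aligned_length, truncated_percent)."""
    stats = {}
    for name, num, den in _PAIR.findall(line):
        num, den = int(num), int(den)
        if den == 0:
            continue  # skip a malformed entry rather than divide by zero
        # Assumption: the report truncates the percentage instead of rounding.
        stats[name.lower()] = (num, den, 100 * num // den)
    return stats


line_109 = "Identities = 92/97 (94%), Positives = 95/97 (97%)"
print(parse_alignment_stats(line_109))
```

Recomputing the percentage from the fraction is a simple way to flag summary lines in which OCR has altered a digit.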

Example 110

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 219> which encodes amino acid sequence

<SEQ ID 220; NGS110>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -2.76
Possible cleavage site: 41
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -0.00 threshold: 0.0
INTEGRAL Likelihood = -0.00 Transmembrane 88 - 104 ( 88 - 104)
PERIPHERAL Likelihood = 7.69
modified ALOM score: 0.50
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results 

bacterial inner membrane Certainty= 0.100 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology with sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 
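
Each example ends its localization report with a "Final Results" list of compartments and certainty values. The Python sketch below shows one way such a list could be parsed and ranked. The regular expression and function name are hypothetical, and the input is a cleaned copy of the Example 110 lines; it is not a description of how the original analysis was run.

```python
import re

# Matches e.g. "bacterial inner membrane Certainty= 0.100 (Affirmative) < succ>".
# Optional dashes between the site name and "Certainty" are tolerated.
_RESULT = re.compile(
    r"^bacterial\s+(?P<site>[a-z ]+?)\s*-*\s*Certainty\s*=\s*"
    r"(?P<score>\d+\.\d+)\s*\((?P<call>Affirmative|Not Clear)\)"
)


def parse_final_results(text):
    """Return (site, certainty, call) tuples, highest certainty first."""
    rows = []
    for line in text.splitlines():
        m = _RESULT.match(line.strip())
        if m:
            rows.append((m.group("site"), float(m.group("score")), m.group("call")))
    # sorted() is stable, so ties keep their order from the report.
    return sorted(rows, key=lambda r: r[1], reverse=True)


example_110 = """\
bacterial inner membrane Certainty= 0.100 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>
"""

best = parse_final_results(example_110)[0]
print(best)
```

The first tuple is the predicted compartment; a low certainty such as 0.100 suggests the call should be read with caution.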

Example 111 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 221> which encodes amino acid sequence

<SEQ ID 222; NGS111>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.89
Possible cleavage site: 21
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.44 threshold: 0.0
PERIPHERAL Likelihood = 2.44

modified ALOM score: -0.99 
Rule: cytoplasmic protein 



*** Reasoning Step: 2

Final Results

bacterial cytoplasm Certainty= 0.293 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

>gb|AAC45840.1| (AF001598) restriction endonuclease [Neisseria gonorrhoeae] 
Length = 374

Score = 539 bits (1390), Expect = e-152

Identities = 285/285 (100%), Positives = 285/285 (100%)

Query: 1 MGFIEPFLSSYTPLSRITn/QARTNRKRQTLLSKIVYTHSGFQRSVTENSNIRQINFLIOT 60

MGFIEPFLSSYTPLSRDWQARTNRKRQTLLSKIVYTHSGFQRSVTENSNIRQINFLIKT 
Sbjct: 90 MGFIEPFLSSTCPLSRDYVQARTNRKRQTLLSKIV^ 149 

Query: 61 LVEHPQGKLNKKEIAAMMLVDLOT 120 
LVEH PQGKLNKKE IAAMMLVDLKTFQQDYLTETELNDYFQQG IESGF IERKYNQI SYLWN

Sbjct: 150 LVEHPQGKLNKKE IAAMMLVDLKTFQQDYLTETELNDYFQQGIESGFIERKyNQISYLWN 209 

Query: 121 LLDKLDDLKRVGDDLYFAEDAQRIFGNLDE I TVRKRDPYLHRLYKNQLQEESEEHYGNVK 180 
LLDKLDDLKRVGDDLYFAEDAQRIFGNLDEITV 
Sbjct: 210 LLDKLDDBKRVGDDLYFAEDAQRIFGNLDEITVRK^ 269

Query: 181 CMLEKLAYFVXIASHIKPFILSDDTEAYD^ 240 

CMLEKLAYFVliIASHIKPFILSDOTEAYDPNNG 
Sbjct: 270 CMLEKLAYPVLIASHIKPFILSDOTEAYDPNNGLLLSRTLDSLFDIjKYISro 329 



Query: 241 KRLSDDVWRRWCDVKLDNNLLNDKRKSYIA 285 

KRLSDDVWRRWCEWKLDNNLLND^ 
Sbjct: 330 KRLSDDVWRRWCDVKLDNNIiLNDKRKSYLAYHRE^ 374 



Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 112

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 223> which encodes amino acid sequence

<SEQ ID 224; NGS112>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -9.08
Possible cleavage site: 54
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.22 threshold: 0.0
INTEGRAL Likelihood = -1.22 Transmembrane 160 - 176 ( 160 - 177)
PERIPHERAL Likelihood = 0.58
modified ALOM score: 0.74
Rule: cytoplasmic membrane protein



*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.149 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 



**gbp_12644572 gi|12644572|sp|Q50973|T2B1_NEIGO TYPE II RESTRICTION
ENZYME NGOBI (ENDONUCLEASE NGOBI) (R.NGOBI) (R.NGOI)
gb|AAB03207.2| (U42459) NgoI restriction endonuclease R.NgoI [N. gonorrhoeae]
Length = 350

Score = 694 bits (1791), Expect = 0.0

Identities = 349/350 (99%), Positives = 349/350 (99%)

35 Query: 1 MTLEEQQAKEALDG I IKKSRVHLYKP I QI AE ILYHDRC IKQLDFLNLDTYRNQ SKRWRDE 60 

MTLEEQQAKEAIJXSIIKKSRVHLYKPIQIAEILYHDRCIKQLDFLN^ 
Sbjct: 1 MTLEEQQAKEALDG I IKKSRVHLYKPIQ IAEILYHDRC IKQLDFLNLDTYRNQSKRWRDE 60 

Query: 61 ICRRFLGRISTSSAKFQDNLFEKNAIPPEKLAVIX3TLNRQSDGGVESYI 120 
ICRRFLGRI STS SAKFQDNLFEKNAI P PEKLAVLGTLNRQ SDGGVE SYIYKQFFNRFSQM

Sbjct: 61 ICRRFLGRISTSSAKFQDNLFEKNAIPPEKIAVLGTLNRQSDGGVEOT 120 

Query: 121 SEALAYVGNTDRYSFQLSEFLNLFWLEPGLKRSIDKIYEIVVYALFD 180 
SE LAYVGOTDRYSFQLSEFLNLFV^PGLKRSIDKIYEIVVYALFDALVSEI^ITVSID 
Sbjct: 121 SERIJVWGOTDRYSFQLSEFltfu^FWLEP^ 180

Query: 181 FPKENLFLV^EYQDFAEKI ITMPKNEHLKLPAKIHRVGVT^ 240 

FPKENLFLWEEYQDFAEKI I TMPKNEHLKL PAKIHRVGVTNAADRGLDMWSNFGLAIQVK 
Sbjct: 181 FPKENLFLWEEYQDFAEKIITMPKNEHLKLPAKIHRVGVTNAADRGLDMW 240 



Query: 241 HL SLDEELAEDIVSSISADRIVIVCKKAEQSVIVSLLTQIGWKS^ 300 

HLSLDEEIJ^DIVSSISADRIVTVCKKAEQS^ 
Sbjct: 241 HLSLDEELAEDIVS S ISADRIVrVCKKAEQSVIVSLLTQIGWKSRIQNIVTEDDLISWYE 300 



Query: 301 KALRGQYPIAEALLENIKTEIMREFPAVNEANEFLDFAQNRGYDITVTHF 350

KALRGQYPIAEALLENIKTEIMREFPAVNEANEFLDFAQNRGYDITVTHF
Sbjct: 301 KALRGQYPIAEALLENIKTEIMREFPAVNEANEFLDFAQNRGYDITVTHF 350






Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 113

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 225> which encodes amino acid sequence
<SEQ ID 226; NGS113>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.7
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 4 value: -9.77 threshold: 0.0
INTEGRAL Likelihood = -9.77 Transmembrane 187 - 203 ( 183 - 208)
INTEGRAL Likelihood = -7.22 Transmembrane 25 - 41 ( 19 - 46)
INTEGRAL Likelihood = -4.14 Transmembrane 139 - 155 ( 138 - 155)
INTEGRAL Likelihood = -2.87 Transmembrane 86 - 102 ( 85 - 102)
PERIPHERAL Likelihood = 1.27
modified ALOM score: 2.45
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results 



bacterial inner membrane Certainty= 0.491 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gbp_15902668 gi|15902668|ref|NP_358218.1| (NC_003098) ABC
transporter membrane-spanning permease - glutamine
transport [Streptococcus pneumoniae R6]
gb|AAK99428.1| (AE008440) ABC transporter membrane-spanning permease - glutamine
transport [Streptococcus pneumoniae R6]
Length = 226

Score = 218 bits (556), Expect = 7e-56

Identities = 113/218 (51%), Positives = 155/218 (70%)

Query: 1 MNWPYL IDAVPKFADAAKLTLELSVYGVVLSLLFGL PVAWTAYRIRPFYALARAYIELS 60

M+W + . . +P + A LTL ++V+G++ S L GL V+++ YRI +A AYIELS 

Sbjct: 1 MDWS IVEQYLPLYQKAFFLTLHIAVWGILGSFLLGLIVSI IRHYRI FVLAQVATAYIELS 60 

Query: 61 RNTPLL IQLFFLYYGLPKMGIKWIX3FTCGVI ALVFLGA 120 
RNTPLLIQLFFLY+GLP++GI C + LVFLG SYMAE+ R+G+ A+ + Q G

Sbjct: 61 RNTPLLIQLFFLYFGLPRIGIVLSSEVCATLGLVFLGGSYMAESFRSGL 120 

Query: 121 KAIGLSRFQVTRYVELPQWAVAVPAIGANILFLMKETSW 180 
AIGL+ QVFRYV LPQ AVA+P+ AN++FL+KETSV S V +A+L++V KD+IG+ 
Sbjct: 121 lAIGLTPLQVFRYvVLPQATAVALPSFSANV^FLIKETSv 180

Query: 181 YKTNEALFLLFAAYLIILLPVSLLARRIENRVTlSAKyG 218 

Y+T+ AL +L AYLI+LLP+SL+ IB R+R A +G 
Sbjct: 181 YETDIALAMLWAYLIMLLPISLVFSWIERRIRHAGFG 218 


Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
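
In the ALOM portion of these reports, each predicted membrane-spanning segment appears on an "INTEGRAL" line with a likelihood and residue range. The Python sketch below extracts those segments from a cleaned copy of the Example 113 report. It also illustrates a pattern seen in these excerpts: "count" equals the number of INTEGRAL lines and "value" equals the most negative likelihood. That relationship is an observation from the listed outputs, not a documented rule, and the function name is hypothetical.

```python
import re

# Captures the likelihood and the first residue range on each INTEGRAL line.
_INTEGRAL = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+(\d+)\s*-\s*(\d+)"
)


def integral_segments(text):
    """Return (start, end, likelihood) for each INTEGRAL line, ordered by start."""
    return sorted((int(s), int(e), float(lk)) for lk, s, e in _INTEGRAL.findall(text))


example_113 = """\
count: 4 value: -9.77 threshold: 0.0
INTEGRAL Likelihood = -9.77 Transmembrane 187 - 203 ( 183 - 208)
INTEGRAL Likelihood = -7.22 Transmembrane 25 - 41 ( 19 - 46)
INTEGRAL Likelihood = -4.14 Transmembrane 139 - 155 ( 138 - 155)
INTEGRAL Likelihood = -2.87 Transmembrane 86 - 102 ( 85 - 102)
PERIPHERAL Likelihood = 1.27
"""

segments = integral_segments(example_113)
# Number of segments and the most negative likelihood among them.
print(len(segments), min(lk for _, _, lk in segments))
```

Sorting by start position gives the order of the segments along the protein, which is convenient when comparing the prediction with an alignment.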
Example 114

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 227> which encodes amino acid sequence

<SEQ ID 228; NGS114>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.46
Possible cleavage site: 17
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 18
ALOM: Finding transmembrane regions (Klein et al.)
count: 3 value: -5.36 threshold: 0.0
INTEGRAL Likelihood = -5.36 Transmembrane 50 - 66 ( 47 - 67)
INTEGRAL Likelihood = -4.83 Transmembrane 183 - 199 ( 176 - 200)
INTEGRAL Likelihood = -1.81 Transmembrane 72 - 88 ( 72 - 88)
PERIPHERAL Likelihood = 0.26
modified ALOM score: 1.57
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results 



bacterial inner membrane Certainty= 0.314 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gbp_15902667 gi|15902667|ref|NP_358217.1| (NC_003098) ABC
transporter membrane-spanning permease - glutamine
transport [Streptococcus pneumoniae R6]
gb|AAK99427.1| (AE008440) ABC transporter membrane-spanning permease - glutamine
transport [Streptococcus pneumoniae R6]
Length = 225

Score = 218 bits (555), Expect = 9e-56

Identities = 111/206 (53%), Positives = 151/206 (72%)

Query: 3 EGLLLTAQISLISVAASCVLGTLFGLVLRSRNRLVRI^^^ 62 
+GL +T IS++SV S + GT+ G+++ S +R++RF+ R YLE IRI+P LV LF +YF

Sbjct: 20 QGLGVTTIGISILSvLLSMMFGTVMGIIMTSHSRIIRFLTRLY^ 79 

Query: 63 GLSVVnXJIHIGGFWCVVftTFSLVTC^AE SRGQVFRC 122 

GL+ I+I G + VF+LWG AEMGDLVRGA+ S+ KHQ ESG A GL+ Q++ 
Sbjct: 80 GLARNFNIKI SGETSAI IVFTLWGTAEMGDLVRGAITSIiPKHQFESGQALGLTNVQLYYH 139

Query: 123 IELPQS IRRVDPGAVmjFTRMIKTSSIJ^IG^TEVVKVGQQI IENSLLTQPNASFWVYG 182 

I +PQ +RR+LP A+NL TRM1KT+SL LIGV+EV KVGQQII+++ LT P ASFW+YG 
Sbjct: 140 III PQVLRRLLPQAIl^VTRMIKTTSLVVLIGVVEVTKVGQQI IDSNRLT I PTASFWIYG 199 


Query: 183 LIFMLYFFCCWPLSLLAAKLEQKWEH 208 

I +LYF C+P+S L+ LE+ W + 
Sbjct: 200 TILVLYFAVCYPI SKLSTHLEKHWRN 225 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 

Example 115 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 229> which encodes amino acid sequence 
<SEQ ID 230; NGS115>. Analysis of this protein sequence reveals the following:
GvH: Examining signal sequence (von Heijne)

Signal Score (-7.5): -0.639999

Possible cleavage site: 38
>>> May be a lipoprotein

Amino Acid Composition of Predicted Mature Form: 

calculated from 23 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 5.25 threshold: 0.0 
PERIPHERAL Likelihood = 5.25 
modified ALOM score: -1.55 
Rule: inner or outer membrane protein 
Rule: inner or outer membrane protein 

*** Reasoning Step: 2 

Lipoprotein? 
Inner membrane? 



Final Results 



bacterial outer membrane Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.700 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gbp_4588485 gi|4588485|gb|AAD26123.1| (AF109148) antigenic protein
[Actinobacillus pleuropneumoniae]
Length = 278

Score = 407 bits (1045), Expect = e-112

Identities = 212/282 (75%), Positives = 242/282 (85%), Gaps = 7/282 (2%)

Query: 1 MKLNAKLKALLASAAIAVGLTACGGGSGDAQ 59
MKL+ LK LLA+A A LTAC *A ++QSS A + VA IKEKGVIRIGVFGDKP

Sbjct: 1 MKLSTTLKTLLATAITAFALTACD NANNAQSSTAKDSVAQIKEKGVIRIGVFGDKP 56 

Query: 60 PFGYVDANGKNQGFDVM^ 119

PFGYVDANGK+QGFDVEIAK++A DLLGS DKVEFVLTEAANRVEY++S KVDLILANFT

Sbjct: 57 PFGYVDANGKSQGFE^IAKEIANDLLGS 116 

Query: 120 QTPBRAEAVDFADPYMKV^^ 179

+TPERAE VDFA PYM VALGWSPK + I+D+ QL+ +TLLVNKGTTADA+FTK+HPB+ 
Sbjct: 117 KTPERAEWDFAAPYMNVAIjGWS PKVRLI SDLKQLEGKTLLVNKGTTABAYFTKNHPEI 176 

Query: 180 KLLKFDQOTETFDALKDGROTALAHDT^ 239

LLKFDQOTETFDALKDGRGVALAHD^ F+VAIG+ +GPAE IAPAVQKG 

Sbjct: 177 NLLKFDQNTETFDALKIXSRGVALAHDNALW^ 236 

Query: 240 NADIJjNWVNGEIAAMKKTCRLKAAYEKT^ 281

N LL+ EIA K +G+LKAAYEKTL+FVYG+ KPE L 
Sbjct: 237 NQALLDVINKEIAEFKTNGKLKAAYEKTLVPVYGD KPELL 276 

The protein was expressed in E.coli as a soluble 28.16kDa His-fusion product, lacking its leader peptide
and its poly-glycine sequence (GGGSG), and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 116 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 231> which encodes amino acid sequence 
<SEQ ID 232; NGS116>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -7.13
Possible cleavage site: 61
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.86 threshold: 0.0
INTEGRAL Likelihood = -1.86 Transmembrane 51 - 67 ( 51 - 67)
PERIPHERAL Likelihood = 1.54
modified ALOM score: 0.87
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.174 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>



The protein has no homology to sequences in the databases. 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 



Example 117

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 233> which encodes amino acid sequence

<SEQ ID 234; NGS117>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 0.25
Possible cleavage site: 40
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 41
ALOM: Finding transmembrane regions (Klein et al.)
count: 2 value: -4.57 threshold: 0.0
INTEGRAL Likelihood = -4.57 Transmembrane 100 - 116 ( 99 - 118)
INTEGRAL Likelihood = -1.59 Transmembrane 54 - 70 ( 54 - 70)
PERIPHERAL Likelihood = 0.53
modified ALOM score: 1.41
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane Certainty= 0.283 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>



The protein has homology to the following sequences in the databases: 

**gbp_15793413 gi|15793413|ref|NP_283235.1| (NC_003116) putative
integral membrane protein [Neisseria meningitidis Z2491]
pir||C81957 probable integral membrane protein NMA0408 [imported] - Neisseria
meningitidis (group A strain Z2491)
emb|CAB83707.1| (AL162753) putative integral membrane protein [Neisseria
meningitidis Z2491]
Length = 550

Score = 1115 bits (2885), Expect = 0.0

Identities = 539/550 (98%), Positives = 545/550 (99%) 

Query: 1 MVAYAFIoFLFOTAAVLLrraSHYR^ 60 
MVAY FLFLFVTAA+ + LI +RSHYRWTYFFAS ALFWLAGGMLMLTAQWQRALN^

Sbjct: 1 MVAYVFLFLFTOAALVLIIRS 60 

Query: 61 VLILFHRLKIHYYKQPLLISDFLLIADWRNWB^ 120 
VLILFHRLKIHYYKQPLLISDFLLIADWRNWETLFHY^ YAVFGWSG 
Sbjct: 61 VTiILFHRLKIHYYKQPLLISDFLLIADWRNWETO 120

Query: 121 ADSLGMPWRWAGAVLFAAAFVSVRHFSKHPGAVKTVi^DSLPDTC 180 

ADSL + PWRWAGAVIaFAAAFVS+IUiFSKHPGAVKTWLDS 
Sbjct: 121 ADSLWPWRWAGAVLFAAAWSMWJFSKHPGAVKTWLDSLPD^ 180 


Query: 181 QVPVFEGDGEAFARQMPSETRPYGMSDEKPDIVVTIjME STLDPHCFDFAAAKI PDLKMFG 240 

QVPVFEGDGBAFARQMPSETRP GMSDEKPDIVVTLMESTLDPHCFDFAAAKI PDLKMFG 
Sbjct: 181 QVPVFEGIXSEAFARQMPSE^RPCGMSDEKPDIVVTLMEST^ 240 

Query: 241 RQEDTVFSSPLRVHTFGGATWKSEFAFLAG^ 300

RQEDTVFSSPLRVHTFGGATWKSEFAFIiAGVPSTDFGA^ 
Sbjct: 241 RQEDTVFSSPLRVHTFGGATWKSEFAFIiAGVPSTDFG 300 

Query: 301 EHGYFCVALS PFTKGNYNAKAAYDHFGFNL^ S SEEMMQYAR 360 

EHGYFCVAL S PFTKGNYNAKAAYDHFGFNLMFQPQDLGYPAPMGKNLWHI S SEEMMQYAR

Sbjct: 301 EHGYFCVAL S PFTKGNYNAKAAYDHFGFNLMFQPQDLGYPAPMGKNLWHI SSE 360 

Query: 361 MILEKRHPDLENVRQPMFVYVLTMKEHGPYRTOTDNVFDLD 420 
MILEKRHPDLENVRQPMFVYVLTMKEHGPYRT^ 
Sbjct: 361 MILEKRHPDLENVRQPMFVYVLTMKEHGPYRTOT 420

Query: 421 ADLDKAVESFDRYLHERGKPFVFGYFGDHQVPFEGVSVRKKWDYAQPDYVTQFA 480 

ADLDKAVE SFDRYLHERGKPFVFGYFGDHQVPFEGVSVRKKWDYAQPDYVTQFAVRSNIA 
Sbjct: 421 ADLDKAVESFDRYLHERGKPFVFGYFGDHQVPFEGVSVRKKWDYAQPDYV^ 480 






Query: 481 GGFVQRQDFLDLAFAGGVLMEAAGLEAKIX3FMRANMAMRGL 540 

GGFVQRQ+FLDLAFAGGVLMEAAGLEAKIXSFM^ ELVGNYRN 
Sbjct: 481 GGFVQRQNFLDLAFAGGVLMEAAGLEAKIXSFMR^^ 540 



Query: 541 YLYDVLKIAR 550

YLYDVLKIAR 
Sbjct: 541 YLYDVLKIAR 550 

A homolog was found in serogroup A N. meningitidis but not in serogroup B, so NGS117 protein and 
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
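
The reasoning used for NGS109 and NGS117 (a homolog in serogroup A meningococcus but none in serogroup B) can be expressed as a simple check over a list of database hits. The Python sketch below is only an illustration: the hit representation as (organism, serogroup) pairs and the function name are hypothetical, and deciding what counts as a homolog is left to the caller.

```python
def lacks_serogroup_b_homolog(hits):
    """Return True when no hit is a serogroup B N. meningitidis sequence.

    `hits` is an iterable of (organism, serogroup) pairs, where serogroup
    is a letter such as "A" or "B", or None when it does not apply.
    """
    return not any(
        organism == "Neisseria meningitidis" and serogroup == "B"
        for organism, serogroup in hits
    )


# NGS117: the only reported homolog is from serogroup A strain Z2491.
ngs117_hits = [("Neisseria meningitidis", "A")]
print(lacks_serogroup_b_homolog(ngs117_hits))  # prints True

# Hypothetical protein with homologs in both serogroups.
hypothetical_hits = [("Neisseria meningitidis", "A"), ("Neisseria meningitidis", "B")]
print(lacks_serogroup_b_homolog(hypothetical_hits))  # prints False
```

A protein with no database hits at all also passes this check, which is consistent with examples whose proteins have no reported homology.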

Example 118 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 235> which encodes amino acid sequence 

<SEQ ID 236; NGS118>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): 0.59
Possible cleavage site: 19
>>> May be a lipoprotein
Amino Acid Composition of Predicted Mature Form:
calculated from 22
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.33 threshold: 0.0
PERIPHERAL Likelihood = 8.33
modified ALOM score: -2.17



Rule: inner or outer membrane protein
Rule: inner or outer membrane protein

*** Reasoning Step: 2 

Lipoprotein? 
Inner membrane? 



Final Results 



bacterial outer membrane Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane Certainty= 0.700 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases. 

The protein was expressed in E.coli as a soluble 12.98kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 119

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 237> which encodes amino acid sequence

<SEQ ID 238; NGS119>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.75
Possible cleavage site: 47
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.69 threshold: 0.0
PERIPHERAL Likelihood = 7.69
modified ALOM score: -2.04
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results 



bacterial cytoplasm Certainty= 0.213 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gbp_2625122 gi|2625122|gb|AAB86635.1| (AF031495) putative
hemoglobin receptor component precursor HpuA [Neisseria gonorrhoeae]
Length = 360

Score = 668 bits (1724), Expect = 0.0
Identities = 331/331 (100%), Positives = 331/331 (100%)

Query: 1 VS I PT AT PLP AGEVTLS SDNGNIENINTAGAG SASDAPSRSRRSLDAAPQNT SG I S IRQR 60 

VSIPTATPLPAGEVTLSSDNGNIENINTAGAGSASDAPSRSRRSiiDAAPQNTS 
Sbjct: 30 VSIPTATPLPAGEWLSSDNGNIENINTAGAGSASDAPSRSRRSLDAAPQNTSGISIRQR 89 



Query: 61 EVEKDYFGYKSKET SF IFKT PGGAQYALS SYADPITVS YS S PDFKI PDRHAGQRLADGSR 120 

KVEKDYFGYKSKCTSFIFKTPGGAQYALS 
Sbjct: 90 EVEKDYFGYKSKETSFIFKTPGGAQYALSSYADPITVSYSS PDFKI PDRHAGQRLADGSR 149 
Query: 121 IFICCSDSGATSYAEITKQDYMKFGAWIGPNGEIDLFAGGFPVGKTPPPAFSYGSSTPET 180

IFICCSDSGATSYAEITKQDYMKFGAWIGPNGEIDLFAGGFPVGKTPPPAFSYGSSTPET
Sbjct: 150 IFICCSDSGATSYAEITKQDYMKFGAWIGPNGEIDLFAGGFPVGKTPPPAFSYGSSTPET 209


Query: 181 ALSKGKITYQVWGIRVRNGQFVTSSYTPPK^ 240 

ALSKGKITYQWGIRVRNGQFVT S SYTPPKSGSYYGTLANTPVLSF ITANFNSNTLAGKI 
Sbjct: 210 ALSKGKITYQWGIRVRNGQFVTSSYTPPKSGSYYGTIAOT 269 

Query: 241 LGNSDYGPDVDIQNATITGPTFSGDATSGGKSGKLEGKFFGKFASTRSSEVSIGGKITFD 300

LGNSDYGPDVDIQNATITGPTFSGDATSGGKSGKLEGKFFGKFASTRSSEVSIGGKITFD
Sbjct: 270 LGNSDYGPDVDIQNATITGPTFSGDATSGGKSGKLEGKFFGKFASTRSSEVSIGGKITFD 329

Query: 301 GDRSLDTVFGGVSYEKKLDDTSQDTNHLTKQ 331
GDRSLDTVFGGVSYEKKLDDTSQDTNHLTKQ

Sbjct: 330 GDRSLDTVFGGVSYEKKLDDTSQDTNHLTKQ 360

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics.

Example 120

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 239> which encodes amino acid sequence

<SEQ ID 240; NGS120>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -7.24
Possible cleavage site: 38
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.42 threshold: 0.0
PERIPHERAL Likelihood = 6.42
modified ALOM score: -1.78
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results 

bacterial cytoplasm Certainty= 0.280 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gsa_AAR91313 N. gonorrhoeae glycosyltransferase LgtC
| WO9610086-A1 | 09-JUL-1996
Length = 306

Score = 535 bits (1379), Expect = e-151
Identities = 252/253 (99%), Positives = 252/253 (99%)

Query: 8 GGGNIRF IDVNPEDFAGFPLNIRHI S ITTYARLKLGEYIADCDKVLYL^ 67 

GGGNIRF IDVNPEDFAGFPLNIRHI S ITTYARLKLGEYIAIXIDKVLYLOT0VLVIUX5IjKP 
Sbjct: 54 GGGNIRFIDVNPEDFAGFPLNIRHISITTYARLK^ 113 


Query: 68 LV7DTDLGGNWGAC IDLFVERQEGYKQKIGMADGEYYFNAGVLL IKLKKWRRHDIFKMSC 127 

LWDTDLGGNWVGACIDIiFVERQEGYKQKIGMADGBYYFN^ 
Sbjct: 114 LWDTDI>GGNWVGACIDLFVERQEGYKQKIGMADGEYYFNAG\^ 173 

Query: 128 EWVEQYKDVMQYQDQDILNGLFKGGVCTANS 187

EWVEQYKDVMQYQDQDILNGLFKGGVC^^ 
Sbjct: 174 EWVEQYKD^QYQDQDILNGLFKGGVCYANSRFNFMP 233

Query: 188 OTAMFVAVSHYCGSAKPWHRTC 247 

NTAMPVAVSHYCG SAKPWHRDCTVWGAERFTELAGSLTTVPEEWRGKIiAVPPTK MLQRW 
Sbjct: 234 NTAMPVAVSHYCGSAKPWHRIXTVWGAERFra 293 

Query: 248 RKKLSARFLRKIY 260 

RKKLSARFLRKIY 
Sbjct: 294 RKKLSARFLRKIY 306 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics. 



Example 121 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 241> which encodes amino acid sequence

<SEQ ID 242; NGS121>. Analysis of this protein sequence reveals the following:

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -6.22
Possible cleavage site: 37
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.23 threshold: 0.0
PERIPHERAL Likelihood = 3.23
modified ALOM score: -1.15
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results



bacterial cytoplasm Certainty= 0.402 (Affirmative) < succ>
bacterial periplasmic space Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

**gbp_15281345 gi|15281345|dbj|BAB63435.1| (AB058945) DNA adenine
methylase M.Ssu4109IB [Streptococcus suis]
Length = 271

Score = 269 bits (687), Expect = 4e-71

Identities = 127/211 (60%), Positives = 158/211 (74%), Gaps = 1/211 (0%)

Query: 1 MIF ADP PYFL SNDGF SCQNGQMVSVl^GNWDKSKGMAADLEFYEEWLRIiCYALIiKPNGT^ 60

MIFADPFYFLSN G S GQ+VSV+KG+WDK + EF +W+RL +LKPNGTI 
Sbjct: 44 MIFADP PYFL SNGG I SNSGGQWSVDKGDWDKVlTSLEEKHEFIJRKWIRIiAKNvTjKPNGT I 103 

Query: 61 WVCGTFHNIYL IGYLMQTVGYHILNNXTWEKPNPPPNLSCRFFTHSTETILW 119 
W+ G+FHNIY +G ++ G+ ILNNITW+K NP PNLSCR+FTHSTETILWA+K +KKA

Sbjct: 104 WISGSFHNIYSVXSimjEQEGFKILNNITWQKTNPAPNLSCRYFTHS 163 

Query: 120 KHTFHYEMMKAQNNGKQMKCVTWFAPPNOT 179 
+H ++YE+MK N+GKQMK VW K+EK GKHPTQKP LLER IL+++ GD I 

Sbjct: 164 RHYYNYELMKELNIXSKQM^ 223






Query: 180 FDPFMGSGTTGVAALKHGRRFCGCELEEDFL 210 

DPF+GSGTTGV A + GR+F G + E D+L 
Sbjct: 224 LDPFVGSGTTGWAKRLGRKFIGIDAERDYL 254 
Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 122 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 243> which encodes amino acid sequence

<SEQ ID 244; NGS122>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -2.55 
Possible cleavage site: 23 
»> May be a lipoprotein 

JSnTLid Composition of Predicted Mature Form: 

calculated from 15 . a1 , 

ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 11.46 threshold: 0.0
PERIPHERAL Likelihood = 11.46
modified ALOM score: -2.79
Rule: inner or outer membrane protein
Rule: inner or outer membrane protein



*** Reasoning Step: 2 

Lipoprotein? 
Inner membrane? 



Final Results 



bacterial inner membrane certainty- ) n*urri < suco 

The protein has no homology to sequences in the databases.
The protein was expressed in E.coli as an insoluble 14.85kDa His-fusion product and then purified.
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 123

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 245> which encodes amino acid sequence
<SEQ ID 246; NGS123>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.65
Possible cleavage site: 20
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.24 threshold: 0.0
PERIPHERAL Likelihood = 4.24
modified ALOM score: -1.35
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.404 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
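
The "Final Results" block lists one certainty value per cellular compartment, and the predicted localisation is the entry with the highest certainty. A minimal Python sketch of reading such a block follows, assuming each line has the shape "site, optional dashes, Certainty= value (flag)"; the regular expression and the function name `parse_final_results` are illustrative assumptions and not part of the prediction program.

```python
import re

# One result line: site name, optional dashes, certainty value, flag in brackets.
LINE = re.compile(
    r"^(?P<site>bacterial [a-z ]+?)\s*-*\s*Certainty\s*=\s*"
    r"(?P<score>\d+\.\d+)\s*\((?P<flag>[^)]+)\)"
)

def parse_final_results(text):
    """Return a list of (site, certainty, flag) tuples."""
    rows = []
    for raw in text.splitlines():
        match = LINE.match(raw.strip())
        if match:
            rows.append((match["site"], float(match["score"]), match["flag"]))
    return rows

# Values copied from the Final Results of Example 123.
EXAMPLE_123 = """\
bacterial cytoplasm --- Certainty= 0.404 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>
"""

rows = parse_final_results(EXAMPLE_123)
best = max(rows, key=lambda row: row[1])
print(best)
```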

Example 124 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 247> which encodes amino acid sequence
<SEQ ID 248; NGS124>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -5
Possible cleavage site: 18
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.59 threshold: 0.0
INTEGRAL Likelihood = -1.59 Transmembrane 289 - 305 ( 289 - 305)
PERIPHERAL Likelihood = 3.76
modified ALOM score: 0.82
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.164 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_1617515 gi|1617515|gb|AAC82509.1| (U65994) pilin gene inverting
protein homolog PivNG [Neisseria gonorrhoeae]
Length = 320

Score = 614 bits (1584), Expect = e-175
Identities = 311/320 (97%), Positives = 316/320 (98%)

Query: 1 MRNTVGLDXSiaTFDATAl^CT 60 

MRN VGLDI SKLTF+ A+AMVGKTEHSAKFDNDSKGLDQFSDRLXSLG QNLHICMEATG+ 
Sbjct: 1 MRNAVXSLDISKLTFNASAWGKTEHSAKFDN^ 

Query: 61 YYEEVADYFAQYYSVYVVNPLKISKYAESRFKRTKTDKQDAKLIA 120

YYEEVADYFAQYYSVYVWPLKI SKYAESRFKRTKTDKQDAKLI AQYCR A+ESELVKRQ 
Sbjct: 61 YYEEVADYFAQYYSVYVVNPLKI SKYAESRFKRTKTDKQDAKLIAQYCRLAKESELVKRQ 120 

Query: 121 KPTDEQYRL SROTAAYAQlKSECAAMKNRHHAAKD^ 180 
KPTDEQYRL RMTAAYAQIKSECAAMKNRHHAAKDEEAAKAYA+ 1 IKAMNEQLEVLKEKI

Sbjct: 121 KPTDEQYRLLRMTAAYAQIKSECAAMKNRHHAAKDEEAAKAYAQI IKAMNEQLEVLKEKI 180 

Query: 181 KEQTEKPNCKEGVKRLETIPAIGRMTAAVLFHHLTSSK^ 240 
KEQTEKPNCKEGVKRLETI PAIGRMTAAVLFHHLTSSKFETSNKFAAFAGLSPQQKESGT 
Sbjct: 181 KEQTEKPNCKEGVKRLETI PAIGROTAAVLFHHLTSSKFETSNKFAAFAGLS 240

Query: 241 SVRGKGKLTKFGNRKLRAVLFMPAMVAYR 300 

SVRGKGKLTKFGNRKLRAVLFMPAMVAYRIRAFPDFIKRLEEKK^ 
Sbjct: 241 SVRGKGKLTKFGNRKLRAVLFMPAMVAYRIRAF^ 300 

Query: 301 IAYHVHKKGGDYDPSRYKSA 320

IAYHVHKKGGDYDPSRYKSA
Sbjct: 301 IAYHVHKKGGDYDPSRYKSA 320

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
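
Across the examples in this section, the printed "modified ALOM score" is consistent with a simple linear transformation of the printed ALOM "value". The Python sketch below is an empirical fit to the numbers printed in these examples; it is an observation about the output, not a description of how the prediction program computes the score, and the function name `modified_alom_score` is illustrative.

```python
def modified_alom_score(value, threshold=0.0):
    """Empirical fit: (threshold - value) / 5, shifted by +0.5 when a
    transmembrane segment is predicted (value < threshold), else by -0.5."""
    shift = 0.5 if value < threshold else -0.5
    return (threshold - value) / 5.0 + shift

# (value, printed modified ALOM score) pairs copied from Examples 122 to 128.
PRINTED = [(11.46, -2.79), (4.24, -1.35), (-1.59, 0.82),
           (1.70, -0.84), (7.37, -1.97), (10.03, -2.51), (-3.45, 1.19)]

for value, printed in PRINTED:
    fitted = modified_alom_score(value)
    print(f"value={value:6.2f} printed={printed:5.2f} fitted={fitted:5.2f}")
```

Under this fit, a positive modified score coincides with the examples in which at least one INTEGRAL (transmembrane) segment is reported, and a negative score with those reporting only a PERIPHERAL likelihood.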

Example 125 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 249> which encodes amino acid sequence
<SEQ ID 250; NGS127>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.8
Possible cleavage site: 52
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 1.70 threshold: 0.0
PERIPHERAL Likelihood = 1.70
modified ALOM score: -0.84
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.383 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_1076012 gi|1076012|pir||B55225 stress-sensitive restriction
system protein 2 - Corynebacterium glutamicum (ATCC 13032)
gb|AAC00044.1| (U13922) This orf may encode a typeI or typeIII restriction
endonuclease which is stress-sensitive and
ATP-dependent. It contains a typical ATP binding region
(Walker motif) [Corynebacterium glutamicum]
Length = 632

Score = 298 bits (764), Expect = 2e-79
Identities = 199/633 (31%), Positives = 321/633 (50%), Gaps = 32/633 (5%)



Query: 2 LRTYLNQLTP- PELADSVKOT 59 

L Y+ Ii+ +L + V TVD F + 1+ Q VLL G+VQSGKT+ +LG+++ 

Sbjct: 7 LI^ITSLSDNADLREKVTATVDAFRHTV^ 66 



Query: 60 ALADDGDHKVFLYLTTDSVD^ 117 

D H + + LT+ + LQTR ++V F K+P + 

Sbjct: 67 DCLDSTFHT IVI - LT S PNTRLVQQTYDRVAQAF PDTLVCDRDGYNDFRANQKSLT PRKS I 125 

Query: 118 WIKKNARVXjKRWRNLFASQS SLKGYPLVIVDDEADAASLNTNSDKPAKDAST INKLLND 177 

W> K VI» W +F +1, G+P++I+DDEADA SLOT ++ D STIN L 
Sbjct: 126 VVVGKI PAVLGNWLRVFNDSGALSGHPVLI IDDEADATSLNTKVNQ- - SDVST INHQLTS 183 

Query: 178 IKNSCCQSLFIQIiTATPQSLLLQHEBSDWQPEFIHFFEAGEKYIGGNF^ 236 

I++ +++Q+T TPQ++LLQ ++S+W E + F GE YTGG FS+ + Y+ 

Sbjct: 184 IRDLATGCIYI£VTGTPQAVLIiQSDDSNWAAEH^ 243 



Query: 237 FIDSELDDMKDE SGE IAEGAKQALLSFLITCT^FAIiCDKANCNFALHPSYKIQDHQAFSK 296 
F +++ D+ S A+ ++Ii+T A F h ++ C +HPS+ H+ F++

Sbjct: 244 FANTQFDEDSRFS DAIYTTLLTAALFKLRGESLCTMLIHPSHTASSHRDFAQ 295

Query: 297 KIQAFLTOLVQAVNNGEDLAGSPKESYLDLQKTKPDIHHFDE IYEKLTALLENKQI STLV 356

+ + L + + +F+ +Y I* +T ++ +1 L + ++ I + 

Sbjct: 296 EARLQLTFAFERFYEPM- IQHNFQRAYEQLAQTDSNLPPLRKILNILGGMEDDFSIH — I 352 

Query: 357 VNSQTET-DFDLEKGFNIIIGGNVTGRGLTI^ 415 

VNS T + D G+NIX-M3GN +GRGI/T LQTV+Y R +K+P ADT WQH+R+FGY 
Sbjct: 353 VNSDNPTVEEDWADGYNI IVG^SLGRGLTFNI^QTVFYVRESKRPQAOTLWQHARMFGY 412 

Query: 416 DRDKSLLRLYI PFDVYYFFVQLNQANNLI IGQAKNSG -- OrCQVIYPKNINPTRKNVLKF 473

R K +R++4-P + P ++ N X Q + +I+VI + PTR NVL 

Sbjct: 413 KRHKDTMRVFMP AT I AQTFQEVYljGNEAIKNQIiDHGTHIND IRVII/SDGVAPTRANVIiDK 472 

Query: 474 DSiTOQIVGGVNYFPIjHPNEDNLSEINKILPSIL^ 533 
+ + GGVNYF P N+ ++K L + L + I + + 4-L+

Sbjct: 473 RKVGNLSGG\mYFAADPRIKNVEALDKKl^ 529 

Query: 534 PDDWNKEKF IAGVEALKAQRP SFKTYVL IKTGRKL SRATGTML SEDDRKLGEKYPNDLFL 593 
p+D + F A + + +P ++++T RK+++ TG +LS D+ h Ii 

Sbjct: 530 PNDLDIATFKAALLDFEMSQPHLT^ 589

Query: 594 TLYQWGNKDKG WQGKDFWLPNIKLP 619 

IiY++ G D W W+PNIKLP 

Sbjct: 590 ILYRIEGVNDAAAQRGEPTWSSDPIWVPNIKIiP 622 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
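
The Score (in bits) and Expect values reported for each hit are related, under the standard Karlin-Altschul statistics used by BLAST, by E = m·n·2^(-S'), where S' is the bit score and m·n is the effective search space. The Python sketch below inverts that relation to estimate the search space implied by a printed hit. The function name `implied_search_space` is illustrative, and because Expect is printed to one significant figure the result is only an order-of-magnitude estimate.

```python
def implied_search_space(bit_score, expect):
    """Solve E = m*n * 2**(-S') for the effective search space m*n."""
    return expect * 2.0 ** bit_score

# Corynebacterium glutamicum hit of Example 125: 298 bits, Expect = 2e-79.
space = implied_search_space(298, 2e-79)
print(f"{space:.2e}")
```

Hits from the same database search against a longer query are expected to imply a larger search space, which is one way to sanity-check a Score and Expect pair that may have been damaged in transcription.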

Example 126

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 251> which encodes amino acid sequence
<SEQ ID 252; NGS128>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.98
Possible cleavage site: 20
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.37 threshold: 0.0
PERIPHERAL Likelihood = 7.37
modified ALOM score: -1.97
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.225 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_11387195 gi|11387195|sp|Q50976|T2F7_NEIGO TYPE II RESTRICTION
ENZYME NGOFVII (ENDONUCLEASE NGOFVII) (R.NGOFVII)
(R.NGOVII)
pir||T10166 restriction endonuclease (EC 3.1.21.-) NgoVII - N.gonorrhoeae
gb|AAA86271.1| (U43736) R.NgoVII [Neisseria gonorrhoeae]
Length = 326

Score = 651 bits (1679), Expect = 0.0
Identities = 317/326 (97%), Positives = 320/326 (97%)

Query: 1 MNTVFSNIANi^ITEKSLNAVW^ 60

MNTVTSNIANAKITEKSLNAVW^ 
Sbjct: 1 MNTVFSNIANAKITEKSLNAVWMDLFKSADEVLI^ 60 

Query: 61 DLIiVGMHYLEGFSHLQYDSLCKLOT 120 

DIJjVGMHYLEGFSHLQYDSLO^^ 
Sbjct: 61 DLLVGMHYLEGF SHI^YDSLCKLNDFLRHEKRGAVYVSPFVKFHGKMYS 120 

Query: 121 GSANLTCFWTOSTERTYBTMLHI^ 180

GSANLTCFTOSTERTYETMLHLNGKPAQILQADIQSTIHKLGKNIQETO 
Sbjct: 121 GSANLTCFWDSTERTYETMLHLNGKPAQIIiQADIQSTIHKLGKNI^ 180 

Query: 181 IJSNCLGVQKIAPEQ IRQLFAQTSEYHFS I PAKTEEKSNLNVFFGEGRRDKRGFVRPRFWY 240 
LENCLGVQKIAPEQIRQLFAQTSEYHF S IPAKTEEKSNIaNVFFGEGRRDKRGFVKPRPVnf

Sbjct: 181 I^OiGVQKIAPEQIRQLFAQTSEYHFSlPAKTEBKSI^^ 240 

Query: 241 EVEL IVSKDIT SQEGYFVLKSFTVTTDIX^QFQCKTSGDYSKOT 300 
EVELrVSKDITSQEGYPVLKSFTVITDIXSWQFQCICrSGDYSK + +LKTLGKWIKGR 
Sbjct: 241 EVELIVSKDITSQEGYPVLKSFTVITDDGWQFQCKT^ 300



Query: 301 liESHGCLQNNEKITHETLREYGNDHF 326 

LESHGCLQNNEKITHETLREYGN+ F 
Sbjct: 301 LESHGCLQNNEKITHETLREYGNESF 326 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 



Example 127 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 253> which encodes amino acid sequence
<SEQ ID 254; NGS129>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.5
Possible cleavage site: 48
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 10.03 threshold: 0.0
PERIPHERAL Likelihood = 10.03
modified ALOM score: -2.51
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.545 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_15804186 gi|15804186|ref|NP_290225.1| (NC_002655)
DNA-damage-inducible protein [Escherichia coli O157:H7
EDL933]
ref|NP_312547.1| (NC_002695) DNA-damage-inducible protein [Escherichia coli
O157:H7]
gb|AAG58789.1|AE005591_13 (AE005591) DNA-damage-inducible protein [Escherichia
coli O157:H7
EDL933]
dbj|BAB37943.1| (AP002566) DNA-damage-inducible protein [Escherichia coli
O157:H7]
Length = 278

Score = 340 bits (872), Expect = 2e-92
Identities = 161/266 (60%), Positives = 197/266 (73%)

Query: 1 MTTENNAFENAKHIDETGNEYWSARTL 60 
H f FB +H 6 B+WSAR L +L+Y +WRNFQ+ + +A ACE S + H 
Sbjct: 5 MNEHHQPFEEIRHYGTEGQEFWSARELAPI^ 64

Query: 61 FVETNKMIALGKGGQREVADYRLSRYAC YL rVQNGDPSKSVXAAGQTYF AVQARRQELQD 120 

FVET KM+ LG G QRE+ D LSRYACYL+VQNGDP+K V1AAGQTYFA+Q RRQEL D 
Sbjct: 65 FVETTKMVVLGSGAQRELEDVHLSRYAC^ 124 



Query: 121 EAAFRSLGEDKQRLLLRRQLKEHOTDLAAAAKDAGVEKPVEYA 180 

+ AF+ L ED++RL LR +L+EHN L AA+ A V ++A+FQNHGY+GLYGGLD++ 
Sbjct: 125 DEAFKQLREDEKRLFLENELKEHNKQLVEA 184 



Query: 181 GIHSRKGLKKSQRILDHMNASEPAANLFRATQ^ 240

IH KGLKKSQ+ILDHM ++E AANLFRATQTEEKli+R + K QAN HP+VG KVR 
Sbjct: 185 AIHQLKGLKKSQXILDHMGSTELAANLFRATQTEBKLK^^ 244 

Query: 241 QTIEELGGIMPENQPVPEKSIKQIiEN 266 

QTI+ELGG MPE P P+ SIKQLEN

Sbjct: 245 QTIQELGGTMPEBLPTPQVSIKQLEN 270 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 128

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 255> which encodes amino acid sequence
<SEQ ID 256; NGS130>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.68
Possible cleavage site: 14
>>> Seems to have an uncleavable N-term signal seq
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 3 value: -3.45 threshold: 0.0
INTEGRAL Likelihood = -3.45 Transmembrane 68 - 84 ( 68 - 92)
INTEGRAL Likelihood = -1.59 Transmembrane 10 - 26 ( 10 - 26)
INTEGRAL Likelihood = -1.44 Transmembrane 46 - 62 ( 45 - 62)
PERIPHERAL Likelihood = 1.48
modified ALOM score: 1.19
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.238 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_17988861 gi|17988861|ref|NP_541494.1| (NC_003318) hypothetical
protein [Brucella melitensis]
gb|AAL53758.1| (AE009687) hypothetical protein [Brucella melitensis]
Length = 99

Score = 108 bits (270), Expect = 3e-23
Identities = 59/91 (64%), Positives = 69/91 (74%)

Query: 11 LIjFSCMIiAVTC PTRLIGFFALRNRTL SRRAQTVMEAAPGCVL I SVIAPYFVSDKPHEL IA 70

L M 4-VT TR+ G+ LRNRTLS RA VMEAAPGCVLISVTAP FVSDKP LIA 
Sbjct: 8 LTILAMASVTYLTRIGGYVLLRmTLSNRAMAVMEAA 67 

Query: 71 IALTAFAACRFSMLFTVI^IGVGSSGISGWLM 101 
+A+T FAA RFSML TVLIG+G++ I +L+

Sbjct: 68 LAVTVFAATRF SMLPTVL IGMGAAS ICRYLI 98 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 129

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 257> which encodes amino acid sequence
<SEQ ID 258; NGS131>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.65
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.05 threshold: 0.0
PERIPHERAL Likelihood = 7.05
modified ALOM score: -1.91
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.152 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_16760390 gi|16760390|ref|NP_456007.1| (NC_003198) hypothetical
protein [Salmonella enterica subsp. enterica serovar
Typhi]
emb|CAD01841.1| (AL627270) hypothetical protein [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 227

Score = 104 bits (259), Expect = 2e-21
Identities = 68/221 (30%), Positives = 115/221 (51%), Gaps = 11/221 (4%)

Query: 2 DKEKVLDKXKKCLALGRSVNEHEAAQALRQ -- rx 59

D++K ++K+KK LAL S N HEAA ALR+A+ LM+ + + DIA+S + E + 
Sbjct: 3 DQDKHIEKLKKLLALAASGNPHEAALALRRA 62 

Query: 60 MAP3CLAGWQWGVANMIADIFGCKSYQRGKT MMFY6IGNRAETSAYAFDWYRQI SAD 116 

+ + + G+ N+I + FG S T + FYG RA +AY ++V+ RQ+

Sbjct: 63 GSLRPPRYMLGLMNI IREAFGVNS I IHPGTYPGVGFYGNRERAALAAYTWEVIaARQLKKA 122 

Query: 117 RRKFLKT-CRAGKPSHRTYLADRFCGGW^S 175 
R++++ + K + RT IHF GW+ + ++ F ++D+E+ +M 4* +■ 4-YP 
Sbjct: 123 RQQYISAQNKRIKTATRTSRGI^FAEGWvTAVT^SEIQSFALT 182

Query: 176 AEARTRDAKSSI^SKMEYEALTRGMESGKQVKLHYAVNG 216

R R S G Y G G+ V+LH V+G 

Sbjct: 183 QTTRARKPGRS -RNGDASRY AGFREGQNVRLHRPVSG 218 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 130

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 259> which encodes amino acid sequence
<SEQ ID 260; NGS132>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.06
Possible cleavage site: 30
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.49 threshold: 0.0
PERIPHERAL Likelihood = 2.49
modified ALOM score: -1.00
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.075 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 131

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 261> which encodes amino acid sequence
<SEQ ID 262; NGS133>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): 1.64
Possible cleavage site: 53
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.82 threshold: 0.0
PERIPHERAL Likelihood = 3.82
modified ALOM score: -1.26
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.068 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 132 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 263> which encodes amino acid sequence 

<SEQ ID 264; NGS135>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.67
Possible cleavage site: 39
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 5.52 threshold: 0.0
PERIPHERAL Likelihood = 5.52
modified ALOM score: -1.60
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.457 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 133 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 265> which encodes amino acid sequence
<SEQ ID 266; NGS136>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.97
Possible cleavage site: 15
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 11.35 threshold: 0.0
PERIPHERAL Likelihood = 11.35
modified ALOM score: -2.77
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.523 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 134 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 267> which encodes amino acid sequence
<SEQ ID 268; NGS137>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.52
Possible cleavage site: 51
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 2.81 threshold: 0.0
PERIPHERAL Likelihood = 2.81
modified ALOM score: -1.06
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.374 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.



Example 135 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 269> which encodes amino acid sequence
<SEQ ID 270; NGS138>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -7
Possible cleavage site: 36
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 10.66 threshold: 0.0
PERIPHERAL Likelihood = 10.66
modified ALOM score: -2.63
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.415 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_13559865 gi|13559865|ref|NP_112075.1| (NC_002730) terminase
small subunit [Bacteriophage HK620]
gb|AAK28890.1|AF335538_42 (AF335538) terminase small subunit [Bacteriophage
HK620]
Length = 140

Score = 125 bits (313), Expect = 5e-28
Identities = 56/122 (45%), Positives = 85/122 (68%)

Query: 4 TKRKIXSRPTDYTKDMADKI^ 63 
T+ K GRP+DY ++AD IC +++G SL +C G+P T++RWL +E+FR +YA+ 
Sbjct: 3 TEPKAGRPSDYMPEVADDIC SIoLS SGESLLKVCKRPGMPDKSTVFRWIJ^HEDFRDKY^ 62

Query: 64 AREKQ AD YF AEE HE IAD S AQ AE S AAVSKAKLQ I D ARKWAASKI APKKYG DKS ELDVK SGDG 125 

A E +AP EEI EIAD+A ++A V+KA+L++D RKWA +++ P+KYGDK ++ DG 
Sbjct: 63 ATEARADSIFEEIFEIADNAIPDAAEVAKARLRVDTRI^^ 124 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 136 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 271> which encodes amino acid sequence
<SEQ ID 272; NGS139>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -1.49
Possible cleavage site: 32
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.65 threshold: 0.0
PERIPHERAL Likelihood = 8.65
modified ALOM score: -2.23
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.301 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_16127009 gi|16127009|ref|NP_421573.1| (NC_002696) hypothetical
protein [Caulobacter crescentus]
gb|AAK24741.1| (AE005943) hypothetical protein [Caulobacter crescentus]
Length = 184

Score = 59.7 bits (143), Expect = 4e-08
Identities = 50/164 (30%), Positives = 74/164 (44%), Gaps = 20/164 (12%)

Query: 30 ASGREFRTAYYTYPQWRFSLSFEVLRTKASVNELEKLA^ — 87 

ASG E RT+ ++ + R+ ++ ++E+ +L FF AR+G F + DPAD 

Sbjct: 5 ASGHERRTSPWSQSRRRYLIA TAPRPLDE IAELVAFFEARRGRLHGFRFRDPADFK 60 

Query: 88 NAVTDQPVGmVQGVAR-YQLVRSMGGFIEPVSAVKERP AVKVGGTAL 134
A DQ+GT GV++QLHG E V+ +P V V G L

Sbjct: 61 SCAPSVQPAAGDQAIC^TGDGVRKA^ 119 

Query: 135 AYGRDYTVTDKGVLVFNTPQPPGRPITOTGGF^ 178 
A G G++ NT P G +T F VRF D +D

Sbjct: 120 APGAFAVDVTTGLITLNTAPPAGAAV^ 163 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 137

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 273> which encodes amino acid sequence
<SEQ ID 274; NGS140>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -3.86
Possible cleavage site: 31
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -4.94 threshold: 0.0
INTEGRAL Likelihood = -4.94 Transmembrane 34 - 50 ( 31 - 54)
PERIPHERAL Likelihood = 8.01
modified ALOM score: 1.49
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.297 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_17987625 gi|17987625|ref|NP_540259.1| (NC_003317) Hypothetical
Phage Protein [Brucella melitensis]
gb|AAL52523.1| (AE009572) Hypothetical Phage Protein [Brucella melitensis]
Length = 144

Score = 72.4 bits (176), Expect = 5e-12
Identities = 43/119 (36%), Positives = 64/119 (53%), Gaps = 7/119 (5%)



Query: 10 RIVTSEARSWLGTFYHHHAMVKGAGV^ 68 
R++ EA W+GTPY HA G DC L+ ++ A+ G+ PE +P Y DW

Sbjct: 6 RVLAEAHRWIGTPYRHGASTIX^SCIX:!^^ 63 

Query: 69 CERYLGFVTQPC — RETESPQAGDIAV — WRFGRSFSHGGILAGGGKVIHSYIGRGWS 123 
+ L ++ RE +PQ GD+ V W+ G + H GI+A G+ IH+Y G GV++ 
Sbjct: 64 GDPMLEAAVRYMVRREEHAPQ PGDLLVFRWKPGFAAKHMG IMAREGRF IHAYQGHGVLA 122

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 138 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 275> which encodes amino acid sequence
<SEQ ID 276; NGS141>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): 5.35
Possible cleavage site: 28
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 29
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.86 threshold: 0.0
PERIPHERAL Likelihood = 8.86
modified ALOM score: -2.27
Score for OM-PP discrimination: 1.53
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: 1.53
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2

Outer membrane? Score: 0.152929
Outer membrane? Score: 0.152929

Final Results

bacterial outer membrane --- Certainty= 0.512 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.320 (Affirmative) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

>**gbp_5915870 gi|5915870|sp|Q50940|CAH_NEIGO Carbonic anhydrase
precursor (Carbonate dehydratase)
emb|CAA72038.1| (Y11152) carbonic anhydrase [Neisseria gonorrhoeae]
Length = 252

Score = 523 bits (1347), Expect = e-147
Identities = 252/252 (100%), Positives = 252/252 (100%)

Query: 1 MPRFPRTLPRLTAVLLIACTAFSAAAH^ 60

MPRFPRTLPRLTAVLLLACTAFSAAAHGNHTHWGYTGH^ 
Sbjct: 1 MPRFPRTLPRLTAVLLLACTAFSAAAHGNHT^ 60

Query: 61 SFVNITETVSGKLPAIKVNYKPSMVDVENNGHTIQV^ 120
SPVNITETVSGKLPAIKVNYKP SMVDVENNGHTIQVNYPEGGNTLTVNGRTYTLKQFHFH 

Sbjct: 61 SFVNITETVS(^PAIK\mYKP 120 

Query: 121 VP SENQIKGRTFPMEAHFVHLDENKQPLVIiAVLYEAGKTNGRLSS IWNVMPMTAGKVKLN 180

VP SENQ IKGRTFPMEAHFVHLDEl^QPLVIiAVL»YEAGKTNGRLS S IWNVMPMTAGKVKLN 
Sbjct: 121 VPSENQ IKGRTFPMEAHFVHLDENKQPLVLAVLYEAGKTNGRLS S IWNVMPMTAGKVKLN 180 

Query: 181 QPFDASTLL PKRLKYYRFAGSLTT PPCTEGVSWLVLKTYDHIDQAQAEKFTRAVGSENNR 240

QPFDASTLLPKRLKYYRFAGSLTTPPCTEC^SV^VLKTTO 
Sbjct: 181 QPFDASTLLPKRLKYYRFAG SLTTPPCTEGVSWLVLKTYDHIDQAQAEKFTRAVGSENNR 240 

Query: 241 PVQPLNARWIE 252 

PVQPLNARWIE 
Sbjct: 241 PVQPLNARWIE 252 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
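
In the signal-sequence output of Example 138, the possible cleavage site is 28 and the mature-form composition is "calculated from 29", whereas the examples with no predicted N-terminal signal sequence are calculated from 1. A minimal Python sketch of that bookkeeping follows, assuming 1-based residue numbering; the helper name `mature_form` is hypothetical, and the sequence is only the N-terminal fragment shown in the alignment with the carbonic anhydrase hit.

```python
def mature_form(sequence, cleavage_site, cleavable):
    """Return (start, mature_sequence); positions are 1-based.

    With a cleavable signal sequence the mature form starts one residue
    after the cleavage site; otherwise it starts at residue 1.
    """
    start = cleavage_site + 1 if cleavable else 1
    return start, sequence[start - 1:]

# N-terminal residues of the carbonic anhydrase hit as printed in the alignment.
N_TERMINUS = "MPRFPRTLPRLTAVLLLACTAFSAAAHGNHTHWGYTGH"

start, mature = mature_form(N_TERMINUS, cleavage_site=28, cleavable=True)
print(start, mature)
```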

Example 139 

A DNA sequence was identified in ^gonorrhoeae <SEQ ID 277> which encodes amino acid sequence 
<SEQ ID 278; NGS142>. Analysis of this protein sequence reveals the following: 
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t 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -0.49 
Possible cleavage site: 22 
»> Seems to have no N-terminal signal seq. 
5 Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 1.22 threshold: 0.0 
PERIPHERAL Likelihood = 1.22 
10 modified ALQM score: -0.74 

Rule: cytoplasmic protein 

*** Reasoning Step: 2 

15 Final Results 



40 



bacterial cytoplasm Certainty= 0 . 145 (Affirmative) < suco 

bacterial periplasmic space Certainty* 0.000 (Not Clear) < suco 

bacterial outer membrane Certainty* 0.000 (Not Clear) < suco 

20 bacterial inner membrane Certainty* 0.000 (Not Clear) < suco 

The protein has homology to the following sequences in the databases: 

> **gbp_15794480 gi|15794480|ref|NP_284302.1| \(NC_003116) hypothetical
protein [Neisseria meningitidis Z2491]
pir||F81851 hypothetical protein NMA1587 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB84814.1| (AL162756) hypothetical protein [Neisseria meningitidis Z2491]
Length = 181

Score = 358 bits (919), Expect = 6e-98
Identities = 173/181 (95%), Positives = 178/181 (97%)

Query: 1 I^TOTARMNNLIPEHLAAYAHSDNLQIEGGHRCFSLSCQGRDTFH^ 60 
+KTDTA+MNNLIPEHLAAYAHSD+LQIEG HRCFSLSCQGRDTFHIRYYGEPFDGL+TDT 
Sbjct: 1 MKTDTAKMNNLI PEHLAAYAHSDSLQIEGV^ 60

Query: 61 DKAFVKlVAVEAVSGDEIVLFDGtftf^ 120 

DKAPVK IVAVEAVSGDE IVLFDGAEHGYNAMFCDKYS NQKQNRTLTDLDEYTYRV IHL 
Sbjct: 61 DKAFVKIVAVEAVSGDErVLFIXaAEHGYNA 120 



Query: 121 YYNIDYEDjETYEDFVNSEGQVPL IDGRI I SFDSLKRNGFDAI S I PL I DEKHSVR KT J iNEKDS 181 

YYNIDYEDEYEDFVNSEGQVPLIDGRI ISFDSLKRNGFDAIS+DLIDEKHSVRELLNEELS 
Sbjct: 121 YYNIDYEDEYEDFVNSEGQVPL IDGRI I SFDSLKRNGFDAI SVDL IDEKHSVRELLNEELS 181 



A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS142 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 140 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 279> which encodes amino acid sequence
<SEQ ID 280; NGS143>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -2.51 
Possible cleavage site: 57
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 4 value: -15.23 threshold: 0.0
INTEGRAL Likelihood =-15.23 Transmembrane 84 - 100 ( 79 - 107)
INTEGRAL Likelihood = -8.12 Transmembrane 259 - 275 ( 250 - 281)
INTEGRAL Likelihood = -4.14 Transmembrane 159 - 175 ( 153 - 176)
INTEGRAL Likelihood = -3.88 Transmembrane 216 - 232 ( 216 - 235)
PERIPHERAL Likelihood = 1.11
modified ALOM score: 3.55
Rule: cytoplasmic membrane protein 

*** Reasoning Step: 2 

Final Results 



bacterial inner membrane --- Certainty= 0.709 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_5764059 gi|5764059|emb|CAB53350.1| \(AJ010260) NosR protein
[Paracoccus denitrificans]
Length = 724

Score = 393 bits (1009), Expect = e-108
Identities = 191/379 (50%), Positives = 249/379 (65%), Gaps = 22/379 (5%)

Query: 1 LMVQRVLSVlsnDKAFVTADI^^ 60 

L+VQR + +K F T DL Y+LPQ Y AP A PAA +D 
Sbjct: 358 LLVQREVGPIEKVFHTFDLGYQLPQKYLRSIAPAPEA AAPAAQAD 402 

Query: 61 DASAENGV SNQLWKQ IWKAKQGQ IWVG IALT ILLLVFLFQDWIVRYEKWYDRFRFAFLT 120
E+ QLWK+IW + +1 + L +L VP FQ + RYE+ + FR A+LT
Sbjct: 403 ESQAQAQLWKRIWLDSKPKIAGLAAMLLVLTGVFF 458

Query: 121 FTLFYIGWYAQAQLSVVim^TI^SAILTEFHWEFFLMDPrVFIL 180
TL ++GWYA AQLSWN + LF +++ F W+ FL+DP+ FILW AA +L W RG +

Sbjct: 459 VTLVFLGWYANAQLSVVNLMALFGSLVNGFSWQAFLLD 518 

Query: 181 CGWLC PFG SLQELTNRIAKKLGVKQ ITVPHMLHTRLNVIKYLILFGFLAI SLYDLGTAEK 240 
CGWLCPFG+LQELTN++A+KL + Q T+P LH RL +KY+I G +SL + AE 
Sbjct: 519 CGWLCPFGALQELTNQVARKIjRI PQWTLPWGLHERLWPVIOTIIFLGLFGVSLMSVEQAEH 578

Query: 241 FAEVEPFKTAI ILKFMCDWWFVAF AVALLIAGLF IERFFCRYLCPLGAGI ALPGRFRVFD 300 

AEVEPFKTAI ILKF+ W FVA+A ALLIAGLF+BRF+CRYLCPLGA +A+P R R+FD 
Sbjct: 579 LAEVEPFKTAI ILKFIRAWPFVAYAAALLIAGLFVERFYCRYIiC PLGAALAI PARMRMFD 638 

Query: 301 WLRIOfKMCGNPCQICTHBCPVQAXAPE 360 

WL+RY CGNPCQ C +CPVQ+I P G+I+PNBCI CLHCQV+Y +T CP V+ KK 
Sbjct: 639 WLKRYHECGNPCQTCARQCPVQSIHPTGEINPNEC INCLHCQVLYQSBTTCPWT KK 695 

Query: 361 KQKQAAAKSGELENVSKQP 379

+++ A +G + + + P 
Sbjct: 696 LKRREAVAAGSMPKLGQPP 714 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
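The percentages printed in the BLAST summary lines of these examples can be checked against the raw counts. The following is a minimal sketch, not part of the original analysis: the function name `parse_summary` is hypothetical, and the assumption that percentages are truncated (rather than rounded) to whole numbers is inferred only from the NosR alignment figures reported in Example 140.

```python
import re

# Matches the three count fields of a BLAST summary line, e.g.
#   "Identities = 191/379 (50%), Positives = 249/379 (65%), Gaps = 22/379 (5%)"
FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")

def parse_summary(line):
    """Return {field: (count, aligned_length, percent)}.

    The percentage is truncated to an integer; this is an assumption
    that happens to agree with the figures shown for Example 140.
    """
    out = {}
    for name, num, den in FIELD.findall(line):
        num, den = int(num), int(den)
        out[name] = (num, den, num * 100 // den)
    return out

# Summary line transcribed from the NGS143 (Example 140) alignment.
line = "Identities = 191/379 (50%), Positives = 249/379 (65%), Gaps = 22/379 (5%)"
summary = parse_summary(line)
for name, (num, den, pct) in summary.items():
    print(f"{name}: {num}/{den} = {pct}%")
```

Recomputing the percentages in this way is mainly useful as a consistency check on transcribed alignment summaries.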

Example 141 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 281> which encodes amino acid sequence 

<SEQ ID 282; NGS144>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): 1.23
Possible cleavage site: 21
>>> May be a lipoprotein

Amino Acid Composition of Predicted Mature Form: 

calculated from 20 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 1 value: -0.37 threshold: 0.0
INTEGRAL Likelihood = -0.37 Transmembrane 90 - 106 ( 89 - 106)
PERIPHERAL Likelihood = 10.82
modified ALOM score: 0.57
Rule: inner or outer membrane protein
Rule: inner or outer membrane protein

Rule: cytoplasmic membrane protein 

*** Reasoning Step: 2 

Lipoprotein?
Inner membrane?

Final Results

bacterial outer membrane --- Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane --- Certainty= 0.734 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
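Where the localisation output reports more than one "Affirmative" compartment, as for NGS144, the candidates can be ranked by their certainty values. The sketch below is illustrative only: the regular expression and the helper `rank_localizations` are assumptions made for this example and are not part of the prediction program itself.

```python
import re

# One line of the "Final Results" block, with or without the "---" filler.
RESULT = re.compile(
    r"bacterial\s+(.+?)\s*-*\s*Certainty=\s*([0-9.]+)\s*\((Affirmative|Not Clear)\)"
)

def rank_localizations(text):
    """Return (compartment, certainty, verdict) tuples, highest certainty first."""
    hits = [(name, float(score), verdict)
            for name, score, verdict in RESULT.findall(text)]
    return sorted(hits, key=lambda h: h[1], reverse=True)

# Values transcribed from the NGS144 analysis (Example 141).
final_results = """
bacterial outer membrane --- Certainty= 0.790 (Affirmative) < succ>
bacterial inner membrane --- Certainty= 0.734 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>
"""

ranked = rank_localizations(final_results)
for name, score, verdict in ranked:
    print(f"{name:20s} {score:.3f} {verdict}")
```

Ranking in this way only orders the program's own scores; it does not add evidence beyond the prediction.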

Example 142

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 283> which encodes amino acid sequence
<SEQ ID 284; NGS145>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): 1.5 
Possible cleavage site: 19
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:

calculated from 20 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 8.70 threshold: 0.0 
PERIPHERAL Likelihood =8.70 
modified ALOM score: -2.24

Score for OM-PP discrimination: -9.24 
Rule: outer membrane or periplasmic protein 
Score for OM-PP discrimination: -9.24 
Rule: outer membrane or periplasmic protein 



*** Reasoning Step: 2 

Periplasmic space? Score: 0.924443 
Periplasmic space? Score: 0.924443 



Final Results 



bacterial periplasmic space --- Certainty= 0.931 (Affirmative) < succ>
bacterial outer membrane --- Certainty= 0.231 (Affirmative) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_5051426 gi|5051426|emb|CAB45007.1| \(AJ242839) OpcA protein
[Neisseria gonorrhoeae]
Length = 263

Score = 531 bits (1369), Expect = e-150 

Identities = 262/263 (99%), Positives = 263/263 (99%) 

Query: 1 MKKAIiLALTIAAISGTAMAQLPDFLGKGEY^ 60 

MKKALLALTIAAISGTi^QLPDFLGKGEYTVRTO 
Sbjct: 1 MKKALLALT IAAI SGTAMAQLPDFLGKGEYTVRTDI SKQTLKNADLKEKHKVQKNIGFRA 60 

Query: 61 DMPFDDIHHGMRFEV SHSRDKKEMYVVTESTTKPFGK1J V KKKKTDVYAGYTYTQPI SEAT 120 

rMPFDDIHHGMRFWSHSRDKKDMYVVTESTTKPFGKDV 
Sbjct: 61 DMPFDDIHHGMRFEVSHSRDKKDMYVVra 120 

Query: 121 KLRAGLGLGYEKYKDAVANEKGTVSTEREAFYTKAHADL 180 

KLRAGLGI/?5fEKyKDAVANEKGTV STEREAFYTKAHADLTSDI/3GGWYLNPWAEVKVDLD 
Sbjct: 121 KIiRAGLGLGYEKYKDAVANEKGTV STEREAFYTKAHADLTSDLGGGVnfLNPWAEVKVDLD 180 

Query: 181 AKLKHNATVAGVSADINAKTRGWGVGVGAN^ 240 

AKLKHNATVAGVSADINAKTRGWGVGVGLANIG 
Sbjct: 181 AKLKHNAWAGVSADINAKTI^WGVG^GANIGKQITDWGIEAGPFYKH^ 240 

Query: 241 DGGNIRVDPTKINEYGVRVGVKF 263 

DGGNIRVDPTKINEYGVRVGVKP 
Sbjct: 241 DGGNIRVDPTKINEYGVRVGVKF 263 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
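In the signal-sequence output, the position given under "calculated from" is the first residue of the predicted mature form. The following sketch states the relationship seen in the examples with a cleavable signal (site 19 gives 20, site 51 gives 52) and in those with no signal (mature form starts at residue 1). The function `mature_start` is hypothetical, and the lipoprotein case of Example 141 (site 21, calculated from 20) is deliberately not covered.

```python
def mature_start(cleavage_site, cleavable):
    """First residue (1-based) of the predicted mature form.

    Assumption for illustration: with a cleavable signal the mature form
    begins one residue after the cleavage site; otherwise the full-length
    protein is used. Lipoprotein predictions are not modelled here.
    """
    if cleavage_site < 1:
        raise ValueError("cleavage site must be a positive residue number")
    return cleavage_site + 1 if cleavable else 1

# (cleavage site, cleavable?) pairs transcribed from the examples.
cases = {
    "NGS145": (19, True),   # Example 142
    "NGS146": (51, True),   # Example 143
    "NGS142": (22, False),  # Example 139
}
starts = {name: mature_start(site, ok) for name, (site, ok) in cases.items()}
for name, start in starts.items():
    print(name, "mature form calculated from residue", start)
```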

Example 143 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 285> which encodes amino acid sequence 
<SEQ ID 286; NGS146>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 

Signal Score (-7.5): 0.1 

Possible cleavage site: 51
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form: 

calculated from 52 
ALOM: Finding transmembrane regions (Klein et al.) 

count: 0 value: 3.50 threshold: 0.0 

PERIPHERAL Likelihood = 3.50 
modified ALOM score: -1.20 
Score for OM-PP discrimination: -15.70 
Rule: outer membrane or periplasmic protein 
Score for OM-PP discrimination: -15.70
Rule: outer membrane or periplasmic protein 

*** Reasoning Step: 2 

Periplasmic space? Score: 1.56979 
Periplasmic space? Score: 1.56979 

Final Results 

bacterial periplasmic space --- Certainty= 0.944 (Affirmative) < succ>
bacterial outer membrane --- Certainty= 0.375 (Affirmative) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_5051429 gi|5051429|emb|CAB45013.1| \(AJ242839) hypothetical
protein [Neisseria gonorrhoeae]
Length = 109

Score = 216 bits (549), Expect = 2e-55
Identities = 109/109 (100%), Positives = 109/109 (100%)

Query: 1 MFKRPEEI IVLILAVLWIAGTYFLAALFGADAYTVLXIT^ 60

MFKRPEEIIVLIIAVLWIAGTYETiAALFGAI^^ 
Sbjct: 1 MFKRPEEI rVLILAVLWIAGTYFIAALFGADAYTVLM 60 

Query: 61 YIAAAARLPDHLLVAVSESIGRTRFFTLACIJ^^ 109 
YIjAAAARLPDHLLVAVSE S IGRTRFFTLAC IMDVQNHL S PD SRNRRLSV

Sbjct: 61 YLAAAARLPDHLLVAVSBSIGRTRFFTLACim^^ 109 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 144

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 287> which encodes amino acid sequence 

<SEQ ID 288; NGS147>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -4.09
Possible cleavage site: 40
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form: 
calculated from 1 

ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -2.28 threshold: 0.0
INTEGRAL Likelihood = -2.28 Transmembrane 36 - 52 ( 36 - 52)
PERIPHERAL Likelihood = 5.20
modified ALOM score: 0.96
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.191 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gbp_6606516 gi|6606516|gb|AAF19189.1|AF200716_2 \(AF200716)
trafficking protein B [Neisseria gonorrhoeae]
Length = 139

Score = 274 bits (700), Expect = 7e-73
Identities = 139/139 (100%), Positives = 139/139 (100%)

Query: 2 MILLDTNVI SEPLRPQPNERWAWLDSL ILEDVYL SAI WAELRIiGVALLLNGKKKNVLH 61 
mLLOTNvISEPLRPQPNERWAVtoDSLILE^ 
Sbjct: 1 MILLDTNVISEPLRPQPNERWAWLDSLILEDVYLSA^ 60

Query: 62 ERLEQSILPLFAGRILPFDEFVAAIYAQIRSYAKTHGKE 121 

ERLEQSILPLFAGRILPFDEPVAAIYAQIRSyARTHGKEIAAADGYIAATAKQHSLTVAT 
Sbjct: 61 ERLEQSILPLFAGRILPFDEPVAAIYAQIRSYAKTHGKEIAAADGYIAATAKQHSLTVAT 120

Query: 122 RDTGSFFAADVAVFNPWHD 140 

RDTGSFFAADVAVFNPWHD 
Sbjct: 121 RDTGSFFAADVAVFNPWHD 139

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 145 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 289> which encodes amino acid sequence
<SEQ ID 290; NGS148>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): 0.86 
Possible cleavage site: 47
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:

calculated from 48 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 5 value: -15.44 threshold: 0.0
INTEGRAL Likelihood =-15.44 Transmembrane 157 - 173 ( 142 - 181)
INTEGRAL Likelihood =-12.15 Transmembrane 62 - 78 ( 56 - 83)
INTEGRAL Likelihood = -6.32 Transmembrane 194 - 210 ( 191 - 212)
INTEGRAL Likelihood = -4.30 Transmembrane 87 - 103 ( 85 - 104)
INTEGRAL Likelihood = -2.60 Transmembrane 121 - 137 ( 121 - 142)
PERIPHERAL Likelihood = 2.92
modified ALOM score: 3.59
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.718 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>



The protein has homology to the following sequences in the databases: 

> **gbp_15777859 gi|15777859|gb|AAL05955.1| \(AY048756) putative cadmium
binding protein [Staphylococcus aureus]
Length = 209

Score = 354 bits (908), Expect = 1e-96

Identities = 177/208 (85%), Positives = 194/208 (93%) 

Query: 14 MRCFMFSTVITAAVLYIATAVDIiLVILL IFF ILVSLF 73

MRC M TV+ AAVLYI ATAVDLLVILL IFFARA TRKEYRDI Y+GQYLGS + IL ILVSLF 
Sbjct: 1 MRC IMIQTVVAAAVLY lATAVDLLVTLLIFFARAKTRKEYRDI WGQYI^ I IL ILVSLF 60 

Query: 74 iafVLNYVPEKWVLGLLGLI 133
xafvlnYVPEKW+LGLLGLIPIYLGIOT VA+VT

Sbjct: 61 LAFVLNYVPEKWI LGLLGL I P r^LGIKVAI YDIX^EGEKRAKKELNEKGLSKLVGT^rTO 120 

Query: 134 VASCGADNIGLFVPYFOTLDLV^ 193
+ASCGADNIGLFVPYFVTL + +LL+TL VFLILIF LV+TAQ+LANI G+GEIVEKFSR
Sbjct: 121 IASCGADNIGLFVPYFVTLSVTNLLLTLFVFLILIFFLV^ 180



Query: 194 WIMAVIYIGLGLFIIIENNTIRTIISII 221 

WIMA+IYI LGLFIIIEN+TI+TI+ 1 
Sbjct: 181 WIMAIIYIAIiGLFIIIENDTIQTILGFI 208 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 
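The ALOM output lists predicted transmembrane segments in order of likelihood rather than position. As a hedged illustration (the regular expression and the helper `parse_segments` are assumptions for this sketch, not part of ALOM), the five segments reported for NGS148 can be read into a structure and re-ordered along the sequence:

```python
import re

# Matches lines such as:
#   INTEGRAL Likelihood =-15.44 Transmembrane 157 - 173 ( 142 - 181)
SEGMENT = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?[\d.]+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)

def parse_segments(text):
    """Return segments sorted by start position.

    Each segment is a dict holding the likelihood value, the core span
    and the extended span printed in parentheses.
    """
    segments = []
    for score, start, end, ext_start, ext_end in SEGMENT.findall(text):
        segments.append({
            "likelihood": float(score),
            "core": (int(start), int(end)),
            "extended": (int(ext_start), int(ext_end)),
        })
    return sorted(segments, key=lambda s: s["core"][0])

# Segment lines for NGS148 (Example 145), listed as in the program output.
alom_output = """
INTEGRAL Likelihood =-15.44 Transmembrane 157 - 173 ( 142 - 181)
INTEGRAL Likelihood =-12.15 Transmembrane 62 - 78 ( 56 - 83)
INTEGRAL Likelihood = -6.32 Transmembrane 194 - 210 ( 191 - 212)
INTEGRAL Likelihood = -4.30 Transmembrane 87 - 103 ( 85 - 104)
INTEGRAL Likelihood = -2.60 Transmembrane 121 - 137 ( 121 - 142)
"""

segments = parse_segments(alom_output)
for seg in segments:
    start, end = seg["core"]
    print(f"{start:4d}-{end:<4d} length {end - start + 1:2d} likelihood {seg['likelihood']:7.2f}")
```

Ordering by position makes it easier to see how the predicted segments are spaced along the protein.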






Example 146 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 291> which encodes amino acid sequence
<SEQ ID 292; NGS149>. Analysis of this protein sequence reveals the following:



GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -0.63
Possible cleavage site: 43
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)

count: 0 value: 2.12 threshold: 0.0 
PERIPHERAL Likelihood = 2.12
modified ALOM score: -0.92 
Rule: cytoplasmic protein 



*** Reasoning Step: 2 
Final Results

bacterial cytoplasm --- Certainty= 0.122 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gbp_15675455 gi|15675455|ref|NP_269629.1| \(NC_002737) conserved
hypothetical protein [Streptococcus pyogenes]
[Streptococcus pyogenes M1 GAS]
gb|AAK34350.1| (AE006588) conserved hypothetical protein [Streptococcus pyogenes
M1 GAS]
Length = 224

Score = 106 bits (264), Expect = 3e-22

Identities = 63/151 (41%), Positives = 85/151 (55%), Gaps = 12/151 (7%) 

Query: 20 LSALQHYAFCPRQCALIHNEQAWAENYLTAQGKALHERVDSDE- PETCKGVRFEWTVHVL 78 

LS +QH+ FC RQ AL1H EQ W +N TA G+ LH + D+ E K + + + 

Sbjct: 11 LSGIQHFQFCKRQWALIHIEQQWLDNEATAHGQVLH^ 70. 

Query: 79 ADKLG I SG ILDLVE VDTKTGRLKP — VEYKRGKPKPDPGDEIQLCAQGLCL 127

* * + +LG+ SGI +D+VE + K G+ P VEYKRGKPK D D +QL AQ +CL 

Sbjct: 71 SAELGLSGIMDVVEFYKDDQGVSLRGKRGKWL 130 

Query: 128 EEMTGQTVSEGALWYMQTRHRVPVVFSDGLR 158 
45 " EE ++EG L+Y BV V + LR 

Sbjct: 131 EETLDCDINEGCLYYHSVNQRVIVPMTSALR 161 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 147

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 293> which encodes amino acid sequence 

<SEQ ID 294; NGS150>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -0.71
Possible cleavage site: 19
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -0.85 threshold: 0.0
INTEGRAL Likelihood = -0.85 Transmembrane 79 - 95 ( 79 - 96)
PERIPHERAL Likelihood = 6.52
modified ALOM score: 0.67
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.134 (Affirmative) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has no homology to sequences in the databases. 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 148

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 295> which encodes amino acid sequence 

<SEQ ID 296; NGS151>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): 3.47
Possible cleavage site: 23
>>> Seems to have a cleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 24
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 14.32 threshold: 0.0
PERIPHERAL Likelihood = 14.32
modified ALOM score: -3.36
Score for OM-PP discrimination: -32.29
Rule: outer membrane or periplasmic protein
Score for OM-PP discrimination: -32.29
Rule: outer membrane or periplasmic protein

*** Reasoning Step: 2 



Periplasmic space? Score: 3.22889

Periplasmic space? Score: 3.22889 



Final Results 



45 bacterial periplasmic space - Certainty .. Jj^g- : ^ 

JSSS Suner ~ cSatnt£ !'.S "ar> < suco 

bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gsa_AAY75310 Neisseria gonorrheae ORF 649 protein sequence SEQ ID
NO:2094 | WO9957280-A2 | 21-MAR-2000
Length = 103

55 SJ-wTCLSi" 3 8 / «. <— - «- <»> 

Query 7 3^13^™^ - 
60 sbjct: 13 ^OT3*i^p»iii«™nsicx-irc^^ 71 
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Query: 66 FLFQALPAADLGQRKQMCQ IAMDSF 90 

P+ LP A ++K C+ A ++F 
Sbjct: 72 KAFRTLPYA EQKIQCRAAYEAF 93 

The protein was expressed in E.coli as a soluble 935kDa His-fusion product and then purified.

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 149 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 297> which encodes amino acid sequence
<SEQ ID 298; NGS152>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -5.46 
Possible cleavage site: 18
>>> Seems to have an uncleavable N-term signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 1 value: -3.19 threshold: 0.0 
INTEGRAL Likelihood = -3.19 Transmembrane 368 - 384 ( 367 - 384)
PERIPHERAL Likelihood = 0.53
modified ALOM score: 1.14
Rule: cytoplasmic membrane protein

*** Reasoning Step: 2

Final Results

bacterial inner membrane --- Certainty= 0.227 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gsa_AAY81609 Streptococcus pneumoniae type 4 protein sequence #109
| WO200006737-A2 | 24-MAY-2000
Length = 1237

Score = 48.1 bits (113), Expect = 4e-04

Identities = 80/312 (25%), Positives = 142/312 (44%), Gaps = 59/312 (18%) 

Query: 57 RRQARIRVGNLI TDSLEHIRVKALLPLPL KLPVKRI NLPRNLPALPVRLRKTI 109 

RRQ R + ++ L+H RV L P+ ++FV+++ +PR A RL++ + 
Sbjct: 941 RRQVR- QPQQVLVHQLQHQRVHRLRRQPVHQS QQVPVllQLPH^ 999

Query: 110 S PRQ IGDALP ILKLQRI — RLTLHLKPLPLHPQIX5LLHIKRPWIPLRHLAVQRTLVRLN 167 

+p+ P+ + Q++ RL H + PL L +P R + L QR VRLN 

Sbjct: 1000 APQP — QPQPVRQPQQVSQRLNRHQRVRPLQQVLA- PQPQRQQVHRL — QRQRVRLN 1051 

Query: 168 RRIKPPLLQHRLTVRRILRRSRRQPFPAQFPDRRIFIMFRHOT 227 

R + LQ L A P R+ +H +R++ L Q R 
Sbjct: 1052 RHQRVRPLQQVL APQPQRQQVHRLQH QRVRPLQQVLAPQPQR 1093 

Query: 228 IRRSRPLIKLPLLRRQRIRPGRHQRTLRVKITHRIjAAPIHIFVKSQRRRRPSARIRRARI 287

+ R L+RQR+R +HQR + + H+L +H PV+ Q + R + ++++ + 

Sbjct: 1094 QQVHR -LQRQRVRLSQHQRWQPQQAHQL-IjNLHQPVR-QPQHRQAPQLQQVPV 1144 

Query: 288 APREIRPGPRIGGKRL1AARKP-QTGIRTPFESTRPAQPPRPI-I»NIVTAQIHHIPITRR 345
+ R R+ + + R+P Q +R P R + P+P+ IiN H P+ R+
Sbjct: 1145 RQPQRRQVRRL QQVFVRQPQQVPVRQP- -QRRQVRRPQFVHLN RHQFV-RQ 1192

Query: 346 PGL I IRNGTPHR 357 
P ++ + H+ 
Sbjct: 1193 PQQVLVHQLQHQ 1204

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
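The "Expect" values in these examples appear in several textual forms ("e-147", "6e-98", "4e-04", "0.003"), so comparing hits requires converting them to numbers first. The sketch below shows one way to do this; the helper `parse_expect` is hypothetical, and treating a bare exponent such as "e-147" as 1e-147 is an assumption about the intended meaning of that shorthand.

```python
def parse_expect(text):
    """Convert a BLAST-style Expect string to a float.

    A value written with no mantissa (for example "e-147") is read as
    1e-147; this interpretation is an assumption made for illustration.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)

# Expect strings transcribed from Examples 138, 139, 149 and 151.
hits = {
    "Example 138": "e-147",
    "Example 139 (NGS142)": "6e-98",
    "Example 149 (NGS152)": "4e-04",
    "Example 151 (NGS154)": "0.003",
}

# Smaller Expect values indicate more significant matches.
ordered = sorted(hits, key=lambda name: parse_expect(hits[name]))
for name in ordered:
    print(f"{name}: Expect = {parse_expect(hits[name]):.3g}")
```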

Example 150

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 299> which encodes amino acid sequence
<SEQ ID 300; NGS153>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -5.48 
Possible cleavage site: 13
>>> Seems to have no N-terminal signal seq.

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 6.31 threshold: 0.0 
PERIPHERAL Likelihood = 6.31
modified ALOM score: -1.76
Rule: cytoplasmic protein

*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.150 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_15832758 gi|15832758|ref|NP_311531.1| \(NC_002695) hypothetical
protein [Escherichia coli O157:H7]
dbj|BAB36927.1| (AP002562) hypothetical protein [Escherichia coli O157:H7]
Length = 188

Score = 73.9 bits (180), Expect = 3e-12
Identities = 54/169 (31%), Positives = 79/169 (45%), Gaps = 15/169 (8%)

Query: 12 LTQEVLKELLKTODNTGKLW 71 

LT + + ELL +D +TG W + + S F GY + 

Sbjct: 16 LTVKRIFELLSFDKSTGVFRWKVPTQ GRIALNSVAGAFDSN GYSMIM 62 



Query: 72 IFKKQYNAHRLIWLFVYGKH-ASSIGHIl^ 130 

I ++Y H L++ + + A I H+N +TDNR NLR+ EN +N KN+ S 
Sbjct: 63 IDGRRYKTHNHiVFYITHNRWPAGQIDHVNG 122 



Query: 131 GHTGVYFHKP SKKWQ ARIMVNKKNKILGLFEHIEDAA- KAREAASKDFG 178

G GV +HK KKW R+ + K+K G F+ +E A A EA K +G 
Sbjct: 123 GCRGVTWHKRQKKWNVRLGFHGKSKHFGCFDDLEI^ 171 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 151

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 301> which encodes amino acid sequence
<SEQ ID 302; NGS154>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -6.98
Possible cleavage site: 28
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 8.12 threshold: 0.0
PERIPHERAL Likelihood = 8.12
modified ALOM score: -2.12
Rule: cytoplasmic protein 



*** Reasoning Step: 2

Final Results

bacterial cytoplasm --- Certainty= 0.423 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_15830449 gi|15830449|ref|NP_309222.1| \(NC_002695) hypothetical
protein [Escherichia coli O157:H7]
dbj|BAA94132.1| (AP000422) hypothetical protein [Escherichia coli O157:H7]
dbj|BAB34618.1| (AP002554) hypothetical protein [Escherichia coli O157:H7]
Length = 148

Score = 42.7 bits (99), Expect = 0.003
Identities = 27/99 (27%), Positives = 49/99 (49%), Gaps = 10/99 (10%)

Query: 37 IRPRKSKRSVEQNRRLWFLYREI SEKVFIDGRRFSQDVWHE FLKRKFIGCIEMPN 91 

+ ++ KRS QN R+W + ++S +V G+R + + W + +LK K + +P 

Sbjct: 33 VHVKEPKRSKAQNDRMWPMLNDVSRQVLWHGQRLAPEDWHOD 92 



Query: 92 GQ LMGI STTKLSVREMSEYQEKI ISWASMEHGVLW 126

++G+ T+K+ M+E E 1+ W B V W 
Sbjct: 93 IDGGVVMLGVRTSKMRKASMTELIB-ImFWFGSER 130 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 152 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 303> which encodes amino acid sequence

<SEQ ID 304; NGS155>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -4
Possible cleavage site: 27
>>> Seems to have an uncleavable N-term signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.98 threshold: 0.0
PERIPHERAL Likelihood = 4.98
modified ALOM score: -1.50

*** Reasoning Step: 2



Final Results 

bacterial inner membrane --- Certainty= 0.046 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_15801502 gi|15801502|ref|NP_287519.1| \(NC_002655) putative
endonuclease of prophage CP-9330 [Escherichia coli
O157:H7 EDL933]
ref|NP_309804.1| (NC_002695) endonuclease [Escherichia coli O157:H7]
gb|AAG?6131.1|AE005344_7 (AE005344) putative endonuclease of prophage CP-9330
[Escherichia coli O157:H7 EDL933]
dbj|BAB35200.1| (AP002556) endonuclease [Escherichia coli O157:H7]

Length = 119 

sx^xfez?™* <««»■ — - 

Query. 71 ixu^JHK^WI^VBSM^™^^ 

Sbjct: 5 ^P^PTVNTYVmRRGSTYFVSKAGERYRRAVVl.IVRM « 

^ery: 130 «— ™~ i 
Sbjct. 63 PDKRRRDLDNILKAPLDALTHTUJVLMDDEQFDEINrVIi^ 117 

Query: 190 ME 191 
E 

Sbjct: 118 SE 119 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be 
useful antigens for vaccines or diagnostics. 

Example 153 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 305> which encodes amino acid sequence 

<SEQ ID 306; NGS156>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -2.61 
Possible cleavage site: 49
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.96 threshold: 0.0
PERIPHERAL Likelihood = 7.96
modified ALOM score: -2.09
Rule: cytoplasmic protein 

*** Reasoning Step: 2 
Final Results 



bacterial cytoplasm --- Certainty= 0.307 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gsa_AAG90098 C glutamicum protein fragment SEQ ID NO: 3852
| EP1108790-A2 | 26-SEP-2001
Length = 148

Score = 102 bits (253), Expect = 7e-21
Identities = 60/147 (40%), Positives = 88/147 (59%), Gaps = 18/147 (12%)

Query: 3 NAYEVADFFL SPFEEEDGEQI SNLKIjQKIjLYyAQGYAIiAILNRPLF AENIEHWQHGFV^ 62 

+A ++A++F++ +E D E +S LKLQKLLYY+QG +A R LF-H- I WQHGPV P 
Sbjct: 5 SAREIAEWFVAWGDELDAE-VSPLKLQK^ 63 



Query: 63 CIYRTYKKYGGSPLPAAHIEPDKYADEEL WLNRVRKEQGCYTAWALRNKT 113

+Y K YG +P I+PD+4- +E L V ++ G Y+AWAEjR KT 

Sbjct: 64 GVYSDTKSYGRNP- IDPDEFVSDEFNWDDYSOTSDELVTVWRKYGIYSAWALREKT 118 

Query: 114 HQEAPWIQT-RQGEVIGI — ALMGEYF 137 
H E+PW+ QG+ I I A + ++F

Sbjct: 119 HSBSPWLDAWAQGQNIEITDAALKDFF 145 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 154

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 307> which encodes amino acid sequence
<SEQ ID 308; NGS157>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne) 
Signal Score (-7.5): -5.65 
Possible cleavage site: 42
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 3.98 threshold: 0.0
PERIPHERAL Likelihood = 3.98
modified ALOM score: -1.30
Rule: cytoplasmic protein

*** Reasoning Step: 2



Final Results 



bacterial cytoplasm --- Certainty= 0.291 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gbp_6274533 gi|6274533|gb|AAF06681.1|AF163663_6 \(AF058689) Toul
[Neisseria meningitidis]
Length = 272

Score = 546 bits (1408), Expect = e-154
Identities = 267/272 (98%), Positives = 271/272 (99%)
Identities = 267/272 (98%), Positives « 271/272 (99%) 

Query: 19 MKGMDKLRYQRDFLNIRPIFTAGEQEYLTEL SDRLPLSVLTDSVRNIEE IG IDFVYSPAK 78 
MKGMDKLRYQ+DFLNIRPIFTAGEQEYLTELSDRL AK
Sbjct: 1 MKGMDKLRYQQDFIiNIRPIFTAGEQEYLTELSDRLPLSVljTDSVRNIEEIGIDFVYSSAK 60

Query: 79 LEGOTYNQYDTQALLKLGQTAGGKL 138

LBGimNQYDTQALIiKIiGQTAGGKLYSDAVM^ 
Sbjct: 61 LEGNTYNQYOTQALLB^ 120 

Query: 139 THSLISENLLEKGSGGWRTOSVTI 198 

THSLI SENLLEKGSGGWRRDSVTI SGTDYTPL SNPQSLDTELKWLLQEAPKIENPFDRA 
Sbjct: 121 raSLISEt^LEKGSGGWRRDSVTISGTD^ 180 

Query: 199 VYLHNNIAYLRYFKDCNKRT^ 258

VYLHNNLAYL+YFKTCNKRTJU^CMTLSIjMRSG 
Sbjct: 181 VYLHNNIAYLQYFKDC^ 240 

Query: 259 GLFKKYFISAYENTVNKYGPQPDVDIFRNFSI 290 
15 ^ GliFKKYF I S AYENTVNKYGPQPBVDIFRNFS + 

Sbjct: 241 GliFKKYF I S AYENTVNKYGPQPDVDIFRNF SL 272 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS157 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 155 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 309> which encodes amino acid sequence 

<SEQ ID 310; NGS158>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne)

Signal Score (-7.5): -6.98 
Possible cleavage site: 18
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1

ALOM: Finding transmembrane regions (Klein et al.) 
count: 0 value: 0.16 threshold: 0.0 
PERIPHERAL Likelihood = 0.16 
modified ALOM score: -0.53 
35 Rule: cytoplasmic protein 

*** Reasoning Step: 2 



40 Final Results 

bacterial cytoplasm --- Certainty= 0.185 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases:

> **gbp_15791833 gi|15791833|ref|NP_281656.1| (NC_002163) amino-acid
ABC transporter ATP-binding protein [Campylobacter jejuni]
pir||H81391 amino-acid ABC transporter ATP-binding protein Cj0469 [imported]
Campylobacter jejuni (strain NCTC 11168)
emb|CAB75107.1| (AL139075) amino-acid ABC transporter ATP-binding protein
[Campylobacter jejuni]
Length = 253

Score = 301 bits (772), Expect = 6e-81

Identities = 153/244 (62%), Positives = 195/244 (79%), Gaps = 2/244 (0%)
Query: 1 MALLSIRKLHKQYGSVTAIQSI^ 60 
M++L ILK YGS A++ ++L+++ EV+V+LGPSGCGKSTLIiRC+NGLE G+I
Sbjct: 1 MSILKIENLQKYYGSHHALKDI^ 60 

Query: 61 VMIXJTOEFGKDVS - WQTARQKVGMV^ U9 

+D + KD W RQKVGMVFQSYELF H++V ENILLGP+KVQ R + E + A 
Sbjct: 61 YIDNE-KI1}KJ)FKEWPRMRQKVGMVFQSYELFEHLSVE H9 

Query: 120 GKLLEKVGLLDItfCNAY^ 179 

LE+VGLL + +AYPRELSGGQKQRIAIVR+LC+NPE4-+L DE+TAALDPE+VREVLE 
Sbjct: 120 KIWI^KVGLLHKIHAYPRELSGGQKQRI^^ 179 

Query: 180 WLELAREGMSMLrVTHEMGFARKVAD^ 239 

V+L LA+EGM+MLIVTHEMGFA+- VAD+I+FMD+G I+E +DP++FF PKSERA++FL 
Sbjct: 180 VMLNLAKEGMTML IVTHEMGFAKAVADKI IFMDEGKI IEENDPKSFFKNPKSERAKKFLN 239 

Query: 240 GMDY 243 
DY 

Sbjct: 240 LFDY 243 

Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.
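
The summary line of each alignment above reports identities, positives and gaps as match counts over the alignment length. As an illustration only, the following minimal Python sketch reads such a line and recomputes the identity fraction. The function name `parse_blast_stats` is hypothetical and is not part of any alignment software; the input string is the summary line reported for NGS158.

```python
import re

def parse_blast_stats(line):
    """Return {label: (count, alignment_length)} for each 'Label = a/b' field."""
    fields = re.findall(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)", line)
    return {label: (int(count), int(length)) for label, count, length in fields}

# Summary line as reported for NGS158 against the C. jejuni ABC transporter protein.
stats = parse_blast_stats(
    "Identities = 153/244 (62%), Positives = 195/244 (79%), Gaps = 2/244 (0%)"
)
matches, length = stats["Identities"]
identity_fraction = matches / length  # 153 identical positions out of 244 aligned
```

The recomputed fraction can then be compared against whatever identity threshold is of interest.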

Example 156 

A DNA sequence was identified in N. gonorrhoeae <SEQ ID 311> which encodes amino acid sequence 

<SEQ ID 312; NGS159>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -4.16
Possible cleavage site: 13
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 6.47 threshold: 0.0
PERIPHERAL Likelihood = 6.47
modified ALOM score: -1.79
Rule: cytoplasmic protein

*** Reasoning Step: 2 
Final Results 



bacterial cytoplasm --- Certainty= 0.312 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_15794799 gi|15794795|ref|NP_284621.1| (NC_003116) hypothetical
protein [Neisseria meningitidis Z2491]
pir||B81819 hypothetical protein NMA1914 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB85135.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 206

Score = 265 bits (678), Expect = 8e-70

Identities = 131/146 (89%), Positives = 138/146 (93%)

Query: 1 MTKL YAQIAKTEAQDDGTVKVWGYASSEAVDSDGEWAAEAMKAAI PDYMKFGAVREMHG 60 

MTKLYA+IAK E QDDGTVKVWGYASSE +DSDGEV+AA AMKAAIPDYMKFGA REMHG 
Sbjct: 1 MTKLYAEIAKMETQDDGTVKVVTCYASSEEIDSDGEVIAAAAM PDYMKFGAGREMHG 60 
nilprv . 61 ^AAGTAIEINVEDDGRTFFGAHIVDPVAVTKVKTGVYKGPSIGGSTO 120
Query. 61 tffgaHI+DPV V+KVKTOVYKQFSIQGSVTAR+DmKSQITO 

Sbjct: 61 S^iSS 120 
Query: 121 LKLTEI SLTORPANPDAVFTCFKADK 146 
Sbjct: 121 LKLTEISLIDRPAOTDAVSTCFKADK 146 

A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS159 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.
Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 157 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 313> which encodes amino acid sequence
<SEQ ID 314; NGS160>. Analysis of this protein sequence reveals the following: 

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -8.3
Possible cleavage site: 33
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 7.85 threshold: 0.0
PERIPHERAL Likelihood = 7.85
modified ALOM score: -2.07
Rule: cytoplasmic protein

*** Reasoning Step: 2 



Final Results 

bacterial cytoplasm --- Certainty= 0.407 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

> **gbp_2126352 gi|2126352|pir||JC5218 type I site-specific
deoxyribonuclease (EC 3.1.21.3) Hsd chain S [validated]
- Pasteurella haemolytica
gb|AAC44667.1| (U46781) HSDS [Mannheimia haemolytica]
Length = 442

SSiiS'l JSWtSU,?^^^ <53%), Gaps - 3/U9 (2%) 

Query- 26 EVAE¥SKbn*ICSDKLNEHNXVGVDNLLQNREGK^ 85 
++ G +1 L + NY+ DN+L N_ G _L+_ +P+ J.^'JS* 69 




Sbjct: 10 D3 

Ouerv- 86 YLKKIWQADCTGGTNGDVLVIRV — TDEKVNPKYLYQyiiADDKFFAFNMKHAKGAra<CTG 143 
Query. 86 ^£™ A+ ^ + TD +N +VL+ ++ D F F + A GAKMPRG 

Sbjct: 70 YFKKVWI^SGGCSPI^VMRSI^ 128 

Query 144 SKAAIMQYKIPIPPLPEQEKIVAILGKPD 172 

K A+ + IP + Q+K +A FD 
Sbjct- 129 DKNAMKGFIFNXPSIEYQKKCIANYPAFD 157 
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 158 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 315> which encodes amino acid sequence 

<SEQ ID 316; NGS161>. Analysis of this protein sequence reveals the following:

GvH Examining signal sequence (von Heijne)
Signal Score (-7.5): -5.08
Possible cleavage site: 36
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 1 value: -1.59 threshold: 0.0
INTEGRAL Likelihood = -1.59 Transmembrane 302 - 318 ( 302 - 318)
PERIPHERAL Likelihood = 3.76
modified ALOM score: 0.82
Rule: cytoplasmic membrane protein






*** Reasoning Step: 2 



Final Results 



bacterial inner membrane --- Certainty= 0.164 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

The protein has homology to the following sequences in the databases: 

pir||E81921 probable DNA-invertase NMA0772 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB84055.1| (AL162754) putative DNA-invertase [Neisseria meningitidis Z2491]
Length = 321

Score = 295 bits (755), Expect = 9e-79

Identities = 151/322 (46%), Positives = 216/322 (66%), Gaps = 3/322 (0%)

Query: 14 LRNAVGLDISKLTFDATAIVGNAEYSAKFD^ 73 
+RNAVGLDIS TFD 1+ KF ND +G + + + +++++CMEATGN 

40 Sbjct: 1 MRNAVGLDISAKTFBWTIINGETDYRKFSNDEQG^ 60 

Query: 74 YYEEVADYFAQYYSVYVVNPLKISKYAESR 133 

YYE+ AD A+ Y V V+NPLKI YA+ RF R K DKQDAKLIA++C++A EL KR+ 
Sbjct: 61 YYEQAADCLAEEYHVSVINPLKIKAYAQKRFSRVKNDKQ 120 

Query: 134 KPTDEQYRLSRMTAAYAQIKSECAAMKNRHHAAKDEBAAKAYAEI IKAMNEQLEVLKEKI 193 

KPT++QY L R+ + +Q+ + + KNR AAKD K + + +K + L +K+K I 
Sbjct: 121 KPTEQQYSLKRLLSLQSQLIiEQQTSQKNRIKAAKDSFVQKIH^ 180 

50 Query: 194 KEQTEKPN — OOEGVKRLETI PAIGRMTAAVLFHHIjTS SKFETSNKF AAF AGL SPQQKES 251 

+QT K + KE KRLETIP++G+ TA L +L +S FE + +F A+AGL+P Q S 
Sbjct: 181 -DQTIXSDKKMKELTKRLETIPSVGKTTAI SLMSYLINSTFENAKQFTAYAGLNPHQNIS 239 


Query: 252 GTSVRGKGKLTKFGNRKLRAVLFMPAWAYRIRAFPDFIKIUjEEKKKPKKVI IAALMRKL 311 
55 * QTSV K K+TK+GNR++R LFM A+VA++ FP F RL++ KKPK +11 ALMRK+ 

Sbjct: 240 GTSVNKKSKMTKYGNRRIRGSLFMAALVAFKNNYF PAFTNRLKKAKKPKML I IGALMRKI 299 

Query: 312 AV1AYHVHKKGGDYDPSRYKSA 333 
V+A++++K D+D +RY++A 
60 Sbjct: 300 LWAFNLYKTETDFDKTRYQTA 321 
A homolog was found in serogroup A N.meningitidis but not in serogroup B, so NGS161 protein and
nucleic acid are useful for distinguishing between gonococcus and serogroup B N.meningitidis.
Based on this analysis, it was predicted that this protein from N.gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 159 

A DNA sequence was identified in N.gonorrhoeae <SEQ ID 983> which encodes amino acid sequence 
<SEQ ID 984; NGS162>. Analysis of this protein sequence reveals the following: 

GvH: Examining signal sequence (von Heijne)
Signal Score (-7.5): -0.86
Possible cleavage site: 33
>>> Seems to have no N-terminal signal seq.
Amino Acid Composition of Predicted Mature Form:
calculated from 1
ALOM: Finding transmembrane regions (Klein et al.)
count: 0 value: 4.08 threshold: 0.0
PERIPHERAL Likelihood = 4.08
modified ALOM score: -1.32
Rule: cytoplasmic protein

*** Reasoning Step: 2






Final Results 



bacterial cytoplasm --- Certainty= 0.032 (Affirmative) < succ>
bacterial periplasmic space --- Certainty= 0.000 (Not Clear) < succ>
bacterial outer membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial inner membrane --- Certainty= 0.000 (Not Clear) < succ>



The protein has homology to the following sequences in the databases:

ref|NP_312507.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
gb|AAG58749.1|AE005587_7 (AE005587) putative adhesin [Escherichia coli O157:H7]

^jlEABS^OS.ll (AP002566, hypothetical protein [Escherichia coli 0157:H7] 
Length = 1588

Stipes 2 - ^S"^^^ "0/358 (46.,, Oaps - 65/358 (18%, 
40 Query: 22 AVAI^SSSTASGEYSYAS^SVASC^SYAAGYASVASAEGSWIGDSRQVKPEA^ 81 

W uuery + A G+YS A G + A G SAG ++■*■+ + S+ +G S + * 

Sbjct: 93 STAVGYDAIAEGQYSSAIGSKTHAIGGASMAFGVSAISEGDRSIALGASSYSIiGQYSMAL 152 

Query- 82 AVGSKATVKNKAKQRVVVGSEAKVNAERGIAIGKEAKAGGKOT 141 

ska K + +G +-K IA+G KA T + +IA+G 
Sbjct: 153 GRYSKAL GKLSIAMGDSSKAEGANAIALGNATKA TEIM SIALG 195 

Query- 142 YQAEAGKGGAIALGKQAKATKQNGMM^SEAAGOTSTAVGNESKAKGQGG 193 

A A K ++ALG + A+++N +A+G E+EAA + +TA+GN +KAKG 
Sbjct: 196 DTANASKAYSMALGAS SVASEENAIAIGABTEAA-ENATAIGNNAKAKGTNSMAMGFGSL 254 

Ouery. 194 -VGLGNQSKAEADFAVAV — GNKAEATKE NSLVIGRYARANGN 233 

+ LGN S+A AD A+A+ GNKA+ N+ * 1U 

Sbjct: 255 ADKVOTIAIXSNGSQALABNAIAIGOGNKADGVDAIAI^NGS 31* 

Ouery- 234 HSVSIiGSRSEIKDGVSNSVAPGYGSVASENNWSVAYKETPQSTELSYRK 291 

S++LGS S +G+ NSVA G S+A +N VSV 25LL»Lt« 
Sbjct- 315 KSLALGSNSS-ANGI-NSVALGADSIADLDNTVSVGNSSLK RKIVNVKNGAIK 365 
Query: 292 -3TOFDAVNVRQLKAMQGQNMAEL 349

+ +DA+N QL A+ SV + G AA +T +Y+ N ++ VG 

Sbjct: 366 SDSYDAINGSQLYAISD SVAKRLGGGAAVDVDDGTWAPTTOL^ 417 


Score = 86.3 bits (212), Expect = 1e-15

Identities = 68/253 (26%), Positives = 118/253 (45%), Gaps = 39/253 (15%) 

Query: 28 SSTASGEYSYASGYNSVASGNKCT^ 87 
10 S+ +G + G + A + Y S ++ +G V IG G+KA 

Sbjct: 38 SALVAGGMLS SFGALANAGlTONGQG\n3YGSGSAGIX3WVAI^ -GAKA 83 

Query: 88 -TVKNKAKQKWVGSE^UCVNAERGIAIGKEAKAGGKTT^ -146 
T N + VG +A + AIG +AG+ +A G A + 

15 Sbjct: 84 NTFMNTSGSSTAVGYDAIAEGQYSSAIGSKTHAIGGAS MAFGVSAIS 130 

Query: 147 GKGGAIALGKQAKATKQNGMALGVESEAAGDFSTAVGNESKAKGQGGVGL^ 206 

+IALG + + Q MALG S+A G S A+G+ SKA+G + LGN +KA 
Sbjct: 131 EGDRSIALGAS SYSIX3QYSMM/3RYSKALGKLS IAMGDSSKAEGAN^ 190 


Query: 207 AVAVGNKAEATKENSLVTGRYARANGNHSVSLGSRSEIKDGV SNSVAPG 255 

++A+G+ A a+K S+ +G + A+ +++++G+ +E + +NS+A G 

Sbjct: 191 SIALGOTANASKAYSMALGASSVASEENA^ 250 

25 Query: 256 YGSVASENNWSV 268 

+GS+A + N +++ 
Sbjct: 251 FGSIADKVNTIAL 263 

Based on this analysis, it was predicted that this protein from N. gonorrhoeae, and its epitopes, could be
useful antigens for vaccines or diagnostics.

Example 160 

Further open reading frames were identified in gonococcus <SEQ IDs 317/318 to 8621/8622>. These 
polypeptide and nucleotide sequences are useful for studying gonococcus, for diagnostic purposes, as 
antibiotic targets, and as vaccine antigens. 
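
The sequence listing pairs each nucleotide sequence (odd-numbered SEQ ID) with the polypeptide it encodes (even-numbered SEQ ID). As an illustration only, the following minimal Python sketch translates the opening codons of SEQ ID 13, as read from the listing, using the standard codon assignments. The names `CODON_TABLE` and `translate` are hypothetical helpers, and the sketch ignores alternative start codons and reading-frame selection.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 51 nucleotides of SEQ ID 13 as they appear in the listing.
orf_start = "ATGGATATGAAAAGACGCGATTTCTTAAAAATGACCGCCGCGCTGGCAGCC"
peptide = translate(orf_start)
```

The resulting peptide can be compared with the start of SEQ ID 14 to check that a nucleotide/polypeptide pair is consistent.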






It will be understood that the invention has been described by way of example only and modifications may be 
made whilst remaining within the scope and spirit of the invention. 
CLAIMS

. . A cwnience selected from the group consisting of SEQ IDs 26, 72, 
1. A protein comprising an amino acid sequence seiecxea nvm ^ * 56 

, ===== 

« « ' l£ 62 S M « "0 .72. 174. 176, 178, 180, 182, 184, 186, .88, 190, 1A 1H 
^^»mmV294,296. 2 »8.30am304,306,308,310,312,314»d316. 

m « » W M m S % 68 170 17X 174, 176. .78, .80, .82. .84. .86, .88. 190. .92. 

4 M m fib^,»Uch S p K ifi<3Byb^»apro t etoaccoritogu, m yo n eofctaml Kl 3. 

6 A -udeic acid .ta* accorfing to Cain. 5, compnaing a ri* — -"J"? * 

• Iliilllili 

«. S m m » 277, 279. 281, 283, 285. 287. 289. 291, 293, 293, 297. 299. 30, 303, 305, 307, 
309 311, 313 and 315. 

. liH ill? 

311, 313 and 315. . 
8 . A auclac acid atari, cotopriang a mdeodda .*«. o^-y » « aod 
40 n»lecnka«»rfiiigtoanyoiieofdatais5to7. 
9. A nucleic acid molecule comprising a nucleotide sequence having 50% or greater sequence identity
to a nucleic acid molecule according to any one of claims 5 to 8.

10. A nucleic acid molecule which can hybridise to a nucleic acid molecule according to any one of
claims 5 to 9 under high stringency conditions.

11. A composition comprising a protein, a nucleic acid molecule, or an antibody according to any
preceding claim.

12. A composition according to claim 11 being a vaccine composition or a diagnostic composition.

13. A composition according to claim 11 or claim 12 for use as a pharmaceutical.

14. The use of a composition according to claim 13 in the manufacture of a medicament for the
treatment or prevention of infection due to streptococcus bacteria, particularly N.gonorrhoeae.

15. A process for distinguishing N.gonorrhoeae from N.meningitidis, comprising the steps of: (a)
contacting a protein, a nucleic acid molecule, or an antibody according to any one of claims 1 to 10 with a
biological sample under conditions suitable for the formation of antibody-antigen complexes; and (b)
detecting said complexes.
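
As an illustration only of the percentage sequence identity referred to in the claims, the following minimal Python sketch computes identity over two sequences that have already been aligned. The function `percent_identity`, the gap symbol and the toy sequences are hypothetical, and the choice of denominator (all aligned columns, including gapped ones) is an assumption; other conventions exist and give different values.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)

# Toy alignment: four identical columns out of six.
identity = percent_identity("ATGC-A", "ATGGTA")
meets_threshold = identity >= 50.0
```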

SEQUENCE LISTING

C^CTMTCCCTCTTCCOT 
ATTTCAAATACTTTTTGTATGTTC^ 

GAQAATAAAAAAATCCOTCTTCGGGGGCAGGTTGTTTATTTGC 
TTTCCGCCCCATTTTATTTCGACGATG^ 
GTATAAACAGGAAGCCGAATTGGTGGAAAAAAAGGGCT^ 
TCGGATATGAATCGGAAACCGTTTACTTGG 
SEQ ID 2 

MIHHIEKYIGTISHGSKSDSGSQYKliNIAAIPSSPNro 

EOTOCILIiRGQVVyLPRSIVNSTKMDA^ 

SDMNBKPFTW 

SEQ ID 3 

AT GGACTATAACCAAACTCTm 

GAATTGTTGAAGAACTTCCCGATTTCCAAGT^ 

GATAGGTCAATTTTTAGGGCAGGAGTTTTTTATCATC^ 

TCTGCAAGTATGCATTATCCTGATCAATTTCAATTAGGGAA^ 

GGCATTTTCTGATTTCTCTGCCTTATCCTTATGGGCAAGAGTTG 

TACCCAGAGAGAAAGACTTTTCTTGAATACTCATO 

GATATAAATCGTGCGAGTACTGTCTGGCAGGCAGGA 

SASMHYPDQFQLGKTVNIGRPWVEQSSFR 

dinrastvwqag 
S aS^ccccgtotggattattaccggc^ 

CCGCACTCGCCCTTTCTTT^ 
OCTCTCGAGCCTCTCTATTTTCCTGATTCCGCTGM 
ACGATGGCGTTGCTGTTGAGTTATCCCATTTGGCGCAACCAAATCCT 
TTGCCACCACGGCAGGTTACGGATTGGCAGGCATTACGOTGC^ 

tccgctgttgattgccgcok^^ 
gggaccxx:ggccggtatttccatcggcgta^ 

actcgaaacaggtcatcaccgcgcctotcgttgaaacggtgctgctgttcaatccc^ 

cggttacgaaaatacggctatgtat^^ 
caggrrttatgggttatcattccgcttgtttt^ 

S ^^IITGKEARDSL^ 
TMALLLSYPIV^QILAGKFVGHLIIIALATTAGYGLAGITI^ 
GTAAGISIG\A^FFWIFDMALIX3ILVADSKQVITAWVE^ 
QVLWVT I PLVLAAGIFRKRRI 

SEQ ID 7 

ATGTCAGACGAAAAATTAGAACAAAACGGCTTGAGCCGTCGTTCGTTCTO 
GTGCGGGCCTGTTGGGTTTG^ 

GGCGCAAC^GGAATCCAAACCCGGCCAACTGTCTTCCGAAGTCG^ 

GGCCAGTCCGGCGAGATGCGCCTCATCGGTCTGCCTTCTATGCGCG^ 

CCGGTTGGGGACGCACCAATGAGAGCTTGAAAGTCCTCAAC 

CCTGCGCTGCTACCCC^CGGC^^ 

AACGACAAGGCAAACAACCGCGTCTGCCGCGTGTO 

GTATTC^CGGTCTGCGTCCGCAACGTTATCCGAAAACCGGTTATGTTTTTGC 

AGGCAAACTGGATGATGCGAAAACCTGGAATGCAGTGTACACCGCCA^ 

GTTGACGGTAACCTGGACAACGGCGATGCCGACTATCAAGGCAA 

CCX3TACAAGGTGCGTCTTCCAACGAGCAGGAC?TGGTGCGTCGTTO 

CTTCAAAGAAGTTAACGGTGTGAAAATGCTCGACGGCCGCGCTGAG 

TCTCCTCACGGCTGTAACGCAAGCCCTGACGGTAAATACATCATGCC 

TCAGCAAACTGGACC^TTTCTTCGCCGGCAAAATCAA^ 

ACACACTGCATTCGACGGTCGCGGCAATGCTTATACGACATTGTTO 

ATCAAAGCCTACAAAGGCGAGAAAATCGATCCGATCAAACAAAAACTC 

TGGGCGAAACCAAAGAAGCCGACGGTCAATGGCTGG^^ 

GAAACCCGAGTGCGACCAATTGATCGGCATCTCCGGCGACGAAATGCGTCTG^ 

GACTTGTGTTTGGTTGCCGCGTCCAAACTGAACCCGGGC^ 

S MSDE^ 
GQSGB^IGLPSMIttLMRlFV^^ 

VIXjNLDNGDADYQGKYSFATCYNSERALTVQGASSNEQDWCWFDLK^ 
SPHGCNASPDGKYIMPNGKLPPTVTVLDVSKLDDIiFAGKIKERD 
IKAYKGEKIDPIKQKLOTHYQPGHNHTT^
DLCLVAASKLNPGKTWDRKDPWP 
SEQ ID 9 

ATGAAACTAAATACTCTCACATGGGCTTT^^ 

AAATACAACCCGTCAAAACCTTCTCCCCGCCCAAACCGA 

CCGCTCCGACCGCAGCGATTATTACTTTGTTACCGAAAA 

GGCAAAAGCTTTTACAACTCCGTTACCGCGCAAGCACTC 

ACGGCTACAAAGATGGCGGTGGGCGCGACACTGATTGGAAAT^ 

TGAAAATCAAGAATACCGCCTCACTTATCTGCACGACGACATCAACAA^ 

ACCGAACGCCACATCTCCAAACT(^CGTGCGTTGGGGC^ 

AACTCAAACGCCATGCCGACAATTACTCCTTGC^ 

TGATTTCTCGCTCAAACACGATGCGGATTrCGXSCAAATTC 

GAACGCAACACCCACACTGCCATGTGCGATTTCCTCAACGGCTA^ 

ATACCTTATCTTACAAGTTTGACGACCGACACAAACTGGGC 

TACGGCACAGCCTGAAAACCCAATAAAACCCGGTTTCCCCTTCGCCTCA 

TTCAACGGGAAAGTGCGCCGTCATGCTTTC 

GCTGATGAACCAAAATCCAGCCGCCGCCATTGCAGGCAATCrc 
SEQ ID 10 

MKIiNTIiTWAIiMrVPSVAPSWAEQPANTEBIQPVKTFSP 
GKSFYNSVTAQALGAKVYGVANLNRTKANGYK^ 
TBRHISmNVRWGNADLSNTVSAEAGVIKLKRHADNYS^ 
ERNTHTAMCDFLNGYRFADVHIDRWCI^ 
FIK3KVRKHALSGELKYDFTPSETQCT 
SEQ ID 11 

TTGCAGGCAATCCCCCTGCTGAAAACTGAAAAACACAACCGCATCA 

CGCACACGGACAAAGCGGCACAGCTTCCAAAGACGGC 
TACGCGCGTTACAACTTCAATCCGCATTG^ 

CGCCCTATCAAATCCGTCCGTTTGAAGCCGCCGTCCAAGCCGACTACAAAAACTAC 
GGCAACACGCTTTGTCGCCAAACAAACGCGCGGTGATTTTG^ 
GGCTTTACCGTTGCCGACGTGTACGCAGGCGTAAACATCAAAGACAAATACGGCT 
AAAAATACGTCGAATACATCAGCGGCGACCACGTCCTCGCCCTATCCCCCAG^ 
GAGTTTGCATGCGGCATTT 
SEQ ID 12 

LQAIPLLKTEKHNRIRLTADSRITOYYNGYMNSLAG^ ITRNVDARLFTAQA 
YARYNFNPHWAAGIK£AYNYGH)raTIX3RPPYQIRPF 
GFTVAnVYAG^^IKDKYGLRLGVNNVFNKKYVOT 
SEQ ID 13 

ATGGATATGAAAAGACGCGATTTCTTAAAAATGACCGCCGCGCTGGCAGCC 

AAGAGCAATTTACCGTGTACGGCGCACCGGCAATGCCGAGC^ 

GGCGGACGTGTC^TGAAAATCTCGCGTTCGC^ 

CCGAGCAATGTCGGCGTAAACCTGCGCAACCAAGGGCAGAAAGTCGGC^ 
TCTGCAAAGGCAGCGCGATTGCCTCGCCGGAGGATTTGGTCGGCAAAAAAATC 
CGTGCTGCAAGCCTTGTTGAAAAAACTCAAAATCGACGCACA 
GGACTATTTCCAAGTAAAGGTTACCATGCCGTCM 
GTATAAACGTCGTGCACGGTTTTGATTTGGTGAAA 
C^TCGCCAACGAAGAATATTTCCACGCACACAAGGC 
TTGGCCAACCGCCAAAACGCGGCGAAAATCGGCAAAAACT^ 
CACGGCTGACGGTAAGCAAAGGCAGCGAAGTGAAAAACGAGATTTTGAAGT 
TTTGGGCGGCAAGCTGCCGGATAACGGGTTCTTCTTGGCT 
SEQ ID 14 

MDMKRRDFLKMTAAIiAAAGVS PSLLAAGKEQFTVYGAPAMP SVT I AVAALQGKLAKQ ABVSLKIWRS PDQLjRAGVASGQFKVMMS 
PSNVGVNLRNQGQKVGMVNILTNGI^ 

GLFPSKGYHAVILPE PMATASLLKGKT IGINWHGFDLVKAWGQAFDTKPL I PMAGI IANEEYFHAHKAQFDIFHQDLKNALNWI 
LANRQNAAKIGKNYLPAPEPALVMGLIXsAIUjTVSKGSEVKNE ILKFYE ILMQFNPELLGGKLPDNGFFLA 
SEQ ID 15

ATGGCGATGGTGGCGTTGTGGGCGTGGGGCAGCGCCG 

OTTTGGATTTATTGAAACATTTTCAGGAAAATGAAATC 

AGCAGGATTGGCGGCGGGGCTGGTGGCGGGGCTGGTCGCGGGCAGT^ 

ATTTTGTTAGCAATGCCGCCGATTATTTGGGTGGTGATG 

TCATTGTGTTGGTTGCGCCGCTGACGTTTGCGAGTGCGG^ 

CGCTTATAAATTAGGCCGTCTCAAAAAAATCCGCTATC 

GCGGTGGCGATGGGGGTGAAGGCGGTGATCATGGCGGAACTOT 

GGGCAATGCTG 






S ^^lAm SA VFGEFMLPAPV^ 



AVAMGVKAVIMAEIjIjGASKGVGARIADARAML 

ATTATTATGAG 
SEQ ID 18 

NV1TLPTPLSERDSAFEEWVAREFQGIHYYB 

ESLAIiGYETRAIIAYGSVAIiGAESVANEENTVSVSSDTIiKI^ 

CGAAATGGAAAACGAGCTGCAACAGTTGCGCCGCGCGTTGTC 
SEQ ID 22 



MSPMARKI 



YDAVNVRQLNRLSKRTNRVGASAAALASLKPAQL^ 



GGCTGCTTTTTGTTTTTCAGACGACCTGTAAT 
SEQ ID 24 



TDICMOJ^FHYHTLFNALFQEKENSLCMJ^FVFQTTCN 
S ^OTATAGrreTAa^TTT<^CATA T GCACGGAAGGAAACGC 

CAACCCOSACCAACGCGCCGTC^CGCCGCAaSGCACATG 

S SfStvvltasai^sgcvadpv^ 
yi^jtrgvaasrltvygygshmpvasnatvbgraqilehweiijiilpdqravhaarhm 
SEQ ID 27

ATGAAATCGAAACTTACTGTCG^ 

GAAATTTTCTTATTCAAGAGATTCCTCTTTTC 

GCTGTTTTTC6ACCACTTGATAAAAGCTTCAGG 

GCAGCCATAGAAGAACACAGTGGTAAAATCAGAAATVAATGGAGAGAAOTAT^ 
CTK^TTTAAAAGGAATTTTAAATAGATATGAG^ 
SEQ ID 28 

MKSKLTVVYYDIjESNIAEEILSGNIMPDGNFL^ 
AAIfiEHSGKIRI^GENYLSVNFPPKKW 
SEQ ID 29 

ATGGTGCAAATGGCAACAATAACCAAGCGGCGCAATCCTTC 

GCTATCCTGCTTTCAATGAGAGCAGGACXSTTCAGTAAAAA^ 

AGCCGGCCC^AACTCOTTTTCAAC^ 

CTTGGAGCGGGTCGGTCAAAGAAAATGGGCTTGCGTTTTCTC 

GGTCTGATTTCGCGGAACACGTTATGCAGAGGCGGCGCGGAATC^ 

GCAGGAGCTGCAATATATCCGTTCCGTGCTAAAACACGCCTTTTATO 

GCGGCAAAOSGGCTGAAACGCTCGAATATGGTTGCGAAA^ 

TGACAACTCATTTCCTGCGGCAATGGCAAAGCAGGAAATC 

AAGGCGGCAGGATGAGATrTGCCGCCTGCTGTrro 

CCGAACGGCAGCACAGGGAATAATAAGGAGTTTGATATTC^ 

TCAGGAAGCGTATGCTTGCCAACAAAGGCATCGCCGACAGCCTTGTA 

GGCGTGCAAGGTTCTTGGAATTAAAGACTTGCGCTTCCA^ 

ACGATTCCGCAAATGCAACGGGTAACGCTGCAT^ 

GGCTGGATTTTAAAGAGGCAATGATGCAGGCGCAAAGCGATATAA 

SEQ ID 30 

HVQMATITKRROTSGETVYKVQVKVGKKCTPAFN 
I/jAGRSKKMGLRFIjMEFPIGGIGIDKLKRSDFAEHVMQRRRGIPELD 

AANGLKRSNMVAKSAIRDRLPTTEELQTI/^ 
PNGSTGNNKEFDILFMALPNTIDELPEESV^^ 

TI pQMQRVTLHIXaWNSLQRYVSVRKRSTRIjDFKKAMMQAQSDIKSGK |j '' 

SEQ ID 31 

TTGCGACAACGGCGCCGGCTTCAGCTGCTTCAACTG 
GAAGCGGCGGCATGAGGACGCACATCCGAACCT^ 
CGGCACGGCTCCCGGCGCGGCGGAACAAGCGTTTTTCAGCX^ 
GCTCCGCCACACCCGGACAAGCCAAAGAACGGACGAAAACCCATGACXST^ 
ACGCCGAATACATCACGGGCGGCAGCTTGAGGCGGCTGGTTGCC^ 
TGACGGCGCGGCAGGAAGCGGGCAGCTGGAGCAGTACATCGAACCGTCCX^ 
AAAGCCTTGTTGCAAAACTACCCTGCCGCCAAAGTATACAACACAAGCC^ 
CCGTCATGAATCCGCCGTTTTCGATCAAGCTAAAGGATTT^ 
GAAATCGGGCGTAGCCGATGAAATCTTCGTTTTAAAAGGATTGGAA 
ATCGCCTACCGCAAAAGCGAGCAGCGATTCCGCGAAATCATCGGCAACCG 
AAGACACGCCGATCGAAGTGCTTTTACTGGTTATCG^ 
AACAGACACCCTGCTTGCCGCCGACACTTGGCAAATCGAACCGGACTTC 
AAGGAAGACCCCGTCCTCTTGGAACACGAATGCCGTG^ 
TCAAC^GATAGAAGGCTGGCCGCATGCCGAATT^ 
AAAAAAACACTACTTCCCGTGC^VGCCa^ 
SEQ ID 32 

I^QRRRLQLLQLDRTFQTIX5KPDQPHAQGSG^ 

APPHPDKPKNGRKPmTQGHNNRKKAGGYAEYITGGS^^ 

KALLQNYPAAKVYOTSLFLYTDGEPQDCTVM^ 

1AYRKSEQRFREI IGNRLAELNRI QNAFEDTPIEVLLLVIDKDKTDGGC IRELYIX^KTDTLIjAAITIWQIEPDLWQTVQEPAPPKE 
KEDF^LEHECRDAAAKRIARELRFSKM\^^ 
SEQ ID 33 

ATGCAAACAGTAGCAACAAAACCGACGGCAAAACAGATGCTTGCCG^ 

CCGTCAAACGCGCCXSGGACGGTCAGAAACGTTGACCGGAA^ 

GATGTTGTCTGGCGCAAAGGTATCGGAAGACGAAGCCCTGACGTGCGGCA 

GCCTGCAATCAGGAATTAATCAACTTCGCQSAACAT^ 

CCGO^CGGGGAAAGCGTACTGTTTGC 

CCCGAATCAGCGCCAACTCGTGCTGCGTCCGCTGTC^ 

CTGATAGCCXSAAGTATCGGCATACTCATTCGCC^ 

TTTCGGCGGTTGGCAAAGTCGTCAACGGCGCGGATTC^ 

CCGGATCCTACACGCCGCGAACCTGCTTTACGATC 

AATCCCGTGCGCCCCCGACGGATAAGCGACGTGCGCCGGCCGATGATGAAAATGCTTG 

GGGCCGTAAAAGACTCGGAAGACGTCATCCGGCATTGCGACAACGGCGCCGG 

ACXSGACGGCAAACCTGATCAGCCTCATGG^ 

SEQ ID 34 

MQTVATKPTAKQMLAAKRAAKESTRQ^ 
ACNQELINFAEHIVKQVQRIiGIiYCOTDDPANGESVIjF 
OTVRPRRISDVRRPMMKMLVADKGGALVRAVKD!
AAAACAGGAra^CTTACGCG^ 

^^SSRPIPIKKLLSVI^DEQAVAIVERLEEQAYINRAFG 
QCQRCACOQTTGAAQAaGCCSTCGCCASCQCCCOTTCTGC 

ATAOKGCCATGCCCGACCATCT^ 

GGGGGCAGGAACTGAACCGCCTGTTTGATOATT^^ 

AATACAGGC^TGTTAAAGCCGCTTCAGACGG^TACCCOT 

ATCCCCTTGGTGCAAAAGGCG< 3 AAAACGGCGAAA^ 
SEQ ID 40 ^^^n^T^YSRLSRFFRRFDPPAYKAATRAFEKEWRIEraJ^SAPDQ 



IPLVQKAENGEIRKIIKPIVFGBKDP* 
LADNRSEGKI IRNFACG 

j !5Ja^^ 

AC^AACCCAATACAAGC^^ 

MTGCTGGCftCTGCAAGCAGACA&TCTCQACftC^^MCM 



ATGTCCATCGGCATTGATGGAGATATATTGC^w 




GGCGAACCATCAGATTTTGCGCTAAGAACCATCAGATTT 
GEPSDFALRT IRF 
ATGGCOTTGAGGAACGCGTCTGATTTCTTG^
GATACGCGGCGAAAAGCGTTTT^ 

ATACGCGGAAACGCTGCGCCGCGGGGCGATGCGCGACGCG^ 

AACTGCCGGATTGAAAACTACGCCGTCAGCGATTCGATC^ 

ACTTCGCCGATGTCTTGCAGACXSGGGCGGAGCATGATT^ 

CGCCCGCGAAGTCATCGCCGCCGGCAAAAGCGCGCTGGTCATCA 

GGCGGCGGCGAGGCGGGGGCGGTCGGGATTTTCGTGAAGCCCGATTTG 

AAACG6ACGGGCGGATTTTGTTTTCCGTOTTCAA 
GGAAGCCTTCCGCGAAAACACCGACGCGCGGATCAGGGACAGGCTGCGG 

AGCTACCGTGCG 

^M^RJJ^DFLGAYGGGTO 
NCRIENYAVSDSIPGMARAKAAAAEYAA 
GGGEAGAVGIFVKPDLLVLDEFGMSL 
SYRA 

S aS^tataaaagaatttatgtctaactataccaacc^ 

ATTCATATACTTGGGATGGTTTATTATCTAAAATTGCAATAGATTO 

GTACTGTGAAGATGGTAGATTCOUIT^^ 

GATGGTAGATTTAAAGAAATAAATGATAAGTTCTTTGA^ 

CCACCCTGCTCTCACAATTGAATTATAAM3ATAATTCTAATACAGAAOT 

AATCATTACAACAAATTATGATAAATTAGCCCAAGATATTTCT 

CCTTATGGCTCAGTATACAA^TACATGGTTGTGTGGACGATCCATCAAAAAOT 

AAGAAAAA.TATGAACTTATTAGAGCCCAACTATTATCGTTATTT 

CGAAAATATCAAAGAAATTTTAAAAACAATC^ 

TTACTCGTAGAATATGAACCTGAGTC^ 

TTAATAAAATCAAAACAGATAACTTCTCACAAATTTATAAAGCTCTTCCAGAACTAACACT 
TAAATTTCAATCTATAGCAAAGGAGATTTA^ 

AGCGATAAAGTGGTTGCTATTGGTTCAACTAAAACTATCAGCTACAACOT 

AAATAATTGAAGAGGAAAATTCACAACTTCTAAAATTAATAGAT 

ATTTAGTAGGATATGTTCTGATATACATAAAGAAGCTGTACT 

ATAAATAGGCGTTGTAAAAATAATCATTCATCCATCCAATCAATTTTAGA 

CGATTGCTTGGGGAATAWMAATAACCAACTOTCAG 

GCACTATAAAAGACTACTATGTATGTTCGATTATAAAAAATATGCGGATACTGTC 
DGRFKEINDKFFENMRAGNTLSRFKXXTSTLLSQLNYKDNS^^ 

PYGSVYKIHGCVDDPSKI I ITKKBYEKFKEKYELIRAQIiLSIiFIHNPI IFI/?XNVGDENIKEIIjKTIFTYVEQNSPSANKIRRNF 
IiVEYEPESNNEDXVBHDIDIT^ 

SDKWAIG STKT I SYNFQTT SEMMSNYFKI IEEENSQLLKLIDKHS IASTQYFP IYGF SRIC SDIHKEAVLKRQQKEKLiDHF IBK 
INRROOINHSSIQSILDDENISOT 



GCCCGAAGACX^GACCTTTTGAAACGCG 

CTTACCGAAGAACAGGTCAAACTCGGACACCAACTCTGGTA 1 ^^ 

GCCACAA(XTTGCTTCCGCCGGTGTGGACAATATGCCGAO^ 

CGCATTGAATGCTGCGCTGCTGGGCAGCCAGTTTTGGGACGGAC 

AATCCGGTGGAAATGGCGAATGATTCGCAAGAGGCGGCTGCAGCCAAAATC 

AAGCTTTTCCTGAAGACGGCGCTCTTTCGTTTAA 

C^^TGGGACGAATACCTCAAAGGCAACGTCAACGCCCTGAGCG 

TGTATTGCCTGCCACAACGGTGTCAACCTTGGAGGCACGACCTTCC^ 

TTGAAGATCCGAAACGCGACAAAGGCCGTGCTGACGTAACCAAAAAA 

TAACGTGGCTAAAACTTATCCXSTATTTCCACAACGGCAGC^ 

TTGGGTAAAGACATTCCGAAAGAAGATGTXK3ATAACATCGTC^ 

GCAO^TGCCGGAACTGCCCCn^CCXX^CCGATGGAATOT 

SEQ ID 50 

MSFKLRYIiASVLALSSLliAACGGQEKSAAGDASPASETEAASQVQASEAVPSASS 

FTEEQVKLGHQLWYE PRL SKGNTVSCNSCHNLAS AGVDNMPT SQGHKGQFGGRNS PTALNAALLG SQFWDGRAADVEEQAGGPLV 
NPVEMANDSQEAAAAKIAKWEYQEMFKKAFPEIX^VSFKNITT 

C I ACHNGVNLGGTTFQKFGLVQGPYWKFIEDPKRD^ IMGKAQ 
LGKDI PKEDVDNIWFLNAL SGNVSESARTMPELPLTAPMESKPDNK 

S ATC^cL^CK^O^^ 
ACACGCTGGAAGCCGTTATCGGGCAGTTCGAGGTCAAAGCGCAATCCGCT 
AATGCTTGAAGGGCACATCAGGCAGATGACCGGGAAGCTCAAGGC 
CGCAATATGCAGGCGGCGGGCATTACCGAAATCAAAGCGGATGACGG 
TCATCTTAGACGAAGCACAAATCCCCGCCGAATTTA^^
GATTGAAAGCGGTCGGCAAGTAGCWSGCGCGAAGATTGAAGGGC^ 



CTGCAGGCGGATTTAGCCTACGCCGCCGAACGCATT^^ 

CGGTAAGCGATTATTTCAGAAACATCCX3TACGCATTCCATC 

AGCGGCAGATTATGCCCGTTACAGAAAGTGGAAC^^ 

AAGAGAGACCTGAAGACGGAAAATCAGGAAAAOSGCAGCra^ 

AACTCAACGACAAATO^CCCTATATCGGTGCGCGCGTCGCCTACGGACACGTC^ 

AACAGGTACTCTTACCGCCTACCCTAGTGATGCTGACGCAGCAGTTACGGTTTATC 

CAARAAAGCARCAGCAGCCGCCGCTTGGGCTTCGGCGCGATGGCGGGCGTGGG 

C CGGCTAC<XCTACCACAACTGGGGACGCTTG^ 

C 

SEQ ID 54 

VOADLAYAAERITHDYPEPTAPGKNKISTVSDYFRNIRTHSIHPRVSVGYDFGGWRIAADYARYR^ 
KRDUOTENQ^ 

QKSNSSFJUJGFGAMAGVGIDVAPGLTLDAGYRYHNVTCRL^ 

S |!^cggcgttgcggcggtaggcggc^^ 

CCGCCGAACACGGT^CGAGCCGCCCA^^ 
<^CTTGTTCGCGGTAGCGT0X3G^ 



CCCGACAATTGCGCCTGATAAAACGGCAAAACCCCCACCGTCTGC^ 

AAGCATCGTCATAAGCCGATTTGCGGCTGAAATAATAGCTTGTGCCCCCGAC^TTGGCGCGGAA 
CAGCGCAGTCAGCTTTTCCGC^ 

CCGCCCTGCAGGCAGTATTGCGGCGCGGCATTGTTGGCGTTTTTGTTC^ 

TCAGCCCCTCCGTTTTTTTCCGGAAACGCCCCACATOTTCCAAAACCGGGGC 

(^GCCCTGCCGACTTCAGCCGGAAATCGTCAAACTCCGCCGCCGCCAAATCCAGCAAAACCCGCTC 

TGCAACGCCCGATAGCC^GCCACCGCCTC^ 

CATCATAATCCGGCAGCTTGGCATACAAATCCC^CAACGAAGCGATCAAATC 

GTOITCCAACAACTOCGGGCGCGACAGCAGGAAATCCCCGTCGAOT 

GGC^TTCCCGCCC^CCCTCCGCAGCCGCACGCTGGGCGCGCAACGC<^ 

ACGCGCCGACCGCACCCCAAACACACAGCGCGG^^ 
AAACACCGCCGGACGGCX3GCGGCAGAAACGCC3CAAGCCCCACGCGC^ 

S ^G^^GGIVDRAAHRFAFRHKPRKI J PAAEHGCRAAQPPCQBHIKEVL 
LHTRPTOQHIRRKPAVGRRFRFRIKAVATRQLRLIKRQNPH^ 
ORSOLFRPGLIIQPCRTDP^GNAAYPAAAAI^VLRRGIVGVFVHRPADAARK^ 
OPCRIiOPErVKLRRRQIQQNPLVCRIIPVQRPIARHRLR^ 
WQQ^QQBIPVDYARTAVO^PFRR^^ 
KHRRTAAAETPQAPRAADAAHHIVD 

S A TCAACArrACCATAGCCGCC^ 
CGCAATCGCACGACGTGTTGCTGATTACCAGCAACTTCAAGCACTACGACA^ 

CTCACAAGGCCGTCTGAAAGTCATCCTGTTGGAA 

CGCTTCGTCAAACATTTTGAAAAATGGTTGGAAAACTGCCGTCCGGGCGAACAAGACGTC 

rrACCAACCTGCTGTTGGGCAAACACAAAGCGCGTTTGGGTTACAA^ 

CTCCTCTGTCGTGCCGTTTTTGAAAAAAATCCCXKACAACCTACTGCCCTTTGCTTCACGC^ 

GACGOSTWMTCGCCX3TATCGCAGACCTATCTCGACCGCGCCAAAGAAGCCAATC 

OTGCG(»TTTTCCCGCCATOGCCaXX:CCC 

CTATGACGTGGAAACCGTGTGO^AAGGCGTTCGCAAKITGTTGGA^ 

CCCGATTTGGACAGGCTC^CAATATGCCTGCGACGGCATCAAGTTTTACGGCT 

CCAAAGGCTGCGACATTGCCGTCAACGCC^^ 

GCAAAAACCGATTCTCAACAGCCAGGTCCACGACGAAGTTGCCGAAGTCCTTA 
GATGTGGACCX3TTTCGT<XAAGCCGCCAA^ 

TCAGGCACGA(^TTTCCTATCGGAAAATCGTCAACCTGATTGAAAGATTGGCAAATGA^ 

S ^?iAAPYCSLPSEPHFNPJWIA^ 
pjVKHFEKWLENCRPGEQ0VVySAYPLIATNLLI^KHKAR^ 
DALVAVSOTYI^PAKEANPtm^EVVYIGADPAAIAPPPRFRSKTVRFFYIiGT^ 
PMiDRLKQYACDGIKFYGYIPYAEMMSVAKGCDIAVNAIH^ 
DVIXSFVQAAKDIIjKRKNDPVQSDEIVRRFRHDISYRKIVNIjIERLANE 

S A^3GGCGGCATATCTGGCAGGCTCCATCCTGCGCCACCX3GA^ 



CCATTj 



ACGGCACGGGGACGACATCATCCTCGCCTCCTCGG 
GTTGATTTTCACCGGGGCGGGCTTGCGGGCT^

^TCGACTGCCAAATGGCGCGTCT^ 

CTCGCACGGm^CGCGGCAACCTACrc 

CACGAACGCGAAAAGCTCATCCGCCACGCXriTCCCCGCCGGCC^^ 

GGRAAACC<3AAAAAGAAT<MGCCGTACTCG<^CM 
GAAGAAAATGGTTGGCCGGAATATACCCGTACGCCTGAACATGGCGGGCA 

AAACGTTTGGGTATTGAAGACAAAGTGACCTTCCTC 

TGAACACGCCGCATTGCGTAGGTGACCACGGCGCAGGTGTCGGCAACAATAT^ 

C^CCTACAACATGGCGGGCATTTCCGAAAT^ 

ATCGAAGCCGTCGATACACTCATCAAGCATCCAGAGTTC 

GCTCCGACQACGAAATCTACCGCACCACCATGGCTGCGTACGAAATG 

DCOM&RVTOACALAAKIVSPKTKIICHSHGLDAATYPKTAKLFDKIjG^ 

TPHCVGDHGAGVGNNII>EAGLYDTPVVTYNMAGI SElWITGQTGYCIPFGQDEAFIEAVDTLIKHPELHSQMGKAIJnQWETI^S 
DDEIYRTTMAAYEM 

S TOGAAAAACGGTG03GCGTTTTCGTGGGG^ 
TTTACCAAGTCCGCCGCGCCGTGTTCGACAAAATCCTGATTGAAGAAGCCGCCAAACAR 

COTAACCGCGTTCGACAACAGCGGCGATTTTGCCCGCTTGAACATCGAAA 

TTCGTCTTGGACXSCAAGCGGCTACGGACGCGTGCTGCCGCXKIC^ 

CGCATTTCACGCACATCGACGACAACATTACCCACCCGAAATTCGACCGCAACAAA^ 

CGACGTGTGGATTTGGCTGATTCCCTTCGGCGACAACCGT^ 

TCGGAAACGGTGTTOAAAAAATTTGTTTACGAATGCCC 

CGTTCCGCTCCATCCAAGGCTATTCCGCCAACG^ 

CCTCGACCCCGTGTTCTCGTCGKXKGTAACCATCGCGC^ 

GGCGAAGCCGCCGATTGGCAAACCGAATTTGCCC3AACCCC1GATGATCGGCGTAGACGC 

ATGATTTCCGCTTCCAAAACGTCGTGTACGCGCCCGACCGCAGCCCGGAAATCA^ 

CGCGTGGGATACCGAAAACCCGTOCGTGGCGAAATC^ 

GAA i. - 

SEQ ID 62

L^GAAFSVTOSRYTEFDFTDKFSDGPGTV^ 

f^da^gygr^prll^et^ 

SBTVXJOSFVYECPMLSEIIJJKAVW^^ 

GEAADWQTEFAEPLMIGVI1AFRTYVDGW¥T3FRFQNVVYAPD^ 
E 

TTGGGGTGAAACAGCCAAGCGTTTCAGGTTGGATAAAAAATGGACGAATAGATAAAAAA 
CTCAGACGTAGTAACGCCAAGCCATTTCGGCATTGA 
CG^AAATGCCGAAGCGACCTGCGGCGCAGGCACGATTAACGACCA^ 
CATGGGCGCGGGAGAAACTGGGCGGAAACCINIAACAAAATCCAAGTCATTA^ 

AAACGGCGACGTAATGTTCGTTGATACCGCCGTC^^ 

AAGGCCAAGCGGCTGCAATCCACCGTCGGCGGCGGCCTGATGATCATCA^ 

GCGAAGATTTAAACGCCGTACGCATC^TCGGACGCATACGCGGCGCA 

S SsGEQU^AISEAIKRKOTSQKWADHFGWQPSVSGWIKNGRID^ 
RXiNAEATCGAGTINDHYIEVVDYVWAAAWAREKLGGN^ 

kakri/qstvggglmiisdns syrtetvrgedlnavri igrirgawrlsqf 
S ^aiStgaagccgtctcaaagtt^gagcg^ 

TAAATGCGGCAATATTATTCGGG^OTTCTTCT^ 

AATTGAAATTGAAACCGGGCTGTCCGTTCAGGAACAAAGAACGGCACGGGC^^ 

GAAAA&CGAATTGAACACCGCATTTACTACAAACTCAATOT 

AATCAACAGCCCCGAAATGCAATATCAACAGCCCC^ 

CAATATCAACAGCCCCGAACTTCAAAATCAACATTCGGGGAGTGAGGA^ 

GATTTAGCAGTATATACCCCCTTACCCCCAAACGCCGGAAACGG 

COQaaACGTGCGGGCGGGAAACCGGCGAACCGACTTCGC^ 

ACCGAAAAATGCGAATGTTCCGCGCCGCCGC^ 

GGCGGCCGGTTGCCAAGCGTCCAAGTGCTGAACGACACG^ 

CGGCGCCAAACGGCAAGGTCAGGTTCGGGGACAAGGAAACCG^ 

CCCGTTTTGGATGGGCGAAAACCAAACAGGGTTTGCGGTCGGC^ 

GAATGGCATCCGCCTAAAACGAACCAGGCGGCAAGGGGAAGGGCA 

S Ll^SESLI^RPIAYYPKU«^^ 



dlavytplp: 
C^PSVQVUlDTRKRAIAmWC^
EWHPPKTNQAARGRA 

^ATGGCACTAGGG^IAATTCGACGATGTTGAAAC^ 

StatoG^gcggcta^^ 

GGGCGCGTGCCTGATTGCGAGTTTATCCAC 

OGTGATGTACGCAACAGGTGCGGGCXMTACGATTACAGGTTTCGGTGCG^ 
CCCGGAAGAACAGCCCTGACACGCCGATTATCCTGATTATGCAACGCCTGCACGAGAAAGA 

GAAACATTC^GCGGAATGGAGCAGGCCGCGC^ 

cgttcaagcctgacaacctgcaatttc 

CGCAA^GGCGGCGACTACACGGCAGGCGGCASGCTTGGC^ 

ACCCCGGGCAGGCAGGCAAATCGCAAACACTATA 

GACGCGCTGATTGOMAAATGCG^ 

TGC'I^ATACCCGGACGGGCAT<^TTGATTTCCTGCGATCGCAGGTCGAGGCTGTGAAA 

S ^^fddvetsvirsi.ssasly^tr^ 

GKVPTCJ^IHASYSAAliAVNNSVQIFJtTCiVQ 



DALIAEMRMFP^RHDDQIIX^jGRAFGEIiLDTRTGMIDFIjRSQVEAVK 

ggtggttcg^ 
cgggc^ttccaagccgcg^ 

cg^Saaagaccg^^ 
g^gmggacggggcgacgattaagcgcgttttggac^ 

cgcagcgtggcagagacgagc^ 
a ^?^g™caWtat^^ 

g^agSga^cggataagcaaga 

^tgattgagggtgcggc^ 
ttggattttgg 

caggggatatgcctgaatct 
S mskktplsqgfiarvaagvryaftgnadgw^ 



vmdgat!tckvld^ 



IGLHEPF SHGALTI PGHPNCRCAWPVLW3DMPES 

S ATOa^CGGATTCTCCCCCAAACCTAAGACCATTATCCTAAGCCTTG^ 
CG^CAATAC^AACAGCAGAAAGAACTGAATACCATAGTCGTCCATGGCAAACG 
CTAO^ACAAAAAC^TCTCCAACGCOTACGTCGGCA^ 
GGCCTAAACGGCGTGTAC^CATGAACACCCGCACC^

GCATCCCGGTTACCATAGACGGCACCGAACAAACCATCGAOT 

rcCCGCCCTCTTCCGCAGCATCGCGGTGGAAAAAAGTC^ 

ATCCGCACCATCXSAACCTTCCGACATCATCCC^ 

TGGCGCAGAAAAACGAGCTACGCCAATTCTTAGGCCGC^ 

GGGTATGCCCGATGTATTGACCGGCTATACTGGCAAACCTTCCCCC^CCGC 

TTCTCGGGCGGTAAAAGCCAOVCCAACOT 

GCCTTGCCGCATACAGCCACCGCCAAAAAGGCAACTACTAC^ 

O^TGCGGATGCCTGTTACGAC^ 

CCAAATATGGCCGTCCTGTTCCGGCCXSGGCGAA^ 

GGTATCTTCCGGATAACCAAAAAATCAGCCTGCAATATATG 

ATGGATACTCGGTTTCGCCGlAACAATCCOT 

TAGAAAATCGGCTACGAATGGAAGCCGCAAAAG^ 

ACCGCCATCAGAGCGGCGGCCC^GTCGTCGGCGTCATTACGTCCGACTTTGATT^ 
AAAACCGTCGCCCAACCTGCGGGGGGAAAGCT^^ 

GTGCTGCGCATGATAGAAGAC^CGCCCGACAAAATCGCCCGCAAGCTGGC^ 

GCTGGATGGGGCATACGGGCGGCTACTACACCATCACGCCC 

AATGATGAATCTGGCCGAACTGAAGCAAAAGCTGAATCAGGAACGCT 

GCGCXX3CAGCGTACCGACGTCGTCCGCAACGGTTTCAACCTCA 

CCGCOGACTACCAGCGCGAAACGTTGGAAGAACGCACCGAM 

CCGCATGGCGGCCTTGGGCGGCCCGC^GTCGGCGAAAAAGCGCGAATGG^ 

CGCCTOAACATACAGGCGGGCGTGCGCTATCAG^ 

CTTGGTATCAGTACGGCCTGGGTTCGGACAGCT^ 

TGCCAACCAAAAGCGGATGTTGCAGCTTTATGATCCCGATAG 

TTTAAGGAAAAAAACGGTTACGATTTTGATCCGCATGATACTC 

ATTTTGTTCAGTATCCGGACGGCAATAAAAATACCGATGTACTATA 

CAAATTTGACCCCGGCAAAGTACAAATTACGCOGGAAATGTACC^ 

CGCCGTTATATCCCAGGAT(X3CATATCTACATAACAC^ 

CTCCXSTCAAAAGAAGACAACAATCAGGCACTGG 

AGCCC^CCGCTGGCGCATGCCGCAGGAGCAGC^ 

CACCGCCTCTTCGCCCGCTATCCCCGC^^ 

ACGGCAGCGAGACCGTGGCCGAATACAGCCTGAAGCCGGAAAAAAGCACCAACTGGGAAGTC 

GCATTTCGCCAAACTGCGCCAAGGCGACCTCCGCCTCACCTACTACAGTAATA 

GACGGCGGTATGATCCAATACGACAAGGCAGTCAGCAAGGGGGTAGAACTGC^ 

CCTTCGGCGGCACGTACCGCTTCAAACATATGGTC^ 

CGAATGCCTTGAAGGTGGCTTCX3GCCTCAGCCGCTTCTTC 

CGCTTCTTCAACGAAAAACTGGAATTGGGCATGCGCGCCA 

CCGACGGCGCGGGGCAGGTGTATGCCCGCAACGGTAAACCCT^ 

CCGCATAGGGAAGCATATAGACTl^^ 

GTCCCCGGCCCGGGCAGGACGATTACCTTCGGCATCAAGGGCAGGTTT 
SEQ ID 72

MMSGF S PKPKT 1 1 L SLAGAFGALAF ADT PNNTEQQKBLNT IVVHGKRS ADQKGADDN^n^NVSNAYVGKBYLERYRVQ SAGDVIjK 

GLNGVYNMOTRTAGGAITPNIRGITGKGRIPVM 

IRTIEPSDIIPEGRNWGIEVKTEFSGNTVAQKNDLRQFI^ 

FSGGKSHTNFKDDRQIMLSAAFKTO 

FNMAVLFRPGEE IMNSHTDTKILLLKNNWYLPDNQKI SLQYMDNKIGFGEINPIilTAWIIjGFAEQSIiNEPVQQAPGIG'rKIDSKT 
YKIGYEWKPQNNKWI DLQ ADMWRVKTDSNRHQ SGGPWGVIT SDFDYDLWYVJCNIRKKP S PNLRGESCESAMTWNAYGSARTHEE 
VLRMIEDTPDKIARKLAEYDENDHIAJTDRWMGHTGG 
ARQRTDVVRNGFNIiSNRLRLSDKLSLTIjAADyQRETLEERTDT 
RIjNIQAGVRYQNYKGl^IELARQRAARNPWQYGIjGSDSyvrrG 
FKEKNGYDFDPHDTLVGQDTRFYA^^^ 
I^YIPGSHIYITPGSVREEbffiRKANENQIPSKED^ 
HRLFARYARMSRFPSIiYELTAATGSGGLYGSETVAE^^ 
TCGMIQYDKAVSKGVELQSRLDSGRFFAS^ 
RFFNEKLELGMRAIHHSKAERRNYDKLIAIX3AGQVYAR 
VPGPGRTITFGIKGRF 
SEQ ID 73 

ATGAAAGCATCACAATTAACCCTCGCCGTCCTTCTC 

CAAGTAAAGGGCAATTAATTCAAGCTGCCGAATCCGATTTCCT^ 

CAACGGCTTGTCCAAATCCATCAACCTCGAAGCCGGCCCCXK^C^ 

GGCAATCAGAACACC^ATGTAAACGGTGCGGCCAATTCGAGATATC 

CGAAAACTAGACTCGCCCAAGTGTGGTATGAAAAACGTGCCAACAA^ 

GCTGCCGATTGCGCCXSAAATTCGGCGGCATGACTTTT^ 

CCGCGC&AAGGCAACAGGAATCAGATTACGAAGA^ 

AGAATCCGACTAAAAACACCAGAAACTTGACTGCG^ 

TTTOTATACCGGTGAAATTACAGCCACATTCGGCACGGGT^ 

GAATTGAGCTTTAACGGTGTTGAAATTACCAATGCTGACGGCTC^ 
(^CAATTCTACGGCAATGGCGCGGCAGCCATGG^
CGGCGCGAAAAAA 
SEQ ID 74 



CCGCTACCGCCAAAACGAATGGCTGTTGGGTTTTAAATATCGGTTT 



AAAAGTGCX3GGC6AAATTa 



MXK^AAAACAAGTGGACGCGCCTTACATATCGQCGAATTGGCGGGTGCCAATAAATT 
CAACAMSGATGACTATAATA^TG^^ 



An 

AGO 

ETPYYKSKRGSNGIFAYGDKSLVKLIGENNIVKSEIS 
LKGKNNVTSNPKYJA/FAYKKAKVDLTVENKNTLSDAEFGVY INVDRKDNI IEGD

AVALVGKGGSQNIRASRTNL I S SKSLG IHAEQ AAKIAITGASNT IHA3NAAIRSLDKSEVKIDGQIT ID SNVANLARQDGS IHLN 
YKDOTRITGATVSDKGLVAIKPLNimJIVADTIHYKGDV^ 
KSAGEINFNIiAKDALWTMTGQSWIiDKLEGQGTID^ 
KmjSEVLDSMNYGEKLRFATVTNSKNEFW 
TDNRQN\^VKQATGNPSRNVKNINDMFDSTAHYAFTLDTYA 
DRFSLNEQEKKRKWGISLDYGHGRTSLWOT^ 
GKYSNTLFSISTEYGRRKFLDDDKLWRIOT 
FHEFLGSRSFKAFEGKSHYAQK 
SEQ ID 79 

ATGCGCTTCCCCCTCCCTATTACCAATGCTGTGCT 

TAAAACTAACAAAAGCTACTTTTCACCTAAAAAAAGAC^ 

(X:TAACCCTATCTAATACAGGTATTTCTTTATC 

ACGCTTGCCAATCAATCCTATGCAGACAAACTCACCATTC 

AATGGAACTCGCCGGGTGATGATOVAGGAGGAAATTC 

GGTCATTTCAGTOGOTJ^^ 

GCCGTTGTTAAGGTTTTGGGACAAGACAAGGGGGCGGAAACCGGTAAATO 
GCGATACTTTTAGCGGAACTGCCAAAACGACAGGGGCGGG^ 
ATACTTCTGGACTTTAACCACGCCGAATCAAGATAAAACCATCATC 
AACTTGGAATCAGGCTATGCGATGCTGGATACGCTCCACCAACGCC^ 
ATTATCGGCAGGATGCGGAAGCAACAGACATAAA^ 
SEQ ID 80 

MRFPLPITNAVIiKITNGAMEFKSEDIGTIKIiTKATFHIjKKOT 

TLANQSYADKLTIEGNYVGKDGVLKVNTEWN 

AVVKVLGQDKGAETGKLNIEDAKHTYTMIUOT 

NLESGYAMLDTLHQRRGENQTL SRDRQGNYRQDAEATD IKATKAPWV 
SEQ ID 81

ATGATTCTTGCGTCCCTTGTCCGCTATTACCGCOT 
GTTTTAGCGAGGAGAAAATGGGCTGGATTTTGGTGTT^ 
TAAAAAGCCGCAGCCGAAGCTGATGAGTGTGCCGCGCCCTGAAAA^ 
ACCGCCTACGCGCTTGGCGTGGAAGCCAATAAAAACAAAGCCGAAGCCAAAGAAA^ 
AAGCCTTCAAGCAATACC^TCTCGATTTACTGCAAAACAGCGAA 
GCAGCCTGCACATTTCGCTGCCGAAAACCTGCCTGCCGAAATGCTO 
CTTATCCATAAACGCGAAGCCGCGCAAACCTTGTGGGCAG^ 
GCGGCGACACCGCGCCGATTGCACGGCTGCATCCGGCGATTAAAG 
GTTTAATAAAGAAGCCTTTTCCTCTTTCGdAAAAGAGCAAGGCG^^ 
ACCGCGCTGAACTATCTCTTGCGCCXX!GAAAATAATC^^ 
ATGATATAGTGGAT 
SEQ ID 82 

MILASLVRYYRRLATETDETGNPKVPSYGFSEEKIGWIL^ 
TAYALGVEANKNKAEAKERPFTPSEKTFEAFKQYHLDIjLQNSE^ 
LIHKREAAQTLWAGCLKSDHALESLCLISGDTAPIARIiHPAXK^ 
TAI^JYIiLRRENNHCI/TIGDASTVFWAEADDIVD 
SEQ ID 83 

TTGGCGCGCGGGACGGAAAAACAGCCGGATGTTTCGGGTATCCCGTCGGAGGA 

CGGAO^TTTGGCGCGTCAGTATGCGGAC^TGGCC^ 

(XK3TGCGGTTTCTGCGGCCGCTGCTTTGGCGGCGGATTC 

TCGGTTGCGGATGATGCGCCGTCCGGACGGTCGGCGGATGCGGACAGGG 

GCGGTGCGCCXSCGCGGTGOSGCTTa^TTGCGCC 

GCCTGCGGGCC^TACTTCGAO^TTTGGATACGCGCGGGCGCAA 

GCGGATTTCGGGCAAATCGCCGCGCCTGTGCGCCGAAAAATCGAGC^^ 

CTTCCGAAGCCAAACAGGGTTACCTGCCGCCGCCCGTGCGTATGGCGGATGCCG 

TGCCGATGCGTTGGATAAGGAATCGCGCAGGCGGTT^ 

TATACGGCGGACGGGATTATGCGCCGGGAGGCGGGTATGGCGGATATGCGCGGA 

GGGCTGTACGCGCTTACCGTGCGCAACATCCGGAATCGG03GATC 

CAACAACGGTTGGAGCGTGCCGCTGTTGAGCCGCGAGGGGGA^ 

TCTGAGGCCGTTGTCGGCGGCGGCAGGGGTTTGACCCGGGCTTO 

ATGTGCGGCAGGGTTTGACCCAAGGCGGCAGGGGTTTGACCCCTG 

GCCAGGGTCCGCCGTTGCXSTCCGGCAATGCGCCGGCCCGTCGGCAA^ 

GGCCTGTCCGCGTCTGAAAACCTT^ 

CGGTATTGAATCCGCAGGTTGCGGAATCGGCGGGCAGGGTATC 
CCGTTTGGCGGCXK3ACAGGCGCAGGCCGGAAAAGGCGGGTC 
AGGCATATTGATGCGCTTGCX3GGCGTGCCGGGCAG 
CCAACTCTGACGGCCTTCTCTCCGACGGTC^ 

AGCCGTGCCGTCCGAGCCGGGTCGGGATTATCGTCCGACGCAGGAAGCAAGGG 
GCCGCCGCCGACGGCAAACXX3GCGGGCAGGGCGCAGCCTGC 
AAAATGCGGCAACCGAAAAGCCGTCTTCGGATAAGGTGCGAA^ 
GTCGGCCGCCGCACAAGGCGCGGCAACCGAAAAGCCGTCTG 
TGTCGCCGCCGTGTCCCSAA^



,TAACGGGTAA 



GOGGCATTGCCGATTTlt^CACGGMsGCGGTGCGTTTTTC^ 
AAAATCCGATTTGAGAAACX^GCTGAAAGACCGCT 
GAGGACATTTACGGCGGCGTTTTGGACGGCCT^^ 
CTACOaiOQOOGACAAGGTTQTC^ 



gcggacggt: 



TGOSCAGAATATGGGCAGCGGTGCGAATOATCTGG^ 

AGGCGGTGATGAATCCGAATCCTTCCAAGCTCGCGCA6GCT 
TTCTGCGGTTGTGAACCTGTCGCAGACGGCAATCGTGGCTO 



ACGTTTGTCGCGGCCTACCGTTTGGCGAM^ 

ACGGGCATTTTGACTATGCGXKXKAAAACCGTC 

TTCGCAGAATATCCTGTATGCGCTGGGGCGC^TCCGTA^ 

CTGTTCGTCTCGCATGCGATGGCTTC^ 
GTGACGACGATGACCCGTGGGATGCGGAAGCGGCG^^^ 



GKimVAENI/TPERAWAAWHELGHRGF^ 
QRTGGWAGIEtnOfGVKVGNGljKRGIAGVLAMGAL^RWjQM 

RAEDGAAI^SKSBSLEKIJ^ 
QSIAAIPQII^VYIim.PNE^ 

EDIYGGVLIXjLKEYGRLSELFGADAN^yTEADKW 



DFVDGRKYEEGFDQF 
ELLRASKQIGLRFGEKENTIEDSLNGb^

TFVAAYRLAKRAGADSEAAFEQAKKATYDGHFOT 

LLVSHAMASGILGLPFVSTLLAVASML^ 

GIiEGKKWAESLVVGSTGAVVGAGIGAJ\DGVRTRS SVPRTANTLSLMKSW 
SEQ ID 85 

ATGAGTGATTTGGTCAGATACGATCCGTTGGAA^ 

ATGCGCGGGCGGCCGCCGACX3ATACGG05TTGACGCGCG<3G^ 

TACGGGCGACAAAGAGGAACTK3GGCCX3GCT<^^ 

CGCAGGGAGTTGGGCGAGGCTTGGGAAAAGG<X!GGGGGTC 

GGOGCGAGAAGGGTTTGGACGGOKJCTTGGAA^ 

aJCTATOWrAOSGCAA 

TTGGGCAATACGCTGATGGAATACGGCGGGCAGCTGGACAGGGCGGCA 

GGCGATGAAACTTGGCGGCAGTATTTTGAATATGGGCA 

GCGGATAAGGCGGCGGTTGCGGCXSGCTAAGGGAACGCCIK^ 

GTGCX3GCACX3GCACGCGGCGGCTTATGCGACGGAATCGGCCTC 

GGAATGGGACGAGAAGGGGGCGGCTTTGGAGGCTTT^ 

GCGGCGGTAACTQACCCGCTCAGGCCGCCCGGCCGGACGGAAGGCGGG^ 

GCGGAACAGGGCGGCGCGGGTTTCGATACGGCGCATCA 
AGGAGGAGGCGGGGCGGTTTTTCAGCGGCCCTGCCGACGGCAAT 
GCCGGATGTTTCGGGTATCCCGTCGGAGGACGGGGCGGAATTTOT 
SEQ ID 86 

MSDLVRYDPLEHGRLVGGLKBYRGFTQKDARAAADOTALTRGFKN 
RRELGEAWEKGGGVGGGLSNVWGELKKDWREKGLIX? 

LGNTLMEYGGQLDRAAEAAGVDPADKDAVMAF IGRGAPGALKNAAWGAWGAADMAAMKLGGS XliNMGKKAAGKAALiEKMGVAA 
ADKAAVAAAKGTPEFAALAKESAKGGLGGAARHAAAYATE^ SLGHSAVGFAGTKAY 
AAVTDPLRPPGRTEGGCAGGYRGQQEGGQAGPGR 

RRRRGGF SAAL PTAJRRTRRNWRAGRKNSRMFRVSRRRTGRNFWIRA

SEQ ID 87

ATGGGGACGGACGTACCGGAAACAGGCGTATTGCCCGATAAAAACGGCGAACCGCT 

GTGAAATGATGAACCAACCTGCATGGCGTGCCGTTGCCGACAAGGAAA 

TOTACAGAAACAGCGCGAGTTGGGCCTGCCCCCCGCTO 

GCGACGATACGGACGGACTGGCXK^TGACXSGCGG^ 

ACCGCGKTGGAACGGCAAAGCCX3TGCCGACAAGGCTTG 

GGTTACACXK1AACCCCAACCCTTTCGAGTTTCCTT 

TACAAATACGACCTGTCCGATGCCCGCTGGCTGATACGCCGCCGCT 

AOXKXX^ACACTTCAAAGCGAIIXXSGGCGO^ 

ACTGGCGGACGCTTGGGGTATTTCGGGGCGCAACACCGTCAGC^ 

GCGGAAGTATGGTACAGGCGGTGGGTAACGGCAGACTGCCTGCGCGACAAAA^^ 

ACCCAAACCATCGGGAGATGGCGGC^WVCGGCGCGGTATTC 

GGATTTGGTCX^aSCGACGAGCaSACKX: 

ACCGGCATCCCCTACGGATATGTCXXSC^TATGAAATA 

TGTGGGCAATACGCACGGTACGCACCAAAGGCATAGTCGATATOTCGGACGA 

GGACATCGTGCTGAACAAAATAGAGGCCGCCCAGCCGGGCG 

TGGCAGATGCTTCAAGACAGCCGCGCGACAAT^ 

CCAGCGGCACKKLAGGAAAGCATCXAAGTCGAGCAGTCC^ 

ATTGGTCGGCGAGTTGCTGCTTCCGATGATTATTOAJGGATOT 

ACGCAAGGGCGGACGGTCGTCATCAACAGGC^ 

ATCCCTGCCGCCCGAATATCAGGCGGCGGTGCTGCCGTTTATGGTG 
AAAATCAAAGAAGTCCGAGTGCAGGAAACGCCCGAACAAATCGAJGGCGCOT 



GCGGTCATGCAGGGCGOGGG&TATATCCGGCCGGTC 
AAATACCGCCCGAAGGCATCCC3X3AAGCCTACX3GCa 
CCAAACAGGCATGGAAACXX!CGACGGTGTCGGACAACCTC 
SEQ ID 88 

MGTDVPETGVLPDKNGEPLTIGEYRLFVGEMMNQPAWRA^ 

ATIRTDWKVTAIXSETGGRDVADAIjNFK^ 

YKTOLSDARWLIIOlRMiLPER^ 

AEVWyRRWVTAIXIiRDKKTCRTVEFIXSANPNHR^ 

TG I PYGYVRNMKYAQDNLNSTNSKLRWGLSAIRTVRTKG 

WQMLQDSRATIRQ I SGITPSFMGNRGNATSGRQESIQVEQSNQSLGLVMDNFRQ SRSLVGEUjLAMI IKDIaGSDEQTWTEGDAV 
TQGRTWINRPETDPVTGKAYLSNDLQNIRLKVALED^ PFKDKVIE 
KIKBVRVQETPEQIEARIAQAVQDALAKSGNDIKRRELAIiKEQR 
AV^GAGYIRPARGDDPGFPVPAMPPETQIPPEGIPEAYGADTGPMTA 
SEQ ID 89

ATGACGGGAAAGACGGTTGAT^^ 

GGCGCGGCGGCGGCAAATCGCACGGCGCGGCATCCGCACTC 

CGAAATTCAAAAATCGATGCGCGATTCCGTAC^ 

ATJttCCGAGOTOG^ 

CGTTTGAAGGTATCGACATCGTATGGGTTGAGGAAGGGCACGGCGTCAGT 

CAAAGAAGGTTCGGAAATTTGGATTACCCTCAATCCCGATATG^ 

GAAGACACTTCGCTTTGCGAAATCAACTGGCGCGACAATCCG 

GTTCGATGAATAAAGAGGACTACGGGAATATTTGGGAAGGCAG^ 

ACAGGACGCTTTTCATTCCXSGACGCGOTACG 

AACX^TGCCATQACCATCGQGCTGQTGCM 

TGGACTGGTATGTTGCCGAATTGGAAAAGCTGCCCTACCGGTGGGGGACG 

CCAAACAGGCAAAAGTACGATGGAGATTTTGACCGGACTGGGGCGCAM 

GGCATCAGGGCGGCGCGGATGCTGTTTCCOUAGTGTACTTC 

ACGGCCGCCAAATACATGaSAAAAOfc^ 

COTGGCGCAGGCGGTTGATTTAATGGATACAGGC^ 

SEQ ID 90 

ITDFEIRGANGTLFVFSGI^SHTVDSIKSFEGIDrVWVEEGHG^ 



NDAbOTIGLVQRDLTSVRIIGYIEOTHRT^ 
GIRAARMLFPKVYFDKDKTARLLECIjKRY 
SEQ ID 91 

ATGATGAAGCCGTCTGAAAGTTTGAGAGCGGO^GGCAGGCC^TC 
ATGCGGCAATATTATTCGGGCATTTCTTCT 
TGAAATTGAAACCGGGCTGTCCGTTCAGGAACAAAGAACGGC^ 
AAACGAATTGAACACCGCATTTACTACAA^ 
CAACAGCCCCGAAATGCAATATCAACAGCCCCG^ 
TATCAACAGCCCCGAACTTCAAAACCAACAOTCGGGGAGTGGGGAA 
CAAAATCAACATTCGGGGAGTGAGGAATCAACAGCCGTTATAA 
TACCCCCAAACGCCGAAAACGGCAAAGGCGGTTTGAACGCTGACGCGTTTC 
CGGCGAACCGACTTCGCCGAAGGCCGAAAGCGA^ 
CGCCGCCGCAAAATCCACGGCGTACCGCTTCAGGAAATCGCCGATTTCT 
AAGTGCTGAACGACACGCGCAAACGGGCGATTGCAAACCGCTGGTGC 
GTTCGGGGACAAGGAAACCGGTTTGGCCTGGTTT^ 
CAAACAGGGTTTGCGGTCGGCTTCGATTGGATTT^^ 
ACCAGGCGGCAAGGGGAAGGGCA 
SEQ ID 92 

MMKPSESLRAAGRPIAYYPKLAKPIX3GVNAAILFG 
KRIEHRIYYKLNLDAFDDLMliQHSGGGEPTAPKCNINSPEIi 



RRRKTHGVPLQEIADLYNEVLGGRLPSV^^ 
QTGFAVGFDWIFKAGNFVKILEWHPPKTNQAARGRA 

^ATCAGCAACGGCGCGCGTTGGACGGTTA 
GCX^CAACAATAAATTCOTCAAAGTOTCCGTGAG 
AGGAGAATCGGACAAGCTGATTACCCGAAAAGGCAGCG 
ACGGGAAGGGAATATCTGAAACTGGTCGAAAACAAGGGCAATCAAGAAGA
ATGTCCGCTGTACGGAACAGGGAGGGTGGTGTTTTGCCTTGGGTGAGTC 
TAAGCGGGACTTCTACCTCTACCCGGA^ 

AACGCGGTTTCCGATGAAACGCTCGTGCAGCGTATGGGCGAAATCCAC 
TCAAACGCGTCGGCGGAAAATTCTCCGGC^GCCGCAGCGATTACCGCGTC 
CGGCTTCAAC^GGACGGGGTTCGGCGATAAATGGATTCA^ 
GAAGACTACGTCGGCAGCGGCAAAATCTTACGGCAGGGCGGCAGGTGTTO 

TA 
SEQ ID 94 

MSNGARWTVTNDSMLKELDM 

TCRKYIiKLVENKGNQEDNKASNKASYKUlVRCTEQGGW 

NAVSDETLVQI^GEIHAIX3MPQECNNrVWIKRVGGKF 

EDYVGSGKILRQGGRCIiFRLAQPGKQGLL 

SEQ ID 95 

ATGACCGATATTTTCACTCCATCCAAACGT^ 

GGAAATTTCTTTTTTCCCAAGGTTTCCGTTATC 

TAAAACAGTCGTGTTTATACACGGCTGTTTTTC^ 

TGGTTGGAAAAAATCACAAAAAATCGCGAACGTGATATTAAAAATC 

TTTGGGAGTCCGAAOTAAAAAATAAAGCTATTTGTAGGGAGCGATO 
SEQ ID 96

^DIFTPSKRSFVMSKIHSKETKPEVLVRKFL 

WLEKITKNRERDIKNETELEKIGFKVIVVWECEL^^ IKDAV 
SEQ ID 97 

GTGAAAGGTGAAAGOSGAGTTGATATCGAAAAT^^ 
ATC^TTTGGAAGATTCGGAGCTTACAAATAAAGAACAAGC 
CTATCATTGGCGGCATTTGCATCAGGATTTGCATAC^^ 
ATCGAGGCAGTTAAAGTTTOCGAAGATAAGGTTCA 
CCTTCGGGAGTAAAAAATCTATGCTTCTATTAAC 
TGGATGCGGAACATGGACTGGATTTAGAAATCCTGT^ 
GATGCTTTGGATTTAATCAGCTTAGTTTCATATTT 
SEQ ID 98 

VKGESGVDIENWKNKLPEKEREPVEVXIiNRLEDSELTNKE 

IEAVKWEDKVQKQTGLHSI1X3RELIEKAFGS^^ IFNDK 
DALDLISIiVSYLLRKVEQTKKRAKPTSP 
SEQ ID 99 

ATGTCTGAAGAAAAATTGAAAATGAGTTTCGAGCCAA 
CGATTGCCGAGTTGATAGCGAATGCCTACGATGCATC 
CGTTAOTAAAGATAATGGCATAGGAATGAGCTTCGA 
AAAGAAGCCTCCCCGTGCGGAAGAATTCCAACGGGT^ 
AAATCTCXACTATCC^AGGAAACGAACGGGTTACTTTO 
GGAGTTTCAGAAAGAGTCTGTTAAACCCAATACCGAAAAC^ 
CCGOTAGATAATTATGTGGGGCATCTTTC^ 
CX3GLAACCAAGAATCATTGACGGAAACOTAAAA 
TATTTCATCGTTATCTTCAAAATTCGAAC^ 
AAGAATAATATGAAAGGTATTACGTTGTTT^ 
ATTTC 
SEQ ID 100 

MSEEKLKMSFEPTVIEHI/3VKMYSHTWAIAELIANAYDACA 

KQASPCGRIPTGKKGLGKLALFRIiG^EK^ 

PLDNYVGHLSRIiFDFPAQDFKIKVSLNGSEPRII^ 

KNNMKGITLFANGKMVNMPEFFTDSESSHF
SEQ ID 101 

ttgggaatggaaaacggctttcx:gaaaattatggc^ 
atatcaatttgaaacgattggctttgatacctaaaaac 
atcttttatcggaaaagacaatagttttaaagacacgttcgg 
ttctttagcatttccaacgggcggtt^ 
tccctcgtaactatgtctttaaagcgggcagtagggacaagatagcccx3 
aaaaataggcagggcaattgttgataatatcgaatgt 
SEQ ID 102

LGMENGFPK IMAGHQDETDFMHSCAGL SDINLKRLAL I PKNGGNRLAFAHI PELQLECF IGKDNSFKDTFGRLWWDKPAPTITTK 

FFS I SNGRFAHPEEDIU^SLREGATLQSFPRIWVFKAGSRDKIARLIGNAVPPMYTEK 
SEQ ID 103 

TTGCAGCCTGAAACCTTGGAAAAAGAACTCGGOT 

ATTGGAGTCTTATTCAAAGAGACAAAAGAAAATCGGAAAAATC 

CAATCCCGGATATGTTGTTGTAGAGAACOTACCGGGAATTCTG 

TTGGAAGAAAAAGGATTTACCGTCCATTTCfGGTATTCACAATAC 

TAATTGGAAACAGAATAACCAAAAAAAGCTGGAACCAGTCAA 

GAAAACGGCTTTCCCAAAATTATGGCAGGACATCAAGACGAGA^ 
SEQ ID 104

LQPETLEKBLGLKKNDDDLILIGCSPCQYWSVIQTD^ 
LEEKGFTVHFGIHNTADYGIPQSRKRFTLIANRITKKSWNQSSIRA^ 
SEQ ID 105 

ATGAATTTACCITTTCGGGCAATGGTGTCTGATTTAOT 
ATACTCTGAGAGTATTAGAGCAAAAAGGCGTAGATATTATTT^
AAAAATCGGTGCGGAACGTGCGGTTATGATTACATCAAACGGTGCTC 
AGTTTGCCTGAAGAAOTAGTTCTCGAGCTTTACA 
AA 

SEQ ID 106 

MNL PFRAMVSDLGGTIiLTPEHLVGDIiTIOTLRVI^QKGVDI ILATGRNHTDMS SILGK IGAERAVMITSNGARVRDLQGNIiLYSN 
SLPEELVLELYKTSYVGNLIQTPRFCRHQ 
SEQ ID 107 
ATCGAAGAGC^ACTTGCCOVAAACGACCAGCCGTC 
TTTTGGTGACGGGCAATGTGATGGTGGCTGAAAGCGGCAAAGGTTC^ 
AATGCTGAAGAAATGGGCGAAAGCCCGGACGCAAAACGACAC 
GCGGTGGTCAATAAAACACCGCTTGCACCGAGCGCCGTGC^ 
GTGCCGATGAAATGAACGGGCTGATTCAGCAATTCGTAC^ 
AACCGTATGCGCT^CCTTTTGGAAACCTACACCGCTATTCGCGCTGCCGCGG

c<mcagatttccaaaaaggcggatttgacgaaagcgaatcg^ 

T^TTTCCGGTGGCAACTACGAAAGCCCGCAAATCCTCGCCGC 
TAC^CGAAAAAGCCCGTGCAGCCAGCCAAGCC^ 
TATCCAGCGGTCATTTGGATTTGGTCX5GCATAGCCCG 
OTACCAAACCGTACAAGCAGACCGCATCCAAACTGGC^AG^ 

CGGGAAAACGG 
SEQ ID 108



YAEKAIUVASQAPLIITGGFRSQTAMEDALSSGHIJ5LVGIARPFALVPDI.ANKMQNRTO 
YMTQMDLIGQGKQSNPKIVGVESIABNFAGKR 

S A^CGCAACAAACCATGAATOTAATGCG^^ 

AAAAAl^TCCTGCAAGCCGGAGCGGTGGCGGTGGAGCAGGTCGGCAAAGAGCGGT^^ 
GCGAGATTGAAACAGCTTGCCGATTTGATGGCTCAA&ATTGCCCGCTOTCAAAA 

ARLKQLADLMAQNCPLSK 

aAGCAAGAAA^TCTGATAATGCGCTAAACCAATCAACCGAACTGTTCCA^ 

TTCTGATT^GATCCTTCCGATTTCATGTGCAGATTCGTTATCT^ 

GCCCTGAOTAAATTACACAAACCTCTATCGAACTCTTO 

JaScaaatc^^ 

acttgaaaatatcaacacacatttagagataataaagagaaaa

SEQ ID 112



S atcaagk^tggaaatttotk:cgg^ 

otSag^caaIa^ 
cSaa^tat©^ 

^AAAGGCCATCTGACACGGCTGAAA<^TTGA^^ 
AATGCAGCTCATTACGGAG^CGCAAAAACGCGAACGCGT^ 

AAG^TTCAATAAAGATATTGCTC 

GATACCTTGTCCGACATCCCG»TCCTTT<3GGGAATCATAAAATTACAGGAC^ 

GACACACAGGAAGCGGGATAGACGAACCGTCCAAAACCATTAA^ 

OTMGMGATGGAACAGTCAGATACTTTACCAGCTATGAAGCAAAACTTC 

TGTCGGAGAAAAGCAGCCTGCACAAC 
^IffiS^II^^AGGLAKGLEIiAGFQHASFIEL^ 



ICiADYGVPQKREKVVXVGIRADLDIDWKFPKRTHS^^ 

DT^DIPOTIiGMHKITGHF/tfBIXjARryPGHTC 

AWGEAMRQIGNAVPVKLSEII/SKHLMGVLSEKSSLHN 

^TTCTCGGACATATCGGCTTCGCGGGCTGCCTATATGGATGTGCAAA^ 

cg1accatatocac<k:catttggacgctgccgcccgacx5a^ 
cgcatattcccctcatactaaaaacttaggggctgtactagattatccc 

^SDISAS^YMDVQK^ 

S ™?TCAAATGGCTTATATTTTGGATTGTTAGAATC 
TAAAAGAAACACTAArrCGTGTAGATO^^ 

TGACyvAACAATTAAATATTCTCGCAAAAGAAATAGGTTTAACTATAGAATTGCCCATCC 

TTGTCAACCTTAGAGAATAAAGTTAATTTTTATAGTAGATTAACAAAAATCAGG 

GTACGGCAAGGCGAGGCAACGCCGTACCGGTT 

lstlenkvnfysrltkirtrrraadstkstarrgkavpv 
SEQ ID 119

ATGGCTGGTGGAGTTTATTTAGGCGGAAAAATTATTAGCCCTA 
ATGCTGAAACAAATATAATTTTATGTCCTGCTCAAAAAT^ 
GAAIH^GATAATTACTTTTTTATC 
GCTATAGAAAAAGCAGCAAATTTTTTAAAACT^ 
SEQ ID 120 

MAGGVYLGGKI I SPIYHS SQEFSGEPI IYAETNI ILC PAEKFLTLKRVFHNGNIFNMNQ I ITFLSNKQGGVRFDKNYDKYKTWQV 
AIEKAANFLKLGNPYN 
SEQ ID 121 
ATGACAGAATOACCGTGC^TAGCCGAAGC 
TGCAATGGCTGAAAGAGACGGGCGGCTTCCC^ 

TGTCGGATACTGCCTGGGCAAAAGCGGACGCGCGGTCATCAGGGACTGOT 
AAACTCGAAGCCCCCGCATACGGCTGCATCGCGGTCAAACCGCGCCGG 
CGGAAGGCAGAATCTTGGGCTTGGGCGGCAATCAGGGCAA 
CTTCTGGCCGTCCAAGCTGATTGGCGGCAAAGCCGTGCCT^ 
ACGGCGAAACAGGGCGCGGGCGAGGCG 
SEQ ID 122 
MTl^FWIAEARiraiGLKEIPGAKHNPTIVQ 

KIiEAPAYGCIAVKPRRGGGHVFFWGKDAEGRIIjGLGGNQGNMVSIIPro 
TAKQGAGEA 
SEQ ID 123 

ATGATTGGGGCTTTCCTGAAAAATTGGAAGCCGCTGCTO 

GGGCGGCGCAATACCGTCGCGGATACGGTGCGGCGGTGTCGGAGGTTTCGGAA^ 

ACACGCCCGCAAATCGTCCGCCGCGTATCAGGCGCAAAAGG^ 

AAAATCATTGAAAAACCTGTGTACCGCAATGCCTGTTTTGAT^ 

GT 

SEQ ID 124

MIGAIiLKNWKPLLILSAIAFFAVSWQLDRAAQYR^ 
KI IEKPVYRNACFDADGVRELNAAVDDGG 
SEQ ID 125 

GTGCGGCATCGGTTGTTGTCCGGATCGGTACGGCATTTTCATTTC 
CCTTCGCGCAAACAAAACGGCAAAAATTTGCCGCGTC 

GGAGTTTCGGGCAGGGCCGCACCAAATCGGC 
CA.TCAAAACAGGCATTGCGGTACAGAGGTT 

CGCCTTTTGCGCCTGATACGCGGCGGACGATTTGCGGGCGTGTTC 
TCCGACACXXXX^3C^CCGTATCCGCGACGGTATTGCGCC 
AAATAAGCAGCGGCTTCCAATTTTTCAGCAAAGCCCCAATC^ 
CGACAACCGGTAACGCCCTTCGGCGGGGGACGAAGGGACGGCTT^ 
TCCGCAGGGTCAAACGGGATGATGGATACCAT ATTGCCCTGATTGC^ 
CAACGAAGAACACGTGTCXIGCCGCCCCGGCGCGGTTTGACCGCGA 
CAT 
SEQ ID 126 

VRHRLLSGSVRHFHFRPLLVRLIILYNAPFAQTKRQKFAA^ 

CRALQSLYIPAAFRAHGSTSAFVFPSRCGSFGQGRTKSAGGLT^^ 

RLLRLIRGGRFAGVFGVFDRGGFEAFRNL^ 

RQPVTPFGGGRRHGFAANQLGRPEVAVNVRRVKRDIXjnilALIAAQA 
H 

SEQ ID 127 

GTGTATGTTAAAC^TACACCCTTGCTGAAAAAGCCAACGG 

TTCTGTATTTTCTTTCAAATCTCAAAATC 

ATTCITCAAATTCAAAATGCCCGTCAGTTGCCAT^ 

AAGAATTACTTTTCATCAACGAGCCGAATCTC 

GATATTTGAAGAGGTTATCCCTCAAATCCGCAAAACAGGCGGOT 

TTGCXXJCGGGCGGTTGCCGCGCTTGTCGGGCGCAAA 

TCGAAGCTCGTCGAAGGCATCCCCGCCGAO^ 

CGGCGAAGTCCCCGACCGCGAACOSTTGCCa^ 

ATGGTTTATTACGGCACACGGATGATCGAATTGGGCAAAGAC© 

CGATGTGGACGGTTOKSGCAaSAAACCCGTTCAATCC^ 

AGACGCCTCCGGCCGCATAGCCGCATGTCTTGAAGGCA^ 

SEQ ID 128 

VYVKHTPLLKKPTGGLSGHLRMCCFHEQSTmiKRP 
ILQIQNARQLPLKDQGIQKSSVATKKGNQELLFII^ 

LRRAVAALVGRKRIGYS SAYSMIHQRFNVEAVEGIPADKL PEAVAYVHALTLHTGLAGEVPDREPLPAPQPALPI SGNALADIAA 
M\n!YGTRMIELGKDVSAPliKQLGCKQAVTMWTVWffi 
SEQ ID 129

GTGTGTACCTTATGCCGGACCTGCATGTTGAAACATAAAGGAAAAAATCATG 

GTCTCTTTGGCTTCCGCCGCCTCTTTCGCGCGTGCGCACGGACACGC 

CMCCGCCTACACCGCCGTCCGCC^ 

CACTCAGGGCGACACGTCTATGTCCGACTGTGCG^ 

GCACAAAATTCCCCTCTGACACGCGACGCGGCAAAAGTGTC 

SEQ ID 130 

VCTLCRTCMLKHKGKNHEPSSIFGQRR^ 
HSGRHVYVRLCGCRAPDACLMRRGAR^ 

ACTACCACC^TCATCACGATATGCAGCCTGCCGCC^ 

ATGCTTGCCTTATGCGGCGCGGTGCACGACCTTG^ 

CGTGCAAACAGTGTGCGAAAGCCTGTAAAGAACACTCCGTC 

TATCAAAGAATGCGAAAAACTCGCCGCC 

SEQ ID 132

MNRRQFLGSAAAVSIiASAASFARAHGHADYH 

MLALCGAVHDIAAQNSPLaOT 
SEQ ID 133 

TTGGACTGGCGGGGCAACAAACCGCTTGGGGCGGCGGAGCTGGC 

GCGGTCTGCACATCTATAAGGCAAGTGCCGTCGTGTCGACGGG 

CACGCAGCGGTATGCCTCTTTTTGGGGCGACGGCTACGACGTGTAC^ 

AAGGCATTCAAAAAAGCCGCCAAAAGCAAAAATACCGTGATG 

ATGTTTGGGATGATTTTGAAGACGCGATGGAACTGAAGGCG 

TCTATATGAATATCTGGAGGAATATCCGTTTATCACCCGATTGTC 

ACGCGCATAACCGATCTTTCTGTCGATATGACCGGCGTGGAATCGCTTTA 

GCGAAATCAAGGAAAACTGCAAGGTCTGTACCGCCGGAA^ 

GCGGGGTTTGGAAAACCTGACCGCCGTCAACGTCATGGGCATTGCCGAT 

CTCAAAACCATACGGCTGTGGGGCAAACCGGGCAATA 

CTGCCGTAGATTTGTTCGGTTTCGGCGCGGACGACATCCCTCA 

CCTGCCCGAAGAGGCGGCGAAGGCAGTGAAGAAACTTTATAAGAAACGGAAGGAAGACGGGCTGGAC 
CGAAAACCCGAATGGCTGGCGCAAAATTTCGACAACCCCTTCC 

AAAAAGCGGCGGAGCTGTACCGGAAAACCCGTGCCGGCGTGGTCAAACTGCTCGGCAATC 

GGCAGAGGCGGTCAAAGCCTATACCGGAGGTTTCAACAAAAT^ 

GCCGAGGCGTTGGAAACGATATTGGACCTGATACCGGATGGGTCTTGTGCCGA 

ACAGGAATTTT 

S L^<^^GAAELADLKPLYKDF^^ 
KAFKKAAKSKNTVMFAGREYGISKQinjSDVWDDFEDAMELKAFPCLS 
TRITDLSVDMTGVESIiYIiNBGLDSLlNIiKGEIKENCKVCT^ 

LKTIRLWGKPGNIANFS AVSGFEDLEVFTAVDLFGFGADDI PHPDRLPKL»HRIjWMS SLPEEAAKAVKKLYKKRKEDGIjDPWIBKA 
RKPEWLAQNFDNPFRDWDG AEHI PKSHAKKAAELYRKTRAGVVKLLGNPPENIGE IDTVEREDI 
AEALETILDLIPDGSCADKEKLFBIFDKNRHF 

SEQ ID 135 

TTGCAAGAGTTTAAATTTATTTT^ 

CCCATTTGAAATTCACACTGATGGTAGCTCAAGGC^ 

ATATTATGCTTTAATCGGTTCTTCAAATO^ 

GAGCAGGATTTTATAAAAGTCAAATCTTGGGCTC 

ATCAGGAAGCTGAAATTAACTATAAGAAATCAACTGTTAG^ 

TCAAGAACTGATTGCGGCACGGAGAAAGCAGATGAG 

GCAGCAGGAAAAATAGACAATAATGATTTTTATGGGGAATTTAATAAATO 

TAGGCAATCGTTTTCAAGATAAAACGTGGAAAAGGACGGG 

TTTTGATGCTCCACTTACTGAGAGAGATAATGTGGTTGCCAAGCAAATTC 

TCAGTTTTTTCTCAAATGCTTTGTCAAGAATATCC^ 

ATAAATTTAAGTCTGCAAAAGGAGCAAGCGAAGGCTCCAAATAT^ 

GTCAGAAATAAAAGATTTGGCTGAATTAGATGTTTTAGT^ 

^Q^KFIF^ 

EQDFIKVKSWADE IAMKS I PVSEDWLEBY QEAEINYKKSTVRQ SVMDKLFMEMPIT^QELIAARI^QMRI^QWCNQIjKNLIKQC 
AAGKIDNNDF^GEFNKLWSWKSE^GEGVGNI^QDKTWKRTGKSSDFFKW IAIQSVFDAPLTERDNWAKQ IDWLKECGVSTRG 
SVFSEMLiCQEYPDRYFVLNAPIXKFLE 
SEQ ID 137 

TTOTCTGAAAACGGGAAGGGCAATTTTGCCGC^GCCGCCGC 
GAATCCGGAACGCAAAATCTAAAGAAACCGTTTTATCCGATAAGTTTC 
GACGGCGGCGGGAATCGGCGGATTCGGCGGAGTTGGTGGATTTGG^ 
ATAGCGACAAACCGCCCGGCCGCCACCCGCGCCCGCCCAAGGCAGACAACCGTTGCG 
SEQ ID 138

LSENGKGNFAAAAAVI PANLHSVI PTKTOIRNAKSKETVLSDKF PHRQVWI PAYAGOTAAGIGGFGGVGGFGGLKPAIjVYRNFRI 
IATNRPAATRARPRQTTVA 
SEQ ID 139 

ATGAAAAAACTTCTAATGATAACCCTCACCGGTATGCTTGCAGCTTC 
TGCCGCACSGGAGAACGCCCTGTTGTCGTGCAGGTTCCCGCC^ 
AACATCCGGATCGCCTTCGGCATCAAATATGATTGAAATGC^ 
ATGTTTAATAAGGCGACCGCACTTTATTGCTra 
GCGAAAGCGACAAGGCCGATTTGGAAAACGCGGCAAATGCC^ 
SEQ ID 140 

MKKLIiMITLTGMLAACSTGVNVGRIiMVEMPQGERPVW 
MFNKATALYSLNHAKKVGNOTSVY1TO 
SEQ ID 141 

ATGCGCTATTTTAATAGAAAC<X3AAGQATGCCTGACAACTT^ 
GCAAAACCTTGATAGGCGGATATCACATTGCGGCGTTC 
GAGCCTGACGATGGTCAAGCTCGGAAAGCCCCTO 
GAAAATATTTCGGTTTGCCGCCTGAACATAGGAATAA 
CAGCCAAAAACAAATGCCGTCTGAAACCO^AI^ 
AACACTTCCCGCCACAATTTCC 
SEQ ID 142 

MRYFNRNGRMPDNLILI SVLAKPSPTHARKTLIGGraiAAFQTDLIX^ 

ENI SVCRLNIGITAFQFANNR1TCTIX:SDTA^ S 
SEQ ID 143 

ATGACGCACACCGCATCGAAAAC^CCCAAACTCT^ 

GGATGACGCTCGGACTGTTCGTCC^CCCGTCX5TC^ 

CCAACTGATGTGGGGCGTATTGCAACCATTGTCCGGCG^ 

CTCCTGCTCGTCTGCGCCTGCCIGATTGC^ 

TCGGCACCX^TTCCGGCGGGTTTTCCATCATTATGGGACAGGTC 

ACTGGTCAACGCAGGCGGTTCGGCAGGACAATTCCTGT^ 

TCGGCTGGACGGGTACATTTTACGTTTGGGGCGCAATCGCC^
SEQ ID 144 

MTHTASKTPKLWAVXAAAAFILLITIGMRMT^ 

LLLVCACLIASNIPTTWGIiMIAVGLLIjAFGTC 

SAGRVHFTFGAQSPC 
SEQ ID 145

gtggggcgtattgcaaccattgtccggcgcgcttgtc 

GTCTGCGCCTGCCTGATTGCCTCCAACATCCCGACTTATTGGGGG^ 
GTTCCGGCGGGTTTTCCATCATTATGGGACAGGTCGCCGCC 
CGCAGGCGGTTCGGCAGGACAATTCCTCTTCGC^ 
CXSGGTACATTTTACGTTTGGGGCGCAATCX^CC 
C^CXSCCGCCCACACCCAACACACCCAAGCCACACACGGACAAAC^ 
TAC^TCCTCCTGCACCTGAGCTTTTTCGCCTGC 
GCGGACTGCCCGCCACCGTCGCCTCGACATCCA^^ 
SEQ ID 146 
VGRIATB/RRAraPFRRVQGIKKR^^ 

RRRFGRTIPVRTAG SRTQGLWLPEVGWTGTFYVWGAIALL ILPVSWWLAGGNNGGNNAAHTQHTQATHGQSIX3EAVKTAFKTPS 
YILLHLSFFACGFHIAFLVTHLPTEVAL(X3LPAWASTSIM IGLANIAGCVFSG 
SEQ ID 147 

ATGGTGCTGATTTAC^TCTTCTCACCCAAAACCGACCTCAACOTTT 

CCAAATCGGCGGATTCCTCGGCTCGTACATCGGCGGCATCX5TGATT 

GCCGTACTCGCCGGCACGGCAGCCCTGCTTGTCCTGCCCGTCCGC 
SEQ ID 148 

M\foIYIFSPKTDIiNFYIFAAAIK3FTWL 

AVLAGTAALLVLPVREPRTAA 
SEQ ID 149 

ATGAACATTATCGATGCCATCATCAATTTGGCTAAC^ 
ATCAGGCAGGTGATGCTTTGGAAGAATATGTAAAAGAOT 
ACACGCAAAAGTATTCTCATATTTGGGCAACA^ 
AAAATTGAAAGTAAAGATTCCGCACTGGCGCTGAATAGCA(^ 
AAGCCTGCAAAGATGCGGAAAAATGGGAAGAGAAAGACATTATCTAT 
SEQ ID 150 

MNI IDAI INLANNPWGVNSHSQSNNRANQAGDALEEYVKDLFSGSFNIiN^ 
KIE SKDSALALNS SHPKSKLSVDDSMLTKACKDAEKWEEKDI IYNCRRR 

SEQ ID 151 

TTCGCAATGGTTOACGGAATCGACTACTGCGCCGATGCCGAAT 

ACATCGGCGGCATACAGTTTGCAGAAACCAAAGAGCTGG^ 

ACGGGGTATGTGGGGAATCGAAAATCGATGGTTO 
GCCATTATCAACQAAGACAAATQGAAC^
AGATTAAAAOTAAAAATCCCAAGA^ 
SEQ ID 152 

LAMVYGIDYCAIJAECYLKIKNQIKEGIGNIGGIQFAE 

AI INEDK^SFNNTDKLLiAIQDSKLAI SDIKIKNPNNPARLRNAKLITYHL 

SEQ ID 153 

ATGCAAAACTCATCACCTACCACTTATAATCOT 

AAAAAGCGGGATTTGAAATCCCCGCCGCCAACGAATACGATAAA^ 

TTTGATAGAAGGCGATATACGCAAGATTAAAGAAGAAGATTTCCCTGAAGAAAOX: 

TCTTGGTCTGAAGCGGGAGCTTTGCGCGGCATCGACGA^ 

AACAGCCAAAATTCTTTTTAGCGGAAAACGTCAGCGGAATGC 

GTTTGACGGATGCGGATACGACGTAACCITGACTATGGCCAAC 

TACATCGGTTTCCGTAAAGACTTX3GAAATAAAATTTTCTTTTCCAA 

AAGACGTTATTTGGGATTTGCAGGACAC^GCCGTA 

ATATTTTACCGGCAGTTOTTCCCCTATTOTTATGAGCCGGAACCX3 

TGAGGCAGGCAGTGCCAACTGCACCCGCAAGCCCCAAAGATG 

AAACACTATATCGGAGGATGACGGTACGCGAAGTTGCAAGAATCCAAGGCT^ 

CAACGACGCATACAAAATGATTGGCAACGCCGTCCCCGTCAACCTTG^ 

AGG 

SEQ ID 154 

MQNSSPTTYNPMKIISIiPSGCGGL3DLGFEKAGFEIPAANEYDK^ 
SWSEAGALRGIDDARGQLFFDYIRIIiKSKQPKFFLA^ 
YIGFKKDLEIKFSFPKGSTVEDKDKITLKDVT^ 
SGRQCQLHPQAPKMEKHGAITOYRFAAGKETLYRR^^ 
R
SEQ ID 155 

TTGAAACGTCAATTTAAAAATAAAGAATCAAGGGTTA 

AACGGTATAAAAATGCCGTTAGGAAAGACAAAGCAGCTGAATTGTATTO 

TGTTOCTATATTTGAGATCGOTTTAAGAAAGAAAATTGATATT^ 

AGCATACAACCACAAACGAATCCTGCATTGAAATATCAAGGTTGCTTO 

AGGTTGCATTATCTAAAATAGAAAATAATAGCGGAGGAAAGTO 

GCGGTATCTGTTTGCGGGAGGGAAAGATGCTCAATTTGATGCT^ 

TCTACACCAAGTGTGCAATATAACCAAAAATGGATTTTTCGAGAA 

ACGAACCGATTTGTTTTAGTTTTAAAGGTGCAATAAAAGA^ 

TAATTACATGGATGTAGATACTGCTTCAGTATTTAGCCATTTTAGCGA^ 

SEQ ID 156 

MERYKNAVRKDKAAELYLLNLSLSRELFHWSIFErVLRNKIDI 
XKVALSKIQNNSGGKFDHNQIiVAGWFGFWRYLFAGGOT 
HHEPICFSFKGAII03TGYARNIHQSIFEI^^ 
SEQ ID 157 

ATGAAACXKJTTTATOCTGCCGGTGCTTTTATCCGCAACAACCGCCC 

CTGATGAAAACATTCACAGTGATACAAAAGTCAAATATCTTCAG 

TGACGGCAATATCTTCGCATACCGTGTTGTCGGTGCAGCT 

TACGGTGAAGTTTCACGACAGCACAGCCCCGAAACGCTGGCG 

TGTAGACCGGATTGGTCGCGCAGATGGACAAACCCGGAAAAGATGATGTGT^ 

AGA03GAACTTCCCGTGAGGCATGTACGAAAGTGGAG 

SEQ ID 158 

MKRFILPVLLSATTAPASPIVGTWHCIGTD^ 
YGKVSRQHSPETLAWLEKSEI^VRIiLESMMyTGLVAQMD 

SEQ ID 159 

ATCATCACGCCGAGTAAAATTGTAAATTAATGTTGCAGGGGCTT 

AGACCGCCGTCTAATATTTTCACCGTACCCAAACTTACACCATAACGCCGTGAAGCCT^ 

TACCGCTAATACCCCCGTACAACTGCCAAGCCGAGATA^ 

ACGTTTTGCCAAATATTCGTCATGGTATTGTTTGCGGCCATAC 

CTGTAATCCAACACAATACCCGTATTTGCCGTAT^ 

CTAGACGGCTATTCCGATAACGGTAGCCTACTTCGGCATGAAACGCCTGTTCGCTAAAATC 

GGAACCGCTGATGCCTAAGCGGGCAAAGTGATTGCCTGCCATC 

GCGCGTTTGGGCAGGCTGTCTTCCGATTTTATCCATTTCCGGCCGT^ 

TATCTGTTTGTTCATAGTTCATATTGAACGAAGGATGC 

CCGCATATCGGGCGGCAACACCTCCTCGGCCCGGTGCAACIH3AAC 

CCCAAATCAAAACGGGGATAAACAAGATCGGGTCTTTCCGA 

SEQ ID 160 

MFEAADLYRELLSERPDLVYPRFDLGV^ 

NASLSRDIVINGRKWIKSEDSLPKRANGIRYEIiGVDRMFNMAGNHFA 
FRPFVKQNRIX3NNRYTANTGIVLOT 

NTVEKEQASRRYGVSLGTVKILIX^LGLKLGAGYTKRIFKAPAm 
SEQ ID 161

TTGTCCGATACGGATAAACAGGCTTI^ 
CGAAGTGCCCACATTGCCAATCTAC^CATTTCGTCAAAAATGG

AAAGTCTTTTGTTGAGCAGACAGGCACTATTC^ 

ATTGAAAAATACCCACTTCGCAAATGTGCTGAAATTT^ 

ACGCACTTCAAAACATGATGAATGAAGTTGAGTTAGACGGCATTC 

GCACCAGZU^AAATTTTAATCTGCCACGTCCTGCTCAGA^CGA 

TGTGTTCCTTGTGGTATCAATTTAGACGGTAAATCTGTTC 
SEQ ID 162

LSDTDKQAFIiTSVSSKEQVKK^EPRKVTKCPHCQSTHPVKN 

IEKYPLRKCAEICKINIiATAFTWRHKILDAIiQ^ skeqv 
OTPCGINIJDGKSVARISNIiGKPSIjKtra 
SEQ ID 163

ATGTTTGCTCTTGCCGCCAGTAATATGATTTTACGCGGAGACGG 

TTCAGGATTTAAOH^AAAATCCCAAACCTGAAACAGGCTO 

CAAAAGCGATGCCGAATTGCATGAACTTTATTTTOT 

ATTCCAGTCAGTTGCGTGATTGCACCAAGCAA^ 

CGAGCGAACTGTIITOACCCAGTTGGCACGGTAACGTGCATTGTC 

TGACX^CGGACACACAAGAAGAAATCAGCACGAAAAAAGCCTGTC 

AAAACCAAACACTTGGGACGCATTGATTTATACGACCGCTGGCAQ 

AAGTTCACACAGGAGAATCGGTAACAGCATTTGTAACTGATAACGATGAA 

OVAAATTACCCGAGC^GATTTTGAGC^ 

AATTTGGATAATGAAAGCTATGAAGACGATGACAATAACGACTTC
SEQ ID 164 
MFAIiAASNMILRGTCKANLHQSSCFMTDF^ 

I PVSCVXAPSKAKSE rTKYHRLKAVMSMPSELFYPVGOVTC IVVFRAHKPHFQTWIDPIXrQEEI STKKACRKTWFGYWRDBGFE 

KTKHTCRIDLYDRWQGIKARWLEHYI^^ 

NLDNESYEDDDNNDFGDDB 

SEQ ID 165

ATGGTTGAATTGCAAGAGATTTTTGATGTGAGTTACGGTTC^ 
AC'ITTGTAGGCAGGTCAGGCAAAAATAATGGTGTAACAG^ 
ATTAACTGTTGCATTAGGTGGTTCTGTTTTATCT^ 
AATCCC^UUVACAGAAATGACAGAACAACAAAAACTCT 
GTAGGGAAGCTAACCGCACGTTACGACAACTGTTTGTGCCATC 
TCCATCTGCGGGTGTAACCGAACCGAAATTAAAAGAATCGCTTGATT^ 
TTTACTATTCAAAACGGCATTGCCGCAACCAAATTAAAAGAATTTO 
CCAGTACGCAAGCAAGAACGCTACGAAGTTATATCX3CTCGTC 
CACTTCTACCAATGGCGAAGGTAGCCATACTTATA^ 
ACACCCATACAGTCCGATATGCCAATAGAAGTAAAACTC 
GACGTAAACCCAAAGGCGAAAA&TTAAAAAGCATTATC 
GATTCACACCTTGCTATTTAGTAACAATTTG 
SEQ ID 166 

MVELQEIFDVSYGSKLDLNKMSSFNPTINF^

NPKTEMTEQQKLFYCAAIFANAYRF SACGREANRTIjRQLFVPSL1)EI PSWVESVNliNP SAGVTEPKLKESLDIjPVVRQSKRIjDEI 
FTIQNGIAATKLKEFEQRQKOTVVYIRPASTQART 
TPIQSDMPIEVKLYYAKCITANRYLFSYGRKPKGEKLKSIMIiPYFI^ 
SEQ ID 167 

ATGAAGAACTGTACTCCAACTAACGGCACATGGTTCAAAATG^ 

CACCTACATCO^CATCCCCGTTACCCACATAAAATGCCTGCGGAT 

CACTACCCCTGCCGCAGAGCATATTGAGCA^ 

GCCGATAAAATAGCCCTCAAAGCAGGCGGGGAAACATTTGTCAGCOT 
CCGCCGCAGGATTCCCTCAACCCTATCTGTCC^ 
AAACXX^TTAGGCGTTTCCCC^ 
SEQ ID 168 

MKNCTPTNGTWFKMGATL1VAVAACTVSTPTSAIPVTH 
ADKIAI^GGETFVSLRMKKGFTQSEIiATAAGLPQPYLSR 
SEQ ID 169 

ATGACGGTCCGAAACACGCAAACCGAAACCGTCCGGACGGAAGCC 
AAAACOSCGCCTTCGAGTGCGTCGGGT^^ 
MAAAAAGGCGGGCGCGAAGACTTGGAAAAAGCCCT^ 
AAACACCGCCGCTATGAAAAAATGTACGCCGGTCTGAAAGATTGCGGGTTTO 
TCTCCGCCGCTTATTACATCCGCGACGGCGAAGACAATTTTGC^ 
GCCTGAAGCGGGGCGGGCCCCX5CACCCTGAAAGCCCGATGCCGCCTGA 
SEQ ID 170 

MTVRNTQTETVRTEAAPQQGGNTNPGYYKNRAPECV^ 
KHRRYEKHVAGLKDCGFIXXSTEAAI^^ 
SEQ ID 171

ATGAACGTGAACCAACTCACACMGAA^^ 

CCCAGCCGGCTAGTTTTACCGCCGGCCTGCAAACCTATGA 
GCGCAACOSTATCCCGCGCGTGGGCGGCGGCCGCACCATCGGCTCGAA

CGCGCCGGTATCGGCGAAGGCAGGCGCGGCGGCG?^^ 

TGGAAAACCAAGTAACCTTTGAAGCGGATTACGC^ 

GGCTACTATGGTTGCCG^GAAATGATTTTGCTGGGCGGCAACACC^ 

GTTTCCGCCGACGCGGCGGGGAAAATCAGCGTCAGCACCTTGTCTC 

TGGCAGGCGCGAACAACGGTGCAACCGGTCAAAG^^ 

GGATGGTACAACCGATACGTTCGGCGGCGGTTCCGCCCG 

GTAACCGCCGTGATTCCGGCTGTTCGCGGCGCGGTTGCCT^ 

TTACCACTGCCGCCAAAGTGGAAATTTCGGCTGATGCCGAAGGTACrcAG^ 

TTCTATTTTGGAATTTGACGGCCTGTTGACCCAAATCGCC^ 

CTGAOTTCCGACAACGCGGGCGGCGTGTATGAATTTGAAGAGGCGTO 

CCGTCTACGTCAACGCCOSCGATTTGGCGGCGCTGACCAAGCT^ 

TGACGTGAACAATACGGCGAACATCCGCGCCGGTC 

GTGGTACACCCGAACCTGCCTGCCGGTA^ 

TGCAAGTGCG;rACGCGCCAAGAGTATTAC 

GGTGCTGCAAGGTATGTTCATGCCTGCCT^ 
SEQ ID 172 

MNTOQLTQETIELmSAQAGGGPPDKGFTQPASFT^ 
RAGIGEGRRGGVINHETVERNAQFRAIGIiENQVTFE^ 

VS ADAAGKI SVSTLSVICVALGLQAYWDVAGANNGATGQSPNIKTAQVPAKI TRRNADGTTDTFGGGSARKSAAASV SG IEAGKK 
VTAVI PAVRGAVAYAWYWGAAGSEKLGAVTTAAKVE I S ADAEGTQTAASLPSEDNSTS ILEFDGLLTQIALPDSGAFWSDNKGGG 
LTSDNAGGVYEFEEAFAHFFTRYRLSPDTVYVNAI^ 
VVHPNLPAGTYIjFYSSRLPAYVQGVGN^ 
SEQ ID 173 

ATCGCCQCCCCGQTATCGCTTGAGGAGTTCAAGCAGCGT^ 

TTGACGGCGTGTCGGCGGCGGCGGAAGCCTATATCGGGCGC^ 

CAAAGACCX3CATCGTGTTGGACAATTATCCCGTCCTC 

TTTGAO^CTGGCTGCTGATGCGCCCCGAAGGTTTCG^ 

GCATGCCCGCCGACATACGCGAAGCGGTGATGATTATCC^ 

GACGCTTGCAGGCGAAAC^GTCGCTTTTTCTACGTTC 

TACAAACGAAAGGGCGTG 

SEQ ID 174 

MAAPVSIiEEFKQRIQVEHDRRI^FFIiSVIDGVSAA 

FDNVnjLMRPEGFARGLKNVBVSYRAGYERMPADIREAVMI IAVQRVNEIBGKGVRSKTLAGETVAF STFGNSGGMPPSAFAHiNE 
YKRKGV 
SEQ ID 175 

ATGCAGTTGACXjTTTGGCAGCGGCGAAGTGTTCXX^CCAAATC 
TGCGGATTATGGGCTTGCAGGAAATGTCCGTCGATTTC 
TGTGGCGCAAGGCAAGGTCAAGGTGTCGGGCAAAATGAAAGGCTC 
ACGGAATATGCAACCGGCACGATGAAGGCGCTTTGGGCGGAAACTACGGGC^ 
AGGCAGCCGCGCCCGGCGGCGGCAAGTTCGCTGAGGACGCGGGCGTGATGGGC 
CTCCCCGAAACAAGGCCAATAT^CGGTTTCGGAATCGGGCGTGTACGCCTTTAA 
AGCTTTACCTATACGCAAACGATGCCCCCXS^ 
TGAGGTACCTGACGCAGTTCAAAGGCAAAAAAGCCC 
TAAAAACGACGACTTCTCCGTCCCCGAAATC^ 
CAGGAG 
SEQ ID 176 

MQLTFGSGEVFAQMITDAYGNRVQNATPVRIMGLQEMSVDL SAELKEFYGQNRYPLAVAQGKVKVSGKMKGAL INGLTLNTLFFG 

TEYATGTMKALWAETTGKVLDGDNYSYI^AAAPGGGKFAEDAGVM 

SFTYTQTMPPAKKIELTNMATGNTPTFKMRYLTQFKGKKA 

QE 

SEQ ID 177

ATGGGTGGTAAATCGTCAACCGTTACATCTGCCGAAGAGCGGATTTTATCX3 

TGCCCK3TCATCTACGGCAGGACGCGCGTGGCGGGCAATTT^ 

GCGTCAGGGCGGCAAGGGCGGCGGOXSTGTG;^ 

GAGGGCGAGATTCAGGGCGTGGGGCGGGOTrGGCGCXSACAAGGAGAAATTC 

GC^GCGGCGACGAGCAGCCGTTGTGGACGCACCTGCAACAG^ 

TTTGTGCAGCCCGAACTACGAACTGACGAAATCGGCGCAAATATATCAGC^ 

GGCAAGATCCCCGATGCAAACCCGCGCGAAATCGTATO 

ACATCGGCGATACCGACCGGTACAGCAATTATTGCCGCGCCGTCGGGATTTTC 

GCAACGGAATATTTCCGAACTGCTGGAGCA^^ 

GACGGCAGCCATTCGGGGAACGGCGCGGTGTACGTTGCCGACAATAAGK 
CGGGCGCXK^AAGACCCTGTAAAGGCCGGGCGCAAAACCAATGCCGA^^ 
CAACGACTACAACGTCGCCATCGCCGAAGCGAAAGACCAGGCGAATATC^ 
ATGCACGGCATTTGCGATGCGAAGGTGGCGCAAAAAGTGGCGCAAC^ 
MTTTAAGCTGGGTTGGAAATACKXIC^

GCCCGTCCGAATCACGGAAATCGAAGAAGACGGAGAAGGGGTTTTATCCGT^ 

GTGTCGGAATATCCGACGCAGCCGTCTTCGGGTTATTCGGC^ 

TCGAAGCGCCGTTGCAACTGACGGGCGGCGAACCGCAAA^ 

GTGGGTATCGGCGGACGGCGACAGCTACACCCGCGTCGGCGCGGTCAACCGC^ 

CCCGACGGCGCGGTTTTCGACCGCAQU^TACATTGGGCGTC 

GCCGCGATTTGCTGACATTGTGCPACGTCmCGGC^ 

ATTGGGCAACCTGACGCGCGGCGO^ 

TTCTTCAAATACGCCGTTCCGCGCAACTO 

TTCAGGATTTGGCGGAAGTGCCXSGCGTATTC^ 

ATCGTGGGCATACGGCAAAGAAGCCGTCATCGCTTGGG^ 

GGC^CCGACGCCGTCTGCGTCO^TTGAC^ 

GCCAGGTACGCGGTATTGTCTTCAAGGTTCGCGGGCGTGCCGOT 

TCCGCAATTGCAGGCATTGCAGGGCATCTCTATCGA 

GACTTTGCAGGGATTATCGTCTGGGTTO 

CGTTTATAAC CATCGCCAAATGCGGCGGCAAACCGCTGGAGAAAGGAAAGACCT 

CX3GCAAAGATAACCTGAAAATCAGCGGCAGCATATCX5TTTA 

AACAAGGCTTTGCGCGACAAAATCAACCTGATTGACG^ 

CGGCGGACGGCAATGCGGCGGCGGTTCAGACACACGCCCGAAG^ 

AAACGGCAAAGTGGCGGGCTTCGGCTTGGCGACTACX^CGAAAAACGGCAC 

TTCGGCATCGGTGCGGCCGGGAAGGCGGATGTGTTCCCGTTC 

TGGTGGTCAACGGCAAGGCGGTTGTCGATAAGTTGAACGCCGGGGATATCCA^ 

AAAC CGCCTGAAAGCCGGAAGCGTTACGGCGCGGGAAATC GGGGCTGCCGCCGTTACCGCCGATAA7ATCGGGGCAAACGCGGTG 

ACI^CGATAAGATTCAGGTTGCCGATTTGAGCGCGGTATC 

GGGGTAATTTTACGGTGTCTTCAGACGGCATCCTGACGGCCGACAATGCGGTAA 

TTTCAACGGCACGGTCAGGGCTTCGTCCGTCGAGGGCGACGTGTTGAGGGCGCA 

GTGTTGGATTTGGATAAAGACCCGCTGCCGAGGGTTTTGATTC 

TGCAGGCAAAGCTCCTGTTGAACGGGGGTGTGCTTGCGCC(^ 

CGGCAGAACTTTCGGGAGGTACGAAGACCTGCCCCCCCGGGCAAGGGGGGGCGGGGATTATAC 

AGAACCCGCCTCGAGTACC^GATTCAAGTCATCCCCGCGGGCAAACCCGTCAGC 

TGTTTTCGCCGTTCGTGTCGGTOTOSTATTTGGCGCA^ 

CTTTGCGGAAAGTTTCCGGTACGACAGTAAGAAGCAGATTTATTTGGGCGGCGAC 
TATAAACATTGGAATGCTTACGGCGGGCTGCTGC^GTTGCCGGACGATATCTAOT 
ATGCCGACTGGAGTACGATGCTTGCGTOTGACCGGTCGGATAAG 
GTATTCTCTGTTTCAGGAGGAGTTTAATACTG 
GAGCTGCGCAATATCAGGGTGCTGATTCCGGAATC^ 
SEQ ID 178

MGGKSSWTSAEERILSLQVQRSSQGLTLPVTYGRTRVAGNLIWYGDFVTIEro 

EGE IQGVGRVWRDKEKFDSPAQLRLTLMRGGDEQPLWTH^ I YQHNFEVTGKSGYS 

GNIPDANPRE:H7IiDLLTNQRYGCGFPSQNIGOT^ 

TCSHSGNGAVYVADNKAAYDLTDDDF^ 

MHGICDAKVAQKVAQQLLQRALYVRNEYEFKLGWKYCLLEPMDIVT 

VSEYPTQPSSGYSADYWSPGNAHVPVTFEAPLQLTGGEPQIWIiAT^ 

PDGAVFDRTOTLGVEISAGQLTCGTEQDSRDLLTLCYVIX3E 

LFKYAVPRIWIGRTVWVKLVSYNVFGGGIQDLJ^ 

GSRRRLRAVDGIVDNSYTYTQADMKAIX3GQWGIVFKW 

DFAGIIVWVCENAAC P AADANKAYDGAETF IT IAKCGGKPLEKGKTYYLRAAGYDSFGKDNLK I SG S I SFTVYDVS ATDL SE SSL 
NKALRDKOmiDGNGAGSVNEKVEAW^ 

FGI GAAGKADVFPFVVOTQKNRVG INGELWNGKAWDKLNAGD IHGGKIAAOTLDANRIjKAGSVTARE IGAAAVTADKIGANAV 
TADKIQVADLSAVSSNLGSITGGSLNIGGGNFTV^ 
VLDLDKDPLPRVLIPNFYWSETFGNNRVQAKLIiLN^ 
RTRLEYQIQVIPAGKPVSLKLSLASHESWSPFVSVSYI^ 
YKHWNAYGGLLQLPDDIYGI SFEYRPFTOADWSTMLAFDRSDKFVV^ 
ELRNIRVLIPESRENEVWPSV 
SEQ ID 179

ATGAGCGATTTGGAAKCAAAGTCAGGATAACGGTAGAAAACCATACGAAGCM 

AGGCGGCGGAAAAGATGCGCAGCAGCGGCGATAATGCCGCAAAGGGCTTO 

CAATTTGCACGCCGATGTGAAAGCGGGATTTGAAGCGGCGGG 

GGTAAAATCGGCTCAGGCTTATCGGGGCTGACCAAACTC 

CCGCCGATGCGATGCAGTCGATAAACAGCCAAGTCCGACAAGTCAC^ 

TCTCGATACGGCAAACCGTACCCGTGCATCGCTTGAATCAACGGCAAACCTGTACGOT 

GGCTACACGCAGCAGGAAATTTTAAAATTCACCGAGGCGGCCA^ 

CCGCCGCGCTGATGCAGTTATCGCAGGCTTTGGGCAGCGGCGTAT^ 

TATTTTGCTGGATACCATTGCGGAATATATGGGCAAATCC^ 

GATGTGATTTTCAAAGCCATTTCCGGCGCGTCGGAGAM 

TGACGGTGTTCTCGAAGAACTGGCAAAGCATGGTTT 

TAAACTGATTGCGGATAACCTTAATTTGGTTGTTCCG 

CTGGCTTTGAATCTTGCACTGCTGGCAAACCCG 
TCGGCGATGAAATCGATGTTTTCGGCGGCGGCTGGTCGAATCTT

CGTCGGGGAAGCGGTGGGAACCGTCAAATCGTGGTTTG 

TTGTTCGGGCGCGTGATGAGCGTCATCTCAAGCGCAATCGGCGCGTATC 

TGCTGATTAAAGAAGCCGCAAACGATATGCCGCAATTCTTCGO 

CGAACGGATGATA7ACAAAGCGGTAGGCATGATTAACAGCATGATTGA 

GGCATTGGAAAGCTGAACAAGGTTCGGATGGGGC^ 

ACCGCGCCGGCGCAATGGCAAATGCCGTCCGCGAACGGGCC 

AGGACACGCCAAAACCGCCCGGAAAAAGCCGGGCGCAAATC^^ 

GCGGCCAAAGACCCGATGCAGGCTTGGGAAGGGGAAATCAAAGCCCGAAAACTC^ 

GACAJCCAAGAATGGGATTTGGCGCGTGAGGCGGAATATT^ 

GAAAATCCGCGAAAAAATCCTGATGCTTGAAGACCAATT^^ 

GAGAAATTGGACAAGCACA^GCTGGAC^TGGAGAAAGACGCGG^ 

GCCTGGACTTGGAAATCC^GTTTGAAAA^ 

AGACCCGGCATAGAGTCAGGCGGCAATCGGCAAGCTCA^ 

AAGAACGAGGGCAAACGCGAAAACCAACGCCGCAAAGACGCGCCCAACGTCATGGAJ^ 

GGCAGGAAGCACAGCAGCAGATGGGGCAGGCGTTCACCGCCATGCTGG^ 

TTTCAAGAGTATGGGGCAAACCTTTATCCAAGAAATGGOT 

GCCATTTACAAGATGATTTTCGGAACTAAGGAAACGCTGGAGACAGCGG^ 

CAACGGCGGTTGTCAGTAGTAATGCTGTTCAAGCGGCTTCAGGC 

TCTTGCCGTTGCGGCAATGG03GCAATGATGTC 

ACGACGCGCATTCCCTCGGCGGCAGGCGGCTGGGACATCCCGGCAGGCATCAACCCGCT 
TCTTACC05CAGAACACGCCCAAACCATCCGTGAAATGGCGGATCAGCAGOT
AGGCGATTTCATCCATAAAAATGACTTGGCGAAGCTC^ 

GKIGSGLSGLTKLIAGLATAOT^ 

GYTQGEIIjKFTEAANNAMTIGGVGAQQQAAAIj^^ 

DVIFKAISGASEKFGEQAAKMPVTMGQALTVTSNNWQSMVS 

LALNIiALLiANPFGI IAVAIGTWGLIAKFGDEIDVFGGGWSI^SDTIRAVWQI ITETVGEAVGTVKSWFDGLTGRLNEGAGGWPS 
LFGRVMSVI S S AIGAYVNVYINIFATGV7ML IKEAANDMPQFFANLGKAIGNWIS AIERMINKAVGMINSMIDFANKAASMAG^ S 
GIGKLNKVHMGRMDTCGLGERIHDSLTKDRAGA^^ 



EKIJ3KHKLEMEKDAADQALAIX3ia 



AIYKMIFGTKETLETAAAAKTAATKATETTAWSSNAVQAASGAAASQAHIPYVGPIW 
TTR IPSAAGGWDI1?AGINPLTQLHE^^ 

CATCAGACGCGAAGGCGTGGGCGGCGCGAATAGAGTCGCAAATTATGGAC^ 

TGCCGACCTTATCCGGCGGTATTOATCCGA^ 

CTAAAAACACCTTTGGCAAAGGTGCGGCTTGCCG^ 

TATCCCCCACCAGCGTCGGACGCGAATTAACCACTTTATCCGCCGTCTGCGAACAC 
AAACCCTGTACGCAAAATCAGCAAGCCGAAAAAA^ 
GCCGCCCTCCTATACCGGCCTAATGAGAAACCGAAAATGGCGG 
CCATGCGGGC7VGGCGAAATCTGCGGCCTAAAATGGGCAGATGTGAATATGCGGCGTC 
CGGCGACAGCCGTGACGTGCC^TTATCCTTGCGAGCCGCCGAACTAATAGAGCAA 
AGCCTGGATGCCAAAAGCCTTGATGTATTATTCAGGCGAGCGCGGGAT^ 
GCCGCGAAGCCCTCACGCGCCTATCAAAAAAAGTGCCTGTGGAGGTCTT^^ 
ACTCAATGTTTACTACCGCCCCGACATGGCGGATATTGCAAAAATC 
SEQ ID 182

MATITQRNGKWRVQIRMKOTSRSATFER^DAKAWAARIESQIMre 
LKTPLAKVRLMLRPQDFADWKDQRLQEVS 

AALIiYRPNKKPKMAVQRVAVAVLPAIETAMRAGEICGIiKWADVNMRRRXAH^ 
SLDAKSLDVLFRIUVRDNCGIQGLHFHiyrRBEALTRLSKKVPVBV^ 

SEQ ID 183 

ATGACATATTTAAAAGTAATAGCAATTAGTATAGTTTTATAC^TT^ 

TTGATTTCTTAGTAGAAAATATAGATAAATATTACCAGCAATATGGATCTTACCCAAA 

TGATTTTACCACTGAATCATATTGTGATTTTTGGGATAAAAATAOT 

GACTATACTATTTTAGTCATGGGTTTCTCATCG 

ATGAT 

DYTILVMGFSSKILFSSHNKIKEFNSNKYD 

S T^TC^CCATTACGTTACGTTTATGTCCCA^ 
GCTTGGGCTATGATTATCTTTCGCTAAAAAATGCAAATTGGGACAG 
CTGCCGCATCAATCCCGATTTGCCGCCGGATG 
GGGCAAAAATTTTACGAACGCCTCACCAATCAATCCGGCGG 
ACGTTGTAACTGAGTTGCCTTGTATCAACGGCGACGAAGCATTCCGCCCCGACATTGCCC



AGCGCGAATTTACTGCCCGCGATTTAGTCGTCCATACC&TCGAC^ 

GACG 

CGAACTCTTCGGCGACTACATCCACAAATACTATTACAM 



i r rcr.-c?.xr atacas sgcgcaa ^^^^^^^^?5^^^^^^^^^Q^a^racA^MAA^^^ccM^M^^^o 



GACTACGACCTCGCCATTCAGCGCGTCTATTTTCTCGACG^ 

CCGACGTAAACGCCC
CGGCGACTACATCa

ACGCCCATCCGCACTTCGTCC^CC^^ 

ACCGCCGCGTTGATATTGCACGACGTCGGCACTAAGGAAGAACGCGACCAATGGGTO 

TCAAAACCGCCGAAGAAATCGAACAGGAAAT r 
TTGCAGCCAAATTAGCCAAATAGAAGACAAA 



TTGCACXX^TTArcCAAATAl^^v^ 



GGCA2 
TCGGC 



ATTACTTTGAAAAACAAATGCAAGGCATCGTAT 

AGCC^TCAACCGCC^TTGGTGCGGGAATATTTAAAAG ! 



YIjTHYYAKQGIVPKI 
DYDLAI QRVYFLDKV 
IGSRYKAQLQEAIAC 



GTGGCTTTCG^ACOGAAOOTA 

^ATGGAAC 
AATCCA' 
kAACCCOK 

ATCT TGCTTAGCAAAAATCCCA^ I TAAAGATAT rAAGGATTT 1 C iAAAATATCCTOTTCTTGACCAA 



CCGATTTTAGAAAACGAGCGATTGAATGTATGGAAGGAACTTCCGGCAGGC^ 

1AATCC 

CCTAf^ATC^^ 



r^OTATTCCAGAACCTCAAATCCAACAATCCATCGCCGCCGTCCTATCCG^ 



CAAGAATTGTTAS 



"caagaattgttaagotagttaattttcaatatgcgcggggagcagat^^ 



£aAA™AAATCAAATAGATTTAT^ 

AACAAAACCAC^CCTGACCCAACTACGCGATTTCCT 
AAGAGATGGA 
SEQ ID 188 IQ ^^^ Q ^ GALAIU ^^ 



SODFICGFTNDEKSILNPQDAHIIFCJDHTRXVKJ mfUlAi^fli^iyvxu^ — ^ARHFKFIjKEI 

KIILPSKDISQKYireiAmTFVKVWm,KQN^^ 
^^IaLcTATTTAAAATATTATT^^



TOAGAATATGTTAACGATAATTTCTTACCTAGCT^ 



AGTTCGATTTCTGTATATCGAATTAA^ 

SSIAVYRINNFTFLBK 
^ATCAAMTTOTTTTAGCTAAAATTACTTGCA 
AGATTCTTCTCCTAAGTTTTTGGATAACTCAAATTTATT^ 

^^taaotataS^ 
tatamaaag^^tgataaa^ 

gctac^aaiactgaagtacagcaattcttagaaagcgattttattga 

■EATSSTATTGATGAACGTTATTACACAACAGCAOT 

ygideryyttalgkekklansvqpi 
S a^aaacagccgattaagccaac^^ 

nfiCAGCGATCTCGACCACATCGCCGCCCAATACGGACT^ 

TCAAAACCCAAAGCGGCAOKCCGACGAAACCATTT 
^S^QLPAPAAIEETT^IFAR^^ 



SKPKAARPTKPF 
^ATCAATC^GCTTTTTACCCAAGAACGCAGGAT^ 

Jag^Jagcaa^g^tgggagcaatacccgaas 

rRTCCGGGTAATGGCAATATG^ 

CATTTCATCTATTCCCTTGGGTACAGGGGAAATAGATTGCG 
TGGCATTT 



LNYEDALDLFYIMVNGIKIjKDISDLPLDLAF 

^atcacca^qsaagtcaacagcggacaagtgcgcggcccggto 
aaSScatcaca^tggcggtaaccaatgaa^ 

S !S^S 8 GOVRGPVQIA^ 
SEQ ID 199

ATGAATGCTGTCCAAGTTTTAAACTTTCAACAAAACTC 
ACGATGTTTGCGAGATTTTAGGCTACACCAACC^ 
GACCCCCACCACCAGCGGCGAACAGGAGATGACCTATATT^ 
GCCGCAGAAGCGTTTGAAGAATGGGTAATCGAAACCGTCCTGC^ 
CCACCXX:C^ACGACCGTACCGGATTGCGCCAAGCTGTTGCCGCGCTTG 
TATGATACACGAACGCTTCAACGTCGAATCCATC^ 
TTGACACTGCACACGGGCTTGACAGGCGAAGTGTTGGACGCACCGCCC^^ 
TAGCTOACATTGCCGCTATGGTTTATTACGGCACATGGATC 
CTGCAGGCAGGCGGTTACGATGTGGACGGTTTGGCACGA 
CGGGGATATGCGGACAAAGACGCCTCCGACCGCATAGCCGCATGTCTTGAA 
SEQ ID 200 
MNAVQVLNFQQNSVRTVADNKGELW^^ 

AAEAFEEWVMETVIj pai RKTGGCQVGPKTTADDRTGLRQAVAALVGRKGIDYS saysmihqrfnves iedlpagklpeavayvha 
LTLHTGLTGEVLDAPPKAEPKLPIIX^SLADIAAMVyYGTWMIESGTO 
RGYADKDASDRIAACIiEGIYGKATVR 
SEQ ID 201 

ATGGCACAAAAGACGAATACGGCCTACGGCGACCCGCAGGCGA^ 
GK1AAGAGTACGCTGAACCGTTTGGCGGGGAAGATGCCT 
GCATATGCCGGTCGTGCGCTGTCAGGATTTGACGCGCGGCATGGGTC^ 
CTGCCGATTATGGGCGACAACACGGCGGAAGGCAGGGTC^ 
GTTTCCCCGTTGACGGCGGCGGCACGATGACGAATCAGCG^ 
CCTGATGGACCGTTATGCCGACCAGACGCTGTTGGTGCATATGGCGGGCGCG^ 
CCTTTGGCGGGCGACCCGAAATTCAATGATTATG^ 
GCGATGCGGTAACGGGCGTTGGGGACAACGGCGGCGAGTTGAA 
TATGCGTACCGTGCTCGACGAGATTCCGCTGCCGCCGCC^ 
CGCGTGTGGCTGCTTTCCCCGGCGCAGTACAACCGTTTTGCCGCCGATC 
GCGCCTCCCAGGCAAATCAAAATCCGCTGTTTTTGGGCGATGCGGGO^ 
CATCCGTTTCTATGCGGGCGATGAGATGAAGTATT^ 
TTTGCGGACAAGTTTGCGGTCGACCGTTCGGTTATTTTGGGCG 
GCGGTATGCCTTTCTTTTGGTCTGAGAAGGAGCTTGACC^ 
GAAAACGCGCTTTGCCGTGGATGTCGGCGGGGGTGCXSAAG 
TTGCACGGCGGTATCCGC 
SEQ ID 202 

MAQKTNTAYGDPQAMMKQAAGLFAMHMQIWSTIINRIIAGKMPAGTAGABATLRK
LPIMGDOTAEGRGVG^LSEAGLRWQAOTPVIX^TM™
PLAGDPKFNDYAVNFVKAPSKNRHFTASGDAVTGVGDNGGEI^
PVWLLSPAQYNRFAADPKFRQLQASAIAWVSQANQNPLFIIGDAG

FADKFAVDRSVILGGQAVLEAFANTGKHGGMPFFWSEKEL 
LHGGIR 
SEQ ID 203 

ATGAGAGTCCGAAACACGCAAACCGAAACCGTCCGGACGGAAGCCGCGCCGCAACAA 

AAAACCGCGCCTTCGAGTGCGTCGGGTTTGCGCAATAC 

GGAAAAAGGCGGGCGCGAAGACTTGGAAAAAGCCCTGCGGT^ 

AAACACCGCCGCTATGAAAAAATGTACGCCGGTCTGAAAGATTGCGGGTTC^ 

TCTCCGCCGCTTATTAGATCCGCGACGGCGAAGACAATT^ 

GCCTGAAGCGGGGCXK3GCCCCGCACCCTGAAAGCCCGATGCCGCCTGA 

SEQ ID 204 

MTVRNTQTETVRTEAAPQQGGNTNPGYYKNRAFECT 
KHRRYEKMYAGLKIXGFDGGTEAALLAVISAAYY^ 

SEQ ID 205 

ATGAACCGAATCGAGGAAACGGAAGCCGTCCAATCACTGGCCAGCGTAGGGGC^ 

AACCGACGGCGATTGCGCGGTGCGCAATCCKl^^ 

GGATATGGCGGCGGCAAACGAGCCTATCGACATCATCA^ 

GGCCTGGCTTACTTGATAGACXITGAA^ 

TTGTCGAGCGCGGCTTGCTGAAGGCTTCGGCGGCGATTG 

GTCTAAGGCGGCCGATGAATTGGCGGCAGTCGGCAAAGAC^ 

TTGATTGGCGGTTTGGACAAAAGGCTTGACGGCGTGCGTTT 

GTTTGCCXSGATGGAAACCTGATTGTGATTGCGGCGCGTC 

GCTGAAGCAGGGCAAGGCAGTTCATTTCCAAAG 

AATATCCCCATGCAGAACCTGAAAACCGGCAATCT^ 

AGTGGAAGTTTGACGTGAACTGCGACCTGCTCAACGl^ 

GGATTTGTTGGTTGTCGATCACCTTGACATCATC^ 

TTGAAAAACTTGC^GGCAGAGCTGAATACCCCCGTCGTCT 

GCCCGAACATGGCAGACATTCGCGGCAGCGGCGCGATTGAGC^ 

CGACGGAAACGAGAATCCGAGCATTGCCXiAGCTGATTATCGC^ 

AAAGGGCAATTTATGAAGTTCGAGGAAGAGCCTGATTTGGCAT^^ 
GATTCAAAGCGGTCGGCAAGTAGGAGGCGO^



TTCGGCIGGTTGGCAAAGTCGTCAACX3GCGCGGATTCGCGCGCGGAAGTC 

ccgcgaacctgctttacgatg 
^cgtaaaagac^S^ 



CGGATCCTAC 



CGGACGGCAAACCTGATCAGCCTCATGCACAGGGAA^ 



RTANLI SLMHREAAA 



GGAGCGGCGCGGTGATGTTGGTGGATCAJAGCT^^ 
CGCGCCTGCCGGCAT' 
ATCAATATGGACGCO 
GTCTTGTTTCCGAAG 
TGGTAACTCAGACAT 

ACX3AACGGACAGGAAGAGGCGGAAAGCAACGGTOT 



SEQ ID 212



p|s=§i=li^ 

TG 
SEQ ID 214

MKAATKELIGLIiHGGDEFQMADLYTIT^ 

EGLQWAKAALGGVIilXBARVKIDRWFDAGLRPVGAVNIFSGRVSOT 
CKVNREKFTVNGRWENSRTGTVLKHOTjTQPDGWFSQ^ 
KRRDTCKDKFDNIVHFRGFPFIPSADTVV 
SEQ ID 215 

ATGTCGGTTCAACCAAGCGTTTCGGAACAACTGAAAGA 
AGAAATCATCATTAAAAGATGTACCAAAAGAATTGCAAGCCAAAGTAOT 
ACAATATGCAGGTAAGCTCGGTAAGGGTGAATCTGTCAAGGCAATGCT^ 
CTTCAGAAGATGGATGTATATGTTAATGGTAAAAAATA^^ 
GTGAGC&AAAGATTGAGOTTTACGGTGCGGATAAAGAACAG 
TGTGGTTAGAGGCTATTTTGGCTACAGTOSAAAAGACGGTAATCC 
GACTTGTATCTGGGCGACATTAGAGGCGTGGCAACAGATG^ 
CATTTGGCGGTAATGGGGTTTTATCCAAAGAATCGTTAGATAACCATAAT^ 
AAAAGGCTCTGGCTCAATTGAAGGAATGGAACAATATGGCAAGAT^ 
GAGTCCGGCTCCAGCTTGGGGTTGAAAGACCGGGTTTCTO 
AGATTAAAAAGTATCATTTGGGTATTTTCGGCG^ 
TGTTATCGGTTTCGGCGGCGAGAAGAAA 
SEQ ID 216

MSVQPSVSEQLKDNANVIDAKDEKVIEYLKKSSIjKDVPKELQ 
LQKMDVYWGKKYEGSKGGELbVLP^ 

DLYLGDIRGVATDEDKLPKAGSFQYEGRAFGGNGVLSKESLDNHNGVFRYTIDFDRRKGSGS IEGMEQYGKIKLKR AAIERI PYR 
ESGSSLGLTOKVSYFGVlffiGVAMIiEK^ 
SEQ ID 217 

ATGGCCGGCTGCX^TGACCGACTTGGAAAACTGTCTGGAAAAATACCTC 
GTACCGCCAAACTGCAGGAGCAACCGTCGTTCTTCAACAGGCTGATC 
GCAGAAACAGTCGGCAGCCATCCACAACGAAGCCTATATCGAAATGAAG^ 
CAGAACCGCAAAGCAGAOTATGCGGAAAAGGGCAAAGAGCGGA 
CTGCCGCCGTCCTGTTGCCCGAGCGTCCCTACCATTT 
SEQ ID 218

MAGCLTDLENCLEKYIjEQFGPVSESIEACTAKIjQEQPSFFNRLMKANDK^ 

HNRKADYAEKGKERIALFPRGIjNG ITKLPAAVLLPERPYHFDMKEVLYIFSRI PR j . 

SEQ ID 219

GGGGGCAAAACCCGCC^TGCATCTGTCGCTGT CATC CTGCTGTCGGA

CCGTTCGCGGCTGCGGTTTTTCCAGCTCGGACAG 
TTTGATTTGCTTCACGCGCTCGCGGTTAACOKSCC^ 
SEQ ID 220 

MGIHIFEITEKAGAKPAAASVAVHPCCRTYRFNFANASAIAE 
FDLLHALAVNCIiNMKFLVFIi 
SEQ ID 221 

ATGGGATTTATAGAACCGTTTTTATCAAGCTATACACCTTTAT^ 
CCCTACTTTCCAAAATCGTATATACGCACTCTGGCTTTCAGC 
GATTAAAACGCTGGTCGAGCATCCACAAGGCAAGCTAAATAA 
CAACAAGATTATCTAACCGAAACAGAATTGAATGATTATO 

AAATAAGTTATTTATGGAACCTATTAGACAAATTAGACGACTTAAAACGAGTAG^ 
ACGTATTTTTGGTAATTTGGATGAAATTACTC 

GAAAGTGAAGAACATTACGGTAATGTGAAATGTATGCTGGAAAAACTTG^ 
TCATCCTGTCGGATGATACCGAAGCATACGACCCAAATAACGGCTTGCTATTAAGTCGTAC 
ATATATTTCGTTTGATGATGAAGGAAATATGGTGAAATCTAAGCGGOT 
CTAGATAACAATCTGCTCAACGACAAACGTAAATCTTACTTGGCATAT^ 
ATATT 
SEQ ID 222 

MGFIEPFLSSYTPLSRDWQARTNRKRQTLLSKIVYTHSGF 
QQDYLTETELNDYFQQGIESGFIERKYNQISYLWl^IiDKLDDLKRV^ 
ESEEHYGNVKCMLEKLAYPVLIASHIKPFILSDIX^YDPNN 
LDNNLIjNDKRKSYLAYHREIjMLQEDQEFHI 
SEQ ID 223 

ATGACXSCTTGAAGAACAACAAGCCAAAGAAGCATTGGACGGCATT^ 
TTGCAGAAATTTTATATCATGACCGTTGTATAAAACAG^^ 

GAGGGATGAAATCTGC CGTCGGTTTTTAGGGCGGATTTCTACTTCATCTGCAAAATTC 
ATACCX3CCTGAAAAACTAGCGGTATTGGGAACATTAAACAGACAGTCGGACX3^ 

tcaatcgtttttcccaaatgagcgaagcattcgc^ 

tttattttggctcgagccgggattgaaaagaagtatagacaagata 

tcagaattaggcataacggtttcaattgattttc 

tcatcactatgccxsaaaaatgagcatttaaaacttcctgc^ 

ggatatgtggtctaacttcggattggccatacaggtcaaacatctctct^ 

attagcgcagatagaattgtcattgtctgcaaaaaggc 
GCCGAATTCAAAACATCGTAACTOAGGATGATTTGATAAGT^
GTTGTTGGAAAATATTAAAACTGAAATTATGCGTGAATTTC 
CGCGGATATGACATTACTGTTACTCATTTC 
SEQ ID 224 

MTLEEQQAKEAIiDG 1 1 KKSRVHL YKPIQ I AE ILYHDRC IKQIJ^FLNLI^TYRNQSKRWRDE ICRRFLGRI ST S SAKFQDNIiFEKNA 
IPPEKLAVLGTLNRQSDGGVESYIYKQFFNRESQMSE 

SELG ITVS IDF PKENLFLWEEYQDFAEKI ITMPKNEHLKLPAKIHRVGVTNAADRGLDMW^^ 

I SADRTVTVCKKAEQ SVTVSLLTQIGWKSRI QNIVTEDDL I SWYEKALRGQYPIAEALLENIKTEIIdREFPAVNEANEFLDFAQN 
RGYDITVTHF 
SEQ ID 225 

ATGAACTGGCCTTATTTAATCGATGCCGTACCCAAAra 
TTTTGTCGTTGCTGTTCGGCCTGCCCGTCGCCGTC 
CGAGCTGTCGCGCAATACGCCCCTGCTGATCCAATTGTTTTTCC 
TTCACCTGCGGCGTAATCGCGCTGGTTTTCTTGGGCGCAA 
AAGGGCAGGTCGGAGCGGGCAAGGCAATCGGTTTGAGCCGCTTTCAAGTGTTC 
CGCCGTTCCO^CATCGGCGCAAATATATTGCT 
TTTGTTACCAAAGACGTCATCGGTATGGACTACAAAA 
TGCCTGTTTCTTTGTTGGCGCGGCGGATTGAAAAC^ 
SEQ ID 226 

MNWPYLIDAVPKFADAAKLTLELSVYGVVLSLLF^^ 
FTCGVIALWLGASYMAEAVRAGIIAVPKGQVGAGKAIGLSRFQV^ 
FVTKDVIGMDYKTNEALFliLFAAYLI ILLPVSLLARRIENRVRSAKYGV 

SEQ ID 227 

TTGGGCGAAGGCCTGCTCTTAACGGCACAAATTTC 

TTTTGCGTTCGCGCAACCGGCTTGTCCGCTTTGTCGGA 

GTTCGGCCTGTATTTCGGCCTATCCGTCTGGACAGGCATCCACATC 

GGCGTTGCOSAAATGGGCGACTTGGTC^ 

TGAGCCGGGGGCAGGTAOTCCGCTGCATCGAGCTGCC 

C^TGATCAAAACCAGCTCGCTCGCCTGGCTTATC 

CTGACGCAGCCCAATGCTTCATTTTGGGTTT^ 

CGGCAAAACTTGAACAAAAATGGGAACAG 

SEQ ID 228

I/3EGLLLTAQISLISVAASCVI^^ 
GVAEMGDLVRGALESIEKHQVESGLAPGLSRGC^^ 
LTQPNASFWVYGLIFMLYFFCCWPIiSLIiAAKIiEQKWEH 
SEQ ID 229 

atgaaattgaacgccaaactcaaagcccttttggcttccgccg^ 

atgcccaatcttcacaaagcagcggtgcggcaaccgttgccgcca 

caaacccccgttcggctatgttgacgcaaacggcaaaaaccaaggct^ 

ctosgcagccccgacaaagtcgaattc^ 

tcctcgccaactttacccaaacccccgaacgcgccgaagccgtc 

ttcccccaaaaagaaaccgattaccgacatggcgg^ 

ttcttcaccaaaagccatcccgaagtcaagctgtt^^ 

gcgtag<^ctcgcgcacgacaacgccckx:tgtgggcgtgggcg 

tcctgccgaatttatcgcccccgccgttcaaaaaggcaacgccgacctcct^ 

aaagacggcostciksaaagccgc^ 

AA 

SEQ ID 230 

MKLNAKLKALLASAAIAVGLTACGGGSGDAQ 

LGS PDKVEFVLTEAANRVEYVRSGKVDL ILANFTQT PERAEAVDFADPYMKVAI/3WS PKNKP ITDMAQLKDQTLLiVNKGTTADA 

FFTKSHPEVKLLKFDQNTETFDALKTC 

KDGRLKAAYEKTIiLPVYGEKVKPEAIiIiAB 

SEQ ID 231 

ATGAGATTAACGAAATATCCOTATAACTCC^TAA 

CATAAGTATTCCCCCAAAAGTATTAATGCTTGATGATAAAAATATTTT^ 
CAAAATTCAAAAGTTAACATACTAAATATAGATGGAATA 
ATGAGTTGGGTGCAATTATTACAGATAAAAATTTAATAAGAGAGAATrc 
TAAAGTTAATAAGTTAATCATATTAAAAGAGTTAGAC^ 

SEQ ID 232 

MRLTKYPYNSIRELEDYLINTYQKYIIIjQEGGK^ 

SEQ ID 233 

ATGGTTGCCTATGCTTTCCTATTTT^ 

TCGCGTCGGCGCTGTTTGTCTTTTTGGCGGGCGGTATGCT 

CTGGTTTCTGGTGCTGATACTGTTCCACAGGCTGAAAA 

GCCGACTGGCGGAATTGGGAAACGCTGTTTCATTATAAGGAAG^ 

OK3TTTTCGGCTGGAGCGGTGCGGATTC 






GGTGCGGCATTTTTCCAAGCACCCCX3GCGCGGTAAAGAC^^ 
CTGCCGATGTCCTGTCGGCK2GG 

AAACCCGGCCGTACGGCATGTCGGATGAAAAGCCCGATATTGTCGTTACCCTGATGGAA 

TTTTGCCGCCGCCAAAATTCCCGAITTGAAAATGTTCX3GA 

TTCGGCGGCGCAACGTGGAAGTCCGAATTTGCGTTTTO 

TTTATTCGGTCGTACCGCATTTGCAGACCGGTTTTGTCCGC^ 

TACCAAGGGCAACTACAATGCCAAGGCAGCATATGACCATTTC 

C^GCCGATGGGCAAAAACCTGTGGCACATTTC 

ATTTGGAAAACGTGCGGCAGCCGATGTTCGTATATGTGCTGACC^ 

GTTTGATTTGGATGCGCCCGATTTGA^^ 

GCGGTGGAAAGTTTCGACCGTTATCTGCACGAACGCGGCAAACCCTTTC 
AGGGCGTGTCCGTCAGGAAGAAATGGGATTACGCTCAGCCGGATTATGTAACG 
ATTCGTACAGCGGCAGGATTTCCTCGACCTTGCCTTTGCAGG 
TTCATGCGTGCGAATATGGCGATGCGCGGT^ 
GCAACTATCTCTACGACGTTTTGAAAATTGCCCGT 
SEQ ID 234 

WAYAFLFLFVTAAVLLrTOSHYRWTYFFASALFVFIiAGGMLML SDFLLI 
ADWRNVraTLFHYKEAVIGMAGLLALACT 

LPMSCRAVFFQVFVFEGTCEAFARQMPSETRPYGMSDEKPDr^ 

FGGATWKSEFAFLAGVPSTDFGALASGVFYSV 

APMGKNLWHI S SEEMMQYARMILEKRHPDLENVRQPMFVY\^^ 

AVESFDRYlJiERGKPFVFGYFGDHQTOFEGVSVRKKWD^^ 

FMRANMAMRGIiCGGGLEDCPW^LVGNYRJ^ 

SEQ ID 235 

ATGAAGCACGCTCCGATTATCCTCCTGCT^ 

GGCCGCCGCTTCGCACCCTGCCGAAAAGCGC^^ 

CXK^TTCAGCACCCGCGCCTCAAACGGGCGCGTCTTGAAAAGCATTO 

CATCGAAACGGCAAACTGCATTCGCACACGCCGGOT 

GCATCCTGTACCGGGACGGGCATATCGTCC^^ 

SEQ ID 236 

MKHAPIILLLTASSAALLSAGCTALPSDRPPI^ 
HPNGKIjHSHTPVENGVAQGYTEQGIIJITRILYRIXj^ 

SEQ ID 237 

<3TGTC(^TCCCC^CCGCCACGCCGCTGCCCGCCGGCGAGGTAACGTTAT^ 

CCGGCGCCGGAAGCGCATCCGACGCGCCGAGCCGCAGCAGACGCTCGCTC 

TCGGCAGCGGGAAGTAGAAAAAGACTATTTCX3GCTATAAATCAAAAGA 

TACGCCCTTTCATCCTACGCCGACCCCATAACCGTC 

GGCTCGCCGACGGCAGCCGCATCTTTATCTGCTGCAGCGACTCCG 

GAAATTCGGGGCGTGGATAGGTCCTAACGGCGAAATCXjACCTCTTCGC 

TTCTCCTACGGCAGCAGCACGCCCGAAACCGCATTC 

GGCAATTTGTTACTTCCTCTTATAGGCCGCCGAAAAGCGGTTCTTATO 

CACCGCCAATTTCAACAGCAACAC^CTGGCAGGCAAAAT 

ACGATTACCGGTCCGACCTTTTCCX^ 

CCAGCACCCGCAGCAGCGAAGTCAGCATAGGCGGCAAAATCAC 
GAGTTATGAAAAAAAACTTGATGACACCAGTCAAGATACCAACC^ 
SEQ ID 238 

VSIPTATPLPAGEVTLSSDNGNIENINTAGAGSASDAPSRSRRSIiDAAPQOTSGISIRQ 
YALSSYADPIWSYSSPDFKIPDRHAGQRIJtfXSSRIFICCSDSGATSYM^ 
FSYGSSTPETALSKGKITYQVWGIRVRNGQFVTSSYTPPKSGSYYGT^ 
TITGPTFSGDATSGGKSGKLEGKFFGKFASTRSSEVSIGGKITF^ 
SEQ ID 239 

TTGCCGCCAATTTGCGGGGGGGGGGGGGGTAATATC 
TCAGGCACATTTCGATTACGACTTATGCCCG 

GGACGTATTGGTCAGGGACGGCCTGAAGCCCTTATGGGATACCGATTTC 
GTCGAAAGGCAGGAAGGATACAAACAAAAAATCGGTATGGCGGACGGAGAATATC 
TGAAAAAGTGGCGGCGGCACGATATTTTCAA 
GGACATTTTGAACGGGCTGTTTAAAGGCGGGGTGTC 

GCGAACGGGTTTGCGTCCCGCCATACCGACCCGCTTTACCTCGACCGTACCAATAC 
GCGGCTCGGCAAAGCCGTGGCACAGGGACTGC^^ 
TCCCGAAGAATGGCGCGGCAAACTTGCCGTCCCGCCGACAAAGOT 
TTACGCAAGATTTAT 
SEQ ID 240 

LPPICGGGGGNIRFIDVNPEDFAGFPLNIRHISITTYARIiKIiGEYIAIX: 

VERQEGYKQKIGMADGEYYFI^GVIilNLKKWRRHDIF^ 

ANGFASIOm)PLYIiDRTNXAMPVAVSHYCGSAKPWHRI^ 

LRKIY 
SEQ ID 241

ATGATTTTCGCAGACCCGCCTTACTTTCTTTCCAATGACGG 

ACTGGGATAAATCCAAAGGAATGGCGGCAGATTTG^ 

(XXSCACAATTTGGGTTTGCGGCACATTTCA^ 

AATATTACTTGGGAAAAACCC^TTCTCCCCCTAAT 

AGAAAAATAAAAAAGCCAAACATACGTTTCATTATGAAATGATGAAA 

ATTCGCACCTCCAAATAAAACCGAAAAAACATTCGGCAAACATC 

TCGGCTTCAAATATCGGAGATTOAATTTTTGAC 

GATTTTGCGGTTGCGAACTGGAAGAAGATTTTTTGAAir 
SEQ ID 242 

MIFADP PYFLSNIXSFSCQNC^MVSVNKGNWDKS 

NITWEKPNPPPNLSCRFFTHSTETILWAKKNKKA^ 

SASNIGDLIFDPFMGSGTTGVAALKHGRRFCGCELEEDFLN 
SEQ ID 243 

ATGAAATACACAGCCGCACTCCTOACTTTTCTCT^ 

AAATGCCGGAGGCGCAATTGAACGGTTCAAACGCCGTTT 

CCAAGTCCG05TGCCGGACGCGTACAGGGCTTCATTTGCGCGGG 

GTACTGGAAAAAGCGGAAAGCGGGGGTGTGACGTACTAC^ 

ATGTCGTGAATATGAAAAACGACAAAGGCTACGTCAAAGTTTACC 

GGTTGCCGATTTGAATAAGTTTTACCCT 

SEQ ID 244 

MKYTAALLTFLLTAOtfNPNDAFFQNR^^ 

VLEKAE SGGVTYYTCDAEKGFNGTRFVIYVVNMKNDKGYVKVYR^ 
SEQ ID 245 

ATGGAAAGCAACCTCCAGAAAACCGAGCGTTTGA^ 

CCGGCAAAGTGCGGAAAATCAATTTGAGCCAC^CGCTCAAAAATO 

CGAAGCAGTTAACGATTATTTAGCCTTTGCCGAT^ 

GATAAAATCAACCGCTATCTGGCACGCGGC^^ 

GGCCGTCTGAAACAGTCGCCGGGATTTTAGAAAAACTO 

TGCCGGCGGCTATATCGGCATACACATTTATTTCAAAAACAGC^ 

AAAGATGCGAAAGCCAATATTGAAAACCATATGGCGTACAA^

SEQ ID 246

MESNLQKTERLIREINRI^QYSQDYFETGKVRKINLSHTIiKNVPTEHI 

DKINRYIiARGTQYPTNNILNDIFGARLITO 

KDAKANIENHMAYKRNFVR 
SEQ ID 247 

ATGCGTAACACCGTAGGATTGGACATATCCAAGCTGACATT 
TTGACAACGATTCAAAAGGTTTAGATO^GTTOT 
AACCGGCAGTTATTATGAAGAAGTTGCCGACTACTTC 
TATGCAGAAAGCAGGTTCAAGCGAACCAAAACAGACAAACAGGATGCA 
GCGAGCTTGTAAAGAGGCAGAAGCCTACGGACGAGCAATACA 
ATGCGCGGCAATGAAAAACCGTCATCACGCGGCAAAAGATGAAGAAGCGGCCAA^ 
GAAGAGCTTGAAGTTTTAAAGGAGAAGATAAAAGAGCAG 
TACCGGCAATAGGCAGAATGACCGCAGCCGTATTGTTTCATCATCTAAC^ 
ATTCGCAGGCTTAAGCCCGCAACAAAAAGAATCCGGGACA^ 
TTACGCGCCGTCTTGTTTATGCCGGCCATGGTC 
AGAAGCCTAAAAAAGTCATGATCGCAGCATTGATGC^ 
CGATCCATCGCGTTACAAATCGGCG 
SEQ ID 248 

MRNTVGLDISKLTFDATAMVGKTEHSAKFDN^ 

YAESRFKRTKTDKQDAKLIAQYCRS AQE SELVKRQKPTDEQYRL SRMTAAYAQIKSEC AAMKNRHHAAKDEEAAKAYAE I IKAMN 
EQLEVLKEKIKEQTEKPNCKEGVKRIiET I PAIGRMTAAVLFHHLTSSKFETSNKFAAFAGL SPQQKE SGTSVRGKGKLTKFGNRK 
LRAVIiFMPAMVAYRIRAFPDFIKRT.KEKKKPKKVI I AALMRKLAVIAYHVHKKGGDYDPSRYKSA 
SEQ ID 249 
ATGTTGCGAACCTACTTGAACCTATTGACC^ 
TGTCACAAACCGAGCCGAAAATCGCCCAAAATGTTTTAl^ 
ATTAAGTGCGCTGGCTGATGATGGCGACCACAAAGTTTTTTTC 
AAACGAGCCAAAGCCAATCTGAAAAACTTTATCGTATTGTCTGAAG 
ACX:CCATCTTGGTCGTGATTAAAAAGAATGCCCGTC 
CTATCCTTTGGTTATTGTGGACGATGAAGCCGA^^ 

ATCAATAAACTGCTAAACGACATCAAAAATTCCTGTTGTCAAAGCCTGTTT^ 

TACAACATGAAGAATCTGATTGGCAACCTGAATTTATC 

TTCTGATCCACGAAGCTATATAGTTCGTTTTATTGATAGCGAAT 

GCAAAACAGGCATTGCTTAGTTTTCTAATAACCTGTGCAGAATTCGC 

CAAGCTATAAAATTCAAGATCATCAAGCC^^ 

GGAAGACCTTGCAGGTAGCTTTAAAGAAAGTTATTTGGATTl^ 

GAAAAATTGACAGCACTTTTAGAGAATAAACAAATC1CTACTC 

AAGGTTTCAACATCATTATTGGTGGTAATGTGATTGGTCGTG 
tgccaaaaaaccgaatgcagatactttctggc^^

attccgttcgatctctactacttctttgt 

atattcaggttatttatccgaaaaatatcaatcgaact 

tgtgaattattttccactccacccaaacgaagataatttatcgga^ 

caatctgatttatatcaaatagatatagaagattta 

aggaaaaatttattgctggtgtagaagcattaaaag^ 

actttctcgagcaaccggcacaatgctttc^ 

tatcaagtagtaggaaataaagagaaaggttggcagggaaaa 

tgtaccaaagtgcaaaa 

SEQ ID 250 

MUlTYIiNQLTPPELADS^/KNTVDGFMEIO.SQTEP 

KRAKANLKNFr^SEADDRSFMEVMKAENPILVVIK^ 

INKLIiNDIKNSCCQSIiFIQLTATPQSI^ 

AKQALLSFLITCAEFALCDKANCNFAIiHPSYKIQDHQAFSKKIQAFLOT 
EKLTALLENKQISTLVVNSQTET^^ 

IPFDVYYFFVQIiNQANNIillGQAKNSGGNIQVIYPKNINPTRKT^ 

QSDbYQIDIEDLIOiVIiDKIjGRYVPDDWNKEKFIA^ 

YQWGNKDKGWQGKDFWLPNIKLPHNGLVYQSAK 

SEQ ID 251 

ATCAATACAOTTTTTTCCAATATC 

CGGATGAAGTGTTGATGGCGACAGGTTATCTATC^ 

TCAAAAAATAGATTTATTGGTCGGAATGCATTATTTGGAAGGTT^ 

TTTTTGCGACATCAGAAAAGAGGAGCCGTCTATGOTTC 

AAAAAATC^TGGATTAATTGGTTCTGOT 

GAACGGTAAGCCTGCACAGATTTTGCAGGCAGATATTCAAA^ 

CCAAGTAAATTTATTGAACATAATAGCCATTTAGAAAATTC 

TTGCC CAAACGTCAGAATATCATTTCTCAATTCCAGCTAAAAC 

ACGAGATAAACGAGGGTTTGTCAAACC^CGACCTTGGTATGAAGT 

TATCCGGTTCTGAAGTCTTTTACAGTTATTACCGA^ 

TCCGCTCAGAAAATGATTTAAAGACACTTGGTAAATGGATTAAAGGTTO 

GATTACGCATGAAACCCTACGCGAATACGGTAACGA^ 

TTTAAGGGGAAAAAC 

SEQ ID 252 

MOTVFSNIANAKITEKSIjNAVWMDIjFK^ 

FLRHEKRGAVYVSPFVKFHGKMYSFKNYQKINGIiIGSANLTCFWDS 
PSKFIEHNSHLENCXGVQKIAPEQIRQLFAQTSEYHFSIPAKTEEKSNLNW 
YFVLKSFTVTTDIXSWQFQCKTSGDYSKNFRSEN^ 
FKGKN 
SEQ ID 253 

ATGACAACAGAAAATAACGCTTTTGAAAACGCG/UVAC^ 
AAATCT1TOAATATTCCGAATGGCGCAATTTCCAACGGGC 
AAATCATCATTTTGTTGAAACCAACAAAATGATAGCCCTTGGGA^ 
TATGCCTGCTACCTGATTGTGCAAAACGGCGATCCGTC 



ACACAATACCGACCTTGCCGCCGCAGCCAAGGAIKK^G 

CGGGGGCTATACGGAGGACTCGACAAGCAGGGCATACACAGCCGCAAAGGCTTG 
ACGCCAGCGAACCGGCTGCCAATCTGTTTCGCGCXIACGCAAA^ 
GGCAAACCGGGTGCATTTCGAAGTCGGACAAAAAGTGCGC 
GTACCCGAGAAAAGCATCAAACAGCTTGAGAACGAAGAACAGAAAA 

SEQ ID 254 

MTTENNAFENAKHIDETGNEYWSART]^ 

YACYLIVQITCDPSKSVXAAGQTYFAVQARRQEIjQDBAAFRSMBDKQ 
RGLYGGIJDKQGIHSRKGIaKKSQRILDHMNASEPAANLFRATQTEE^ 
VPEKSIKQIiENEEQKRIjAATBQHQNGKK 
SEQ ID 255 

ATGAAAGATTTATTGTCGTCCCCATCATTCCTGCTGTTTTC 
TCGCACTGCGTAACCGAACCTTAAGCCGCCGtGCCCAAACCGTTATGGAAGCCGC 
CCCCTATTTTGTATCCGACAAGCCGCAC 
ACCGTATTGATAGGGGTCGGCT03TCAGGCATCTC 
SEQ ID 256 

MKDLIjSSPSFLLFSCMIAVTCPTRLIGFFALK^^ sviapyfvsdkpheliaialtafaacrfsmlf 
TVLIGVGSSGISGWLMA 
SEQ ID 257 

ATGGATAAAGAGAAAGTCTTAGAGAAAATCAAGAAATGTTTGG^ 

TAAGGCAGGCGC21GGCTTTAATGGAAAAGTACAAM 

TCGGAAAATGGCTTTTAAATTGGCAGGCTGGC^ 

CGCGGGAAAACAATGATGTTTTACGGCATTGGAAATCGAGCGGAAACCTC 
CCGCCGACCGCCGCAAATOTTTGAAAACCTGCCGGGCGGGAAAGCCC^ 
TTGGATTGCGAGCGCTTGGGAAACAGTCAAAAAATTTGAGATC

GAATATCCGGATATGGCCGAAGCAAGGACAAGAGATGC^ 

CCCGAGGAATGGAATCGGGTAAGCAAGTGAAGTTAC^ 

A 

SEQ ID 258 

MDKEKVIjDKIKKCLAIjGRSVNEHEAAQALRQAQ 
RGKTMMFYGIGNRAETSAYAFDVVYRQISA^

EYPDMAEARTRDAKSSILQGSKMEYEALTRGMESGKQVKLHYAW 

SEQ ID 259

ATGGAAGCAAATAAATTTGAAGTGAAAAGTTTGTCAGACCTO 
TCGGTGTAAAACGCGCOSACATTTCAACCGAATTTGACGA 
AGGTATCAACGACTTTGCCATCGGCAAGGAAGTCATCATCCGCACTTATTC 
GCAGGCAATGAAGTGATTCTGAO^AAAGCGCGCC^ 
GACACGGCATCAGGCAAGACGGCAGCCAAATTTGCGGCGA^ 
AGGCGGCGCGGCTGAATCAATCCX3CACCGCGCTGGAGGTCGCC 
SEQ ID 260

MEANKFEVKSL SDL IKVFAG I AADFEAAMGVKRADI STEFDEPQHE PQPPVTOAKQKGINDFAIGKEVI IRTYSAGVWFGVLKQK 
AGNE\TCI.TKARRMYSWWAKESISLSGWARH^^ 
SEQID261 

ATGATTCACTTAAATCTCCACTATTGCGATGAAATTATC 
ATATATGCCATATTCCGACACXGCAGGCCATTCTAGGCCGCCTC 
GACTATGAAAACCGGTCAAATTTTTCTGGAAATCGTC^ 
AGTGATACAGAAGAAAACGTGTCATTCATGCTAC^ 
GAATGACAACACACAACCATATTCTCTCCGAGAGCC 
SEQ ID 262

MIHLNVHYCDEIIRHAENDKYSLIGIFPDICHIPTPQAILGRI^^ 
SOTEENVSFMLHQTISGLPVSDNDRIYVRMl'THinilLSESRPLSFSWLPy^ 
SEQ ID 263 

ATGGAGCTGACCGTCCACTTTAACGCCGAGCAGGATTTAGACCGCCTCOT 
ATGTCATTGCGATGATTCAGGCGGACTCTGCTATTTTTC 
AGGCCCGATTGACTTGGAAGTGAAACCCATATTGTCATTATGGGG^ 
GAAGCCGCCGGATACAGAATAATTTACGCACCATGC 
CTGTTGTAAACAAGAAGAGAGACGAATTTGACTATC 
GTACTCCAAC 
SEQ ID 264

MELTVHFNAEQDLDRLFEKDEEAVGYLEWIAMIQADSAIFDGLYKNRYFREYGE 
EAAGYRI IYAPCHEKQPNGTYIRRIDILAVVNKKTDEFDYQAEHPITKRI IKDYEELYSN 
SEQ ID 265 

ATGAATCAACAAGAATTTGAATTTATGAACGACTT^ 

GTTTCAGTATCGAAGGCCGTTATATGGGGGAGGAAGCTTGC^ 

GTTTCTTACTTTACAGGCATGGGCGA^ 

CAGGCAGCCTACGAGGAACGTTTGCAGGCTGAAATCAGGCGGCGCGGCGAGGGG^ 
TCGCGTGGTATAGGGCGATACGGCAGGCGGGAGGGAAT 
SEQ ID 266 

MNQQEFEFMNDLARAFERRYRDTRSLNRCFS IEGRYMGEEAC PHKPE IGLRYGEDAMFLTLQAWAKVDAPQQEAVRI SFGIGAKS 
QAAYEERIjQAEIRRRGEGPIiHI^TDIiGIiAAWYRAIR 
SEQ ID 267 

ATGAGTAAAGCTATTAACAAGTTAAATGAAAAGGAAAAGGAAAAAATACX^ 
TCTTGGAAAATCCGGAAC^GGGATTTACTCC 

TATGGGCGAGGTGGTTACTGAGAGGGTCAGAACCTATACAGGGACACGCT 
GGGAAATCCGTAGCAAATGAAAAACAGAAGAATGTTGCGTAC^ 
ATATGAGAAGTATAGATTTGCCGCTGGATAAAATAAACAGCGTGGAATC 
AAACAAACCGATTTTCTTTAATACCGTm 

SEQ ID 268 

l^KAINKLNEKEKEKIRLLFAQREKFYRILE^ 

GKSVANEKQKNVAYGELVIjTNFRLIFVGNMRS IDLPIiDKINSVECFQS S IRI SQSGKNKPIFFNTVFNPQLWKEAILiVLSDKK 
SEQ ID 269 

atgagcggcacgaaacgcaaattaggccgtccgacagattac^ 

gcagaagcctacgtt<^tatgcgccga^ 

tcgccaccaatacgcgcgcgcgagagaaaagcagg 

gagagcgcggcggtttcaaaggcgaaattgcagatagatgcccgaaagtgg 

acaaatcggagcttgacgttaaatcgggcgatgggagcatgagggcggctc 

GGAAGATGTGTTGCGTAGGGTT 
SEQ ID 270 

MSGTKRKLiGRPTDYTiGDMADKICEKIANGRSLRS ICAEDGVPPMKT IYRWLEANEEFRHQYARAREKQADYFAEEI IEIADSAQA 
ESAAVSKAKLQXDARKWAASKIAPKKYGDKSEliDVKSGDGSMR 
SEQ ID 271

ATGGGCAATGCGGTTTTCCCCGAGTTC 

CXXXIAAGCGGCCGTGAGTTCCGAACCGCCTACTACACCT^ 

AGCGTCCGTAAACGAGTTGGAAAAACTGGCGGGATTCTT^ 

GCCGACAACGCCGTAACCGACCAGCCTGTCGGAAACACGGTGCAGGGCG 

TTATCGAGCCTGTGTCGGCGGTCAAGGAACGGCCCGCCGTCAAGGTC 

TACCGACAAGGGCGTTTTGGTTTTCAACACGCCGCAACCGCCG^ 

AGGTTTACGTCTGACACGGTGGATTTTGAAAACGTOT 

TA 

SEQ ID 272 

MGI^VFPEFPGLKWGRKKTAVWSTGTQKSASGREFRTAYYTYPQWRF SLSFEVLRTKASVNELEKIjAGFFNARKG SFESFLYEDP 
ADNAVTDQPVGNOTQGVARYQLVRSMGG 
RFTSOTVDFENVIjGSLWAAKKIEFTSVKIi 
SEQ ID 273 

ATGCCGTCCGAAACGGATCTGAGGGCGCGGATCGTCGAAGAGGCGCGGTC 
TK^GGGCGCGGGTGTTGATTGCGCGATGCTCCTGGTCGCCGTC 
CCCTTACCCTCAAGATTGGCATCTGCACCGCGATTCCGA 
CCGCAGGCGGGCGACATTGCAGTATGGCGTTTCGGGCGGTCGTT^ 
ACAGCTACATCGGGCGCGGCGTGGTGTCGGACGACATCGGCCAAGCCGLAACOT 
ATTT 
SEQ ID 274 

MPSETDLRARIVEEARSWIXSTPYHHHAMVKGAGVDCAMIi^ 

PQAGDIAVWRFGRSFSHGGILAGGGKVIHSYIGRGWSDDIGQAELIGRGVRFF^ 
SEQ ID 275 

ATGCCTCGTTTCCCCCGCACTTTACCCCGC^ 

ATCACACC(^TTGGGGCTATACCGGACA(X3ACTC 

CAAAAACCAATCTCCGGTAAACATTACCGAAACCGTTTC 

GACGTGGAAAACAACGGCGACACGATTCAGGTCAATTATC 

TGAAACAGTTCCACTTCCACGTGCCX3AGCGAAAACCAAATCAAAGGCC 

CGAAAACAAACAGCCTTTAGTATTAGCCGTGCTGTATC^ 

CCGATGACCGCAGGAAAAGTGAAACTCAACCAACCGTTCGAGGCATC 

(XGGTTCGCTGACCACGCCGCCGTGCACAGAGGGCGTATCATGGTTGGTC 

GGAAAAATTCACCCGCGCCGTCGGTTCGGAAAACAACCGCCCCGTACAGCCTCT 
SEQ ID 276 

MPRFPRTLPRMAVBLLACTAFSAAAHGNHTH^ 

DVENIK3HTIQVNYPEGGOTLTVNGRTYT 

PMTAGKVKLNQPFDASTLLPKRLKYYRFAGSLTTPPCT 
SEQ ID 277 

TTGAAGACTGATACTGCCAGAATGAACAATCTGATTCCCGAACATTTAG^ 
GCGGGCATCGTTGCTTTTCATTATCCTGCCAAGGTAGAGAT^ 
TACCGATACTGATAAGGCGCCGGTAAAAATTGTGGCGGTAGAAG 
CATGGCTATAACGCTATGTTCTGCGACAAATATAGCCA^ 
ACCGAGTTCCGATTCATCTTTATTACAATATAGACTAC^ 
AATTGATGGCCGCATCATTAGTTTTGACTCATTAAAACGAAATGGC^^ 
TCTGTTCGTGAATTATTGAATGAAGAATTAAGC 
SEQ ID 278 

LKTOTARMNNLI PEHLAAYAHSDNLQIEGGHRCF SLSCQGRDTFHIRYYGEPF3X3LITDTDKAPVKIVAVEAVSGDE IVLFDGAE 
HGYNAMFCDKYSQNQKQNRTLTDLDEYTYRVPIHLYYNIDYEDEYEDFW 
SVRELLNEELS 
SEQ ID 279 
ATGCTCGAGAATCTGCCCTATTTCCAGC^ 
CCGATGTCGCACTGATTTCGCAAATCGGTACATATA^ 
GGGTAAGGCGGTCGGTTATGATGACGAGAAACTGTATT^ 
GACGATGTCGTCGATGAAAGCGATTTGCGCCGTG^ 
GCGACTTTTTATATACCCGTGCATTTCAACTGATGGTTGCCTC 
CATCATTGCCGAGGGAGAGGTCATGCAGCTGATGAA^ 
TATAAAACGGCAAAATTGTTTGAAGCTGCCGCTCAAG 
AAGACTACGGTATGTACGTCGGTACGGCATTCCAGATTATTGACGATCTGCT 
AAACGTCGGCGACGATTTGGCGGAAGGAAAACCG^^ 
GATGTGCGTACTGCTTTGGAAAATGCAGATCGC 

ATTCGATAGGCGAGGCGCGCAAAGC^GTCGATTGTGCCGTTGCCGCGTT^ 
GATTCAGCTGGCGAAGGAATCTTTGGTCAGGGTGTCG 
SEQ ID 280 

LMVQRVLSVNDKAFVTADLDYELPQAYYVDDPKAPPVEISAPVB^ 
WGIALTILLLVFLFQDWIVRYEKWn^ 

TMLLWNRGTFCGWLC PFGSLQELTNRIAKKIX5VKQITVPHMLHTRLNVTKYL ILFGFLAI SLYDLGTAEKFAEVEPFKTAI IIiKF 
MGDWWFVAFAVALLIAGLFIERFFCRYLC PLGAG IALPGRFRVFDWIiRRYKMCGNPCQ ICTKECFVQAIAPEGD IHPNEC IQCLH 
CQVMYHHDTRCPQWAENKKKQKQAAAK^^ 
SEQ ID 281

ATGAAAAACTTGAAAAATATTTCCGTTGTTGCTGTC 

ATGCTATCGGCGACGATTCGGCCGTAATCAAGGCTG^ 

CCGCCGTCAACGTAAAAATGTTTCTGAGGAGTTGGCTTTGGA^ 

ATGGGTACGGCTGCCGGTGGTCTGATGCTGCTGAACGGTC 
SEQ ID 282

MKmiKNISVVAVCAVLLAACASENSVA^ 

MGTAAGGLMLIiNGVVGTVGVMKSVF 
SEQ ID 283

ATGAAAAAAGCACTGCTTGCACTGACTATTGCCGC(^TCTCCGGT 

AATATAC CGTC CGTACAGACATCTCCAAACAAAC GCTGAAAAATGCCG ATTTGAAAGAAAAACACAAAGTACA 
TTTCCGTGCCGATATGCCGTTTGACGATATTCACCACGG 
GTTGTGACCGAAAGTACTACTAAACCATTCGGCAAAGACGTTGAAGA 
AACCGATCAGTGAAGCCACCAAACTGCGTGCCGGTTO 
AAGAGTCAGCACCGAGCGCGAAGCCTTCTATACCAAAGCG 
CCTTGGGCAGAGGTTAAGGTTGACCTGGACGCCAAATTGAAAC^ 
AAACACGCGGATGGGGCGTGGGCGTAGGTGCGAATATCGGCAAACAAATTA 
GAAACACCGTCACTTCAAAGCATCCGGCAGCTTCGTT 
GGTGTTCGTGTCGGCGTGAAATTC 
SEQ ID 284 

MKKALLALTIAAISGTAMAQLPDFLGKGEYTVRTDISKQTLK^^ 
VVTESTTKPFGKDVEEKRTDVYAGYTYTQPI^ 
PWAEVKVDLDAKLKHNATVAGVSADINAKTRGWGVGVG 
GVRVGVKF 
SEQ ID 285 

ATGTTCAAACGACCTGAAGAAATCATCGTCCTGATACTTGCCGTCCTC 
GAGCGGATGCTTATACCGTGTTAAAAATGACTGCCCTTACCCTGCT 
TCAGCCGGCATATCTGGCCGCTGCTGCTCGGCTGCC 
TTTACCTTGGCATGCATCATGGACGTTCAAAACCATCCT^ 
SEQ ID 286 

MFKRPEEI rVLILAVLWIAGTYFLAALFGADAYTVLKITALTLLWSAASF^ 

FTLAC IMDVQNHL SPDSRNRRLSV
SEQ ID 287

GTOATGATTTTGCTGGACACGAATGTGATTTCCGAACCTTTGCC^ 
TGATATTGGAAGATGTGTATTTGTCTGCC^^ 
GAATCTGCTGCACGAACGTTTGGAACAATCCATTTTGCCTTO 
ATCTATGCGCAAATTCGTTCCTATGCCAAAACACATGGCAAAGAGATTC 
AGCACAGTTTGACAGTTGCTACGCGTGATACCGGCTCATTTTT^ 
SEQ ID 288 

VMILLmWlSEPIiRPQPNERWAWIiDSLILEDVYLSAITVA^ 
IYAQIRSYAKTHGKEIAAAIXSyiAATAKQHSLTVATRDTGSFFAAD^ 
SEQ ID 289 

TTGAAAAAGCCGTGGTGCCAAGGGGGGAGGGGATTCAOT 
ATATTGCTACAGCAGTAGATTTGTTGGTAATAC 
TATCGGACAATATTTAGGTTCTGTAATTTTAATATT^ 
GTGTTGGGTTTATTAGGTTTAATACCGATTTACTTAGGTATTAAAGTTG 
AAAAAGAATTGGATGAAAAAGGGTTGTCAAAATTAGTCGGTATTC 
TGGACTTTTTGTTCCOTACTTTGTGACTTTAGATCTTGTCGAC 
TTAGTATATACAGCACAAAGATTCGCTAATATTTCAGGTGTTGGTGAAATTC 
TTTATATTGGTTTAGGGTTATTTATTATTATTGAAAATAATACAATTC 
SEQ ID 290 

LKKPWCQGGRGFIMRCFMPSWITAAVLYIATAVDLLVILIjIFFARAOT 
VLGLLGLIPim/GIKVAIYDrc 

LVYTAQRLANISGVGEIVEKFSRWIMAVIYIGLGLFIIIENNTIRTIISII
SEQ ID 291 

GCCCGCGTGAATGTGCTTTGATTCACAACGAACAG^ 
GGTGGATTCGGACGAGCCGGAAACGTGCAAGGGCGTGCGCTTTC 
GGCATATTGGATTTGGTGGAAGTGGATAGAAAAACAGGC 
CGGGGGATGAAATCCAGCTTTGCGCCCAAGGCTTGTGCT 
TATGCAAACCCGCCACCGTGTCX:CCGTCGTGTTTTCAGACGGCOT 
SEQ ID 292 

MTALLTETQRENQOTRLIPLSALQHYAFCPRQCALIHNEQAWAENYLTAQGK^ 
GILDLVEVDTKTGRIiKPVEYKRGKPKPDPGDE^^ 
SEQ ID 293 

ATGAGTGATATTGATGATTTGAAAAGCTATTTAGAAAGC^ 

TTCAAAATCAATTATCCAATGTCATTTCTTC 

TTCTTGAGCTCTAGCC^GATTGGCCGAAAAAATTAC^ 
GCATTGCTTGCGGAACATATTCAAATAGTTTGTG^
CAGATGAAATTCAGCAAAATCCCAAAGCAATT 

SEQ ID 294 

MSDIDDFKSYLESHQAAFSAWGRFVAEEIQNQ^^ 
ALLAEHIQIVCEIIESSSQWNAKVSKDFADEIQQNFKAI
SEQ ID 295 

ATGAAAACTGTTCCAACCATCCTGACCGGCATCCTTT^ 

CCCTGGCTATGGACGAGCTGCCGCCGATTTGCCAACi^ 

AGATTTCGC^CGCAACAACACGAAATTCCTATTCCAAGCCC^^ 

GCTATGGATTCGTTTGC^GAAAAAAACCaSCJA 
SEQ ID 296 

MKTVMILTGILLATALPASAHGMHKS 

AMDSFAEKNPQSELRIKPHQQ 
SEQ ID 297 

TTGCTCAACCCTCTTCTCACCCTTCTCATTCTCATAAATC 

GCCGCATAACGCTTCTCGCGCTCTCCCGCA^ 

CCGTCAAGCCCGTATCCGGGTCGGCAACCTC7Va?AACTGACAGTC^ 

AAACTTCCCGTTAAGCGCATCAATCTGCCGCGCAATCTC^ 

TCGGCGATGCGCTTCCTATCCTCAAGCTCCAGC^ 

CCTGCTGCACATCAAACGCCCTGTCCXSCATTCC^ 

AAACCGCCCC^CTGCAACACCGCCTCACGGCT^ 

ACCGCCGCATTTTTATTATGTTCCGC^ 

CATAAGGCGTAGTCGCCCTCTTATAAAGCTC^ 

GTO^AAATCACGCACCGGCTTGCCGCC^ 

GGGCGCGTATCGCGCCGCGCGAAATCCGCCCCGGCCCGCGCATCGGGGGAAAGCGCCOT 
C&TCCGCACCCCGTTCGAAAGCACOX^CCG^ 
CCGATAACGCGCCGGCCCGGCCTCATAATTCGC^ 
CGCTCCTCCXSCCCCCTTCCGGGCCTTCTCTTTTTCCTCCTT^ 
CCATCGC^AACCCTCCGAACAAATAAAGTCCAAAATAAGCAAAAACCCA 
AATCGCAACX:CCGGACCAATTAACAAACAACAAGTT^^ 
CGCCTTTGTGCAAGGCGGTTGCATTTCAGAAGAG 
SEQ ID 298 

LLNPLLTLLILIITOFTAGRFLTVRIjKAQRRITLIAL 
KLPVKRINLPRNLPALPVRLRKTISPRQIGDALPII^KLQ 

KP PLIrQHRLTVRRILRRSRRQPF PAQFPDRRIFXMFRHNPARRIKLCRRQLTVQGPRIRRSRPL IKLPLIjRRQRIRPGRHQRTIiR 
VKITHRLAAPIHIPVKSQRRRRPSARIRRARXAPREIRPGPRIGG 
PITRRPGLIIRNGTPHRIGTVRLCRRPLPIiRPLPG^^ 
NRNPGPINKQQVTQKlOn'IXSPFPGANKKPPLCKAVAFQNSLGC^ 
SEQ ID 299 

TTGGGCATCTATGGAGC^TGGTGTTTTATGGGAOT 

AESTTATACTGGGCGGsAGOGTCCAAGAAAGTATCT 

GGTTTTCTTATACAAAGGCAGGTTGGGGTATTTGAAGTTAAAAATAOT 

TTTGTTTATGGGAAACACGCTTCTTCAATAGGCCATATCAATAGGG^ 

CACATGCTGAAAATATGAAAAATAGAGGGAAGTlVrAAAAATAATACTA 

GAAATGGCAAGCTAGGATTATGGTTAATAGAAAAAATAAAATATTAGG 

GAGGCAGCATCTAAAGATTTTGGCTTTGTAGTG 

SEQ ID 300 

LGIYGAWCFMGLTQEVLKELLRYDDNTGKL^

FVYGKHASSIGHINRDKTDNRISNLRDVTHAENMKNRGKFKNNTSGHTGVYFHK^
EAASKDFGFW 
SEQ ID 301 

ATGAACGGGGCGGAATTTAGACTGACGCCCCAAAAGAAAAAGCAGGCT 
AAAA03GCAACCTTGAAATCACAATCCGCCCGCGCAAGT 
TGAAATTTCAGAAAAAGTTTTTATCGATGGGAGAAGGTTTAGT^ 
TGTATTGAAATGCCTAACGGGCAATTAATCGGTATATC 
TTATATCTTGGGCATCTATGGAGCATGGTGTTTTATGGGAT 
SEQ ID 302 

MNGAEFTIjTPQNKKQVMRSIWDSPDGWFENGNLEITIRPRKSKRSVEQNRRLWFLYREISEKVFIDGRRFSQDVWHEFIiKRKFIG
CIEMPNGQLMGISTTKLSVREMSEYQEKIISWASMEHGVLWD
SEQ ID 303 

ATGCCGTCCCGCGCAAGGAAGGGGGGTCTATTATTACAAATCGTGCAAAACC 

GCGGCGGGAGCCGGCGCGATGACGGCGGCAAGGCTGCACGGATACATCCGCGCCGCGCA 

GCCTGTGGACGCAACCGGCAGGGGAATGD3CGTGA 

TGGCGCAACAGGGCGGTCAGGAGCGCGGAGGCGGCGGCGTATAAGGAAACCGTTC 

CGTCCGAAGGCGCGGTTGCCGTATATGTGCGGCTCATACCCAAAGCGAACAAAGAG 

GGACAACGCCCTGAAGGTTACTCTGGACGCGCTTCAA 

TATGGCGGCGAACCGGTAACGGGCGGCGGTTTGGCGGTGGAGGTGGGGGAATT^ 
GTTGGGATTTCATCGGAC^GGAGGGTTGGGATGTC 
SEQ ID 304

MPSRARKGGLLLQWQNLPQQG^^ 
WRNRAWSAEAAAYKETVRRIAQGAGAMPSEGAV^^ 
YGGEPVTGGGLAVEVGELEMEQTDAADEGWDFIGQEGWDV
SEQ ID 305

ATGCTGAACGCGTACGACGTGGCAGATTTCTTCCTTTC^ 
AAAAACTCCTGTATTACGCACAAGGCTACGCCCTTGC 
CGGTCCGGTAGTCCCCTGCATTTACCGCACCTACAAAAAATACGGCGGGAG^ 
TATGCGGACGAAGAGTTGGTTGTCCTCAACCGTGTCCGTAAAGAGCA 
ATCAGGAAGCGCCGTGGATACAGACCCGGCAGGGCGAAGTCATAGGGATTGCGCT 
GCAGACGGATTACAATTTCAATCTTGAAAAACTCAAAAC^ 
GACGACTTGGAAAAATGGTTGGAGCAG 
SEQ ID 306 

MLNAYDVADFFLSPFEEEDGEQISNLKLQKLLYYAQGYALAILNRPLFAENIEHWQHGPVVPCIYRTYKKYGGSPLPAAHIEPDK
YADEELVVLNRVRKEQGCYTAWALRNKTHQ 
DDLEKWLEQ 
SEQ ID 307 

TTGTCGGATGAAGGCAGGCATAAACCTACTTCCGCCCCAAACCGCGTTTTO 

GGGATTTTTTAAATATCCGCCCGATTTTTAC^ 

OTTAACX3GACTCCGTGCGTAATATTGAAGAAATCGGTATC 

CAATACGATACGGAGGCATTGCTTAAACTCGGGCAJ^ 

GGGAAAGTTACCGGCACCTGCTGTCCGGTTTGGACAGCCCCAAGCCGTTTC 

GATTTCGGAAAACTTACTGGAAAAAGGCTC 

TTATCCAACCCGC^AAGTTTGGATACGGAATTGAAATGGT^ 

TTTATCTCCATAATAATTTGGCGTATC 

GATGCGCTCCGGTTTTTTTCCTTGCGTATTCTC 

GGCGATTACGGTTTATTCAAGAAATATTTTATTTCGGCATACGAAAAT 

TTTTCCGAAATTTCTCCATT 

SEQ ID 308

LSDEGRKKPTSAPNRVFMMKGMDKLRYQRDFLN^ 
QYDTQALLKLGQTAGGKLYSDAVMLI^ 
LSNPQSLOTEIiKWLLQEAPKIENPFDRAVYIaHNNIA 
GOTGLFKKYFISAYENTVM^GPQP0VDIFR3KIFSI 
SEQ ID 309 

TTGAACAAAAATGGGAACACTGACATGGCTTTACTGAGCATCCGCAAGCTG^ 
CCTTAGACTTGGACTTGGAAAAAGGCGAAGTCATCGTAC 
CAACGGTTTGGAGCCGCACCAAGGCGGCAGCATCGTGATGGACGGTGTCG 
CGGCAAAAAGTCGGTATGGTCTTTCAAAGCTACGAACT^ 

AGGTACAAAACCGCGACCGTGCCGAAGCAGAGGCGCAAGCCGGCAAACTGTTGGM 

CTATCCGCGCGAACTTTCCGGCGGTCAGAAACAGCGCATCGCCATTGTC 

GACGAAATCACCGCCGCACTTGACCCCGAAATGGTGCGCGAAGTCTTC 

TGCTCATCGTAACCCACGAAATGGGGTTCGCACGCAAAGTTGC 

GTCCGACCCCGAAACCTTTTTTTCCGCAC 

SEQ ID 310 

MALLSIRKLHKQYGSWAIQSLDLDLEKGEVIVLLGPSGCGKSTLLRCrVNGLEPHQGGS 

QS YEIjFAHMWI ENI LIX5PVKVQNRDRAEAEAQAGKIjLERVGL ITAALD 
PEMVREVLEWLELAREGMSMLIV^ 
SEQ ID 311 

ATGACGAAGTTATACGCGCAAATCGCCAAGACGGAGGCGCAGGACGACG 

CGGTCGATTCGGACGGCGAAGTCGTCGCGGCGGAAGCAATGAAGGC 

AATGCACGGCTCAAACGCGGCGGGAACGGCGATTGAAATCAACGTGGAAGACGACGGGCG 

GACCCTGTTGCGGTGACK3AAAGTCAAGACAGGCK3TTTAGAAAGGCT 

ACAAGTCGCAAATCACGGGCTTGAAGCTGACGGAAATCAGCCTTGTCG^ 

TAAGGCGGAGAAACCGAAAGATGAGGCAGGGGCGGCGGATAAAGACGGCAAGCCGTCTC 

AATCCTAAAGACGGCGATAAAGGGCCTAAAACCGAAGATAAAGGCGATAAGGACGCCGGCAAAAAGGACG 

CAAGCGTGAATTTGTCCGAATCTGAAATCGCCGCC 

CAAATCAATGTACGAAGTCAAATCACCGGCTGATC 

GATGAAGCCGTTACCGCGCAAATCAAAGAATCGGCGGCCGGCCTTGCCGAATCG 
AGCCGGCCGACGGTTTGGCGGCCAAAGCCGGCAAATCAG 

AGACGCGCTGAAAAAATCGAATGACGCCCTTGCCAAAGCACAGGCGGAAATCGA^ 

GGCAGTACGAAAGCTATCAGTAAGGCAGAAGATAACGGCGAAGACCCGTTAAAAGGTTTTCAGC 

CGCTGGATGACGTGGCGACACTCATTAAGGCAAAACAAACAG^ 

SEQ ID 312 

MTKLYAQIAKTEAQDDGTVKVWGYASSEATO 

DPVAVTKVKTGVYKGFS IGGSVTAROTLNKSQ ITGLIOj SDKPTEEEDE 
NPmSDKGPKTEDKGDKDAGKKDEAGKSASVN^ 

DEAOTAQIKESAAGLAESLKALAASEADKPADGLAAIOVGKSGDLAK^ 
GSTKAISKAEDNGEDPLKGFQPIVKNDGTLDDVATLIKAKQTGRL






SEQ ID 313 

TTGACAATCAAATCGGGGG<^ATAGCTC 

AATATTCAAAAAACCGTATTTGTTCGGATAAACTG^ 

TAAAAAGTTATCCGGTTATCTTCCJUWSC^ 

TATTTCAAAAAAATCTGGCAGGCAGACTGCACGGGCGGA^ 

ATCCAAAATATCTGTATCAGGTAOTGGCCGA 

TGGCAGCAAAGCAGCGATTATGCAATATAAAATCCCCATCCCCCCGCTCCCCGAACA 
TTCGATACCCTGACCCACTCCGTCAGCGAAGGCCTGCCGCACGAAATCGCCC^ 
AGCTGCTTGCCTTCCCCAAGGCTGCC 
SEQ ID 314 

LTIKSGGIADGYQCIUjKNVVWKTIjGEVAEYSKNRICSDKLNEH^ 

YLKKIWQADCTGGTNGDVLVIRVTDEKVNPKJfLY 

FDTLTHSVSEGLPHEIALRRKQYEYYREQLIAFPKAA 

SEQ ID 315 

ATGCTGCCGCCCAACTGCGGAACGGCAACCCCTTATATATTGCGTAACGCCGTAGGA^ 

CAACGGCCATTGTCGGCAATGCCGAATAT^ 

AAGCTTGGGATGTCAGAATCTGCATATCTGCATGGAGGCAACCGGCAAOT 
TACAGCGTTTACGTAGTGAATCCGCTGAAAATAAGCAAGTATGCAGAAA 
CAAAACTGATAGCGCAGTATTGCCGGTC^ 
TTCACGGATGACCGCAGCATACGCGGAAATGAA 

GCGGCCAAAGCATATGCGGAAATCATCAAAGCCATGAATCAACAGCTT^ 

AGCCTAACTGCAAGGAAGGCGTGAAGCGTCTTGAAACCATACCGGCAATAGGCA 

AACATCTTCGAAATTTGAAACATCAAACAAATTTGC^ 

AGGGGAAAAGGCAAACTGACC^GTTTGGCAACAGGAAATTACGCGCCGTC 

GGGCATTTCCCGACTTCATCAAAAGGCTGGAAGAAAAGAAGAAGCCTAAAAA^ 

CX3TTATTGCGTATCACGTACATAAGAAAGGCGGAGATTACGATCCATC 

SEQ ID 316 

MLPPNCGTATPYILRNAVGIiDISKLTFDATAIVGNAEYSAKFDNDSKGLDQFSDRLKSLGCQNI^
YSVYWNPLKISKYAESRFKRTKTDKQDAK^ 

AAKAYAEI IKAMNEQLiE^KEKIKEQTEKPNCKEGVKRLETI PAIGFMTAAVLFHHLTSSKFETSNECFAAFAGLSPQQKESGTSV' 
RGKGKLTKFGNRKLRAVLFMPAMVAYRIRAFPDFIKRLEEKKKPKKVIIAALMRKLAVXAYHVHKKGGDYDPSRYKSA

SEQ ID 317

TTGGAGAAGCGTAGC(UAGACATCCGCGTTTATATGGATTAT^ 

AAGAMTCGGCAAAGAGAGCAAAJVTCOTGAAG^ 

AGGTTTTGCCGCCGTGAAGGAAGACGTGGCGGGGCTGGA^^ 

GGCGCGGGCG6TA(XX^GCGGAAGACTGGGCGGGGATGCTGTT^ 

CTCGTTTGCCTCCGTGTTCGTTTACC^^ 
AAAACCmCTCCGCXmCGTATTACCCAO^ 
Alt^TTCGAAATGCGCAAACGCMCGMGAGAAACAGGtt 
GCGTACAGGCTAOGAAGTOGGCAACACQ^^ 

SEQ ID 318

I,EKRSEDIRVYHDYQGKKDMiE^ 
(^GGTEAEDWAGMI^ 
KTDSAVRITHE^ 
SEQ ID 319 

GOGTTCTGTTTGGTC^^ 
TTTTGOTTCTGTTCCAOTTCC^ 

SEQ ID 320 

VSDAHKTGGKPGSVSDSHSS^ 

SEQ ID 321 

GTGTTTTTATATTCOT'TGATGCAG 

GGCGGGTATTTAAA 

SEQ ID 322 

VFIiYSVIftAIitSRTKeU^ 

SEQ ID 323 

ATGCGCCGATACCGTCTGAACCCTTCAGACGGCATT^ 

TAAATACCCGCOTGATXCCGTCTGTTO 

ATATAAAAACACAGCC 

SEQ ID 324 

MRRYRI^SDGIFPHNETCOT^^ 

SEQ ID 325 

ATGTCCCGGtt'iraXXXX^ 
TGCTGATTTACCTCTACTACACCGCCGACA^ 

TTCATCGCATTGGGCA6C6G(XM3CGT6AA^ 
imTATCG«XKXTrCTTAGGCCCOT 
GOJACGTAAAAACCTGCCCCACXXXy^

CTIKnCAA(XTCGACMTTO 

TCGCCTACATCCCGCTTTTCCT6ACCATCTG 

ACC(X3TC£(XT(X3AAAGATC 

GCXXJIATTTGTTACCGGCGC^^ 

TTTCCO^TTGCGCTCTCCATCT^ 

AftAAGGCTAT(^GGCGGGC6A(X3AAATOG^^ 

GAC 

SEQ ID 326

MSRHPARTGEKTFFGHPFQLSTL^ 

FIAI£SG<7yKSTASSMVGALYEQDOT 

LmraffSGILLSTVII^^ 

AVFVTGASFLGFVPFISSGTPMPIAWAIJVLOT 

D 

SEQ ID 327

GTGCGGGC^GGATGCCGGGACATAAAGGTTCTOT 

TGTCAAiTCGCTTATCCX^TAGGGCGbYmu'ACT 
SEQ ID 328 

VRAGCRDIKVLLKDFLSIPSETDADGGGVFCMKISVFNRYFHSLSNDAIiPFFAQGCFYICFNSMDCILEDVFRYGAGKSFSVNGLSDRAVFT

SEQ ID 329 

TflaOroCGAGAGA^ 

GCGGCAAAGGTATATTGACGGCGTTGGCAACGGGGCGGTC^^
TGCGCTGCTGCACGGTAAAACTGTGCGCGAAGTAC^ 

AAAACGAAATHXXSCT^^ 
CGTGGTTGAA(X!ATTGGGT!^^ 
GCGAAAGAAGCGGCGAAATATGTTTGCCCCGGCGT^ 
SEQ ID 330 

LSARENMMNPKIVFFDIDDTLYRKYTOTLRPSTO 
AVSALSECVCRALQHIASDFFADKOT 

AKEAAKYVCPGVDHXjVIiRGLQDUjVI j 

SEQ ID 331

ATGTCCGAC^GCAGACGCGATTTCGGGGCGGCGGGTTO 
(&GAAGGAATGGTTTGTTGTTTAAG 

SEQ ID 332 

MSDKQTRFRGGGFDGEKECAAQJEVSAITRHRIFWGWGVGEGHVCCLRFNFVGPDVSKRHV^

SEQ ID 333 

ATGACCGATACCACACCGAC^GATACCGATCC^ 

CACTGCTGTCTGTOTtjATTTKX^^ 

AMCCTCAAAGGCACACTGCTCGACGGCTT^ 

CGCAGCCTGCAWTCACCGACATCTOGCCGGC^ 

TCTATCTCGACCGCTTCGAGACXXmAM 

CCTGAAGGCCGCCGACACGCCGTGGAGCAGTTC 

CACAGTACGGCXXX3GCTGAGCGGCAGCCTGAAGGATC 

AMCATTGGMGAAGTACTGGTCAAAGGATTCAACAT^^ 

CGCGCT(^(X3eKX5CTC^ 

GGCAATACGTCXXXXX3CCCTGCT0GGA 

TGCAAACCGCGTTCAAAGGC^GGTTGGACGGCAGCATCGG^ 

CGCAAGC^CCCOX^AAC(^(^GCG^^ 

CTGGACATCCGTTCCXXKXKATTCGACOTTW 

TGCGTTTTTTGCCCGGTACGTOM 

AACAGACGGCGGOTCGGCAAAAAAGGCGACCGGOT 

GATTTGGACGGCGGCATCCGAACCTTTGAAACCG^ 

CAAGCOmX»5^C^^ 

ACA(XKX!GCCATGACGCTGGACGGCAAACCGTTCAM 

TTCAACCTCMGCTGCAAAACCGTATGACGCT^ 

GGAAAACCGGCATATCXMCAAAAGGCGGCGCACGCGGCCT^ 

OXKXymCGCGCGC(mACCTC^ 

ATCGGMTCCTGCTI^CGGCGGCGCGCGTTTC^ 

TTCCCGACTT(K3GCGCATT^^ 

CAACGGTAGCAGCMCTACGGGAAAATCAACGGCAATATCA(XXHCG 

C(mCTTCCTACCOyr^ 

GCAACCAAACCCAAGGCATCATCTTGGAC^^ 

©rKA{OTGGAAMCAGC(?KXXX 

CCGCAAAAAGGCATATCCGl^ACCGGTATGATTAAM 

AGGcaxxmTC(K?rccca^^ 

GGGC6GAAATGTQCGTGGG6TGGQCACGGTCCGC6TCA^^ 




AACCTGAACATCCGCGCCGAACG(XX3C(^^ 

TaOTGGGCTmCtt^ 
GGCAICTCWGCGCXjGAACAGTCCG 
ACCGOTCTTOGGT^^ 
SEQ ID 334 

RSiaiTDISACffllAIVTKPTPPKEERPPQG^ 
HSTARLSGSLKDVR^^ 

I£IRSRAFDPSRIDPQFPAG*TCNGSIH^ 

DIJEGIRTFBTDLSCTARN^ 

FNLKLQNRHTLEAGA^^ 

RNFLPVGQmCBI^^ 
PQKGISVTGMIKrD(K3LFGS^ 
HUORAERRLSPV^GVEIlCSIiNSPRITW 
GISSABQSmiYRLmiQAVARIGSRSSGGE^^ 

SEQ ID 335 

GTGTGGCGGATTPCGATTCTAACACTOTO 
TOTITrCCAATCCGCATATGGAAAAC^ 

SEQ ID 336 

VWRISnmiFGBG(miQTORI»YF(X^ 

SEQ ID 337 

(?ICTA(^TC(^TO(mCACCG^ 

TTTCeGAAAACAAGGOitfS^^ 
TATGGTCGAAGAACACCTGCXX£TCATCAC^ 
GGCTATTTCAGCAGCMGGTCAGCCTGACGGAAAAAGAC^^ 
CAtoCGGCAArcTCGOm^ 

(y^^CCGGCACACAGCGTTACCCCGAACAAAOT 

TTCGGAATACGGTTTGGGCGGCAAAATCGOT 

AGCCAGCCGCGCAACTATCGGGGCAACTACTGG 

CGGGCA!KX3ATGCCAGGCFGGGGGCGGMT^ 

GCTCAACAACGTGCTGCACCCCGAAAACGGCC^^ 

TOACGCCCGAAMCAAAAAACTCGGCMGTTCATCATAK 

CCGTGCGCGGTTACX3AACTTCA(^GCATCGGGCTT^ 

(XXXXX^GGTATPCCACGATATGGGC^ 

GOTACGGAC^CAGCGACAAAAAAATCCGCTGGC^CA^ 

SEQ ID 338 

VLESKSATPMHOTRTMMIKPTAL^ 

GTFSSKVSLTEKIX^YTVHITPGPRTKIAW 

EIT^RYPEQTVSGUU^^ 

AYGHSDKKIEWHISLGTBP 

SEQ ID 339 

TTGCGCCGAATGATAAAACAAAGCCTGTCCC^ 

SEQ ID 340 

LRWaRQSLSIimANQRTSKHSLKNKK^ 

SEQ ID 341

CAGGOTGTTCQGCAGOTGTTO 
GATTATCCTCCTTOCCTCATCX^ 
TCCGGCATTGTGGAAGTGTTGAAATCGOT 
TGCGGAATCACGGTTCGGACGTTACGCG^ 

T(K!ACGAAGiOOTATTCGATTTC^ 
TTTWXX!ACCGCGCTGCTGT!^ 
GAT(HCGCCATGCAGGTGGTCGCCGTCG^ 
AGAGAAACCGTGCGGAA 

SEQ ID 342 

MAFGKSLFHAI(SVSLVRQIi^LAIX3IVIGSVSPQM 

SGIVEVLKSUmVANPIKAXAKAmGIl^ 
KRNPYFLVFTCLRESGVYAFFTRSSAAM 
SEQ ID 343

ATGCCGTCTGAAGCCCOTTC 
CGCCCTATGAAGGAGATGATGATGTT 

SEQ ID 344 

HPSEARIAFRRHiraLWAY^^ 

SEQ ID 345 

ATt^GGAGAmTGATGmAAACGCA^ 

AACCGGCCGCCCCCGTTGTTGCTGAAMTGC^^ 

AGGCAGCCAAGATATGGCGGCAGTTTCX3GCAGAAAATACAGGCM 

GMTCCGCAAATCAMCAGGGAACAACCAACCCGC^^ 

TTGTCATTGACG6ACCGTCGCAAM 

TGAAGAAAAAATTAAGCGATATAAAAAAGACGAGCAAC(^ 

CCTACTCCTTCFG^ 

ATATCTTCGGGCCCGAAGGGAATTACCGGTATCT^^ 
HXXBTCTACAACOX^ 

AGCGGCGATGATTTGCATATGGGTACGCAAAAATTCAAAGCC^ 
C0?GCGAG(^GTGGC(KX3AAAATACAGCT 
SEQ ID 346 

ME4MMFKRSVIAMACIFPLSACGG^^ 
ESANQTGNNQPAGSSDSAPASNPAPANGGSDrc 
PTRSARSBRSLPAEIPLIFVHQADfriilVDG 
SGDKiHliGTQKFKAAI^ 
SEQ ID 347
TTGATAAATCOTATAAATTT^^ 

SEQ ID 348

LINPYRFPGKIDREHEKADNPPIFKHPSDGICSNAV 

SEQ ID 349 

TTGATGTOATGCCGCCCGMGC<K^GGTATra 
TC03CG<X»TTCC<^^ 

SEQ ID 350

ItfLMPmQVFmGRRVQIIKQAPEK^ 

SEQ ID 351

TTGCACCGCCXXX^TTCGGAAAGGCX&TGCC^^ 
TTGAAACACAGCCGTTAAAAAAACAGGCCGTC 
SEQ ID 352 

I^DSERRCLSmPEPCRPEERGSHHYLPAAACMRSVETQPLKKQAV 

SEQ ID 353 

GTGCAACGGGGTGCGCCTTATGTGCCGAAAGGGTT^^ 

TOATAAACCCGTTGTCTTTCAm 
CTTCGAGAAAAGAAAACCAAAGTTGCACAATCA<^ 

SEQ ID 354 

VQRGAPYVPKGFAAWCRLKRVRSIXjIW 

SEQ ID 355
KaCftCTOftClA
AGTATTTTTCTGAAAAGAAAAT^^ 

SEQ ID 356 

VUtLIVQLWSFLEAELFFKITLT^ 

SEQ ID 357 

ATCAAAGAATACAAAGTCC^TTTATa 
GGCGGGTCXjTAACGATGGAAAAAGATTTGCGC^ 

SEQ ID 358 

MKEYKVVIYQBSIiLSSLFFGAAKVNPVICFSAFIiN^ 

SEQ ID 359 

TTGTTTGTOAGGMCGC(K?rGAAATO 
C«mt3AM(XCCGCCmTGGGGC(^ 

SEQ ID 360 

LFVEERABIDGVDI£RA1&QAG^ 

SEQ ID 361 

ATGCGGCGAAATTTATOGOTATTGCCCAACT^ 

TCXX&GGCCX^QGCCGTTGAT^ 
GATTTGGCTTCGATGATGCGGT^^ 
CAACCGGCACGCCTTCTTCAAAGCGCACGC^^ 
(^GTOTTGGCTTCGTGO^^ 
CCGAGTTCGTCGATAMTTGCTGGT^ 
CCCAGATATTCACAMTC^^ 
' TTGGATGGCGGCGATGCCTTCXnKSTG^ 
CG&GGTTOOGTGTAGGCATAA^
G 

SEQ ID 362 

SEAQAVDAVLF1X3VFVV^ 

— IGDAAGGHVffiAA 



LDGGDAFVCQLCA^^ 

SEQ ID 363 

ATC^CCAAMCCATACTATTTTACAAAACCTCC^ 

CmTGCCTACACTGCCAAOT^ 

^CACGAAGGCATCGCCGCCAIK^^ 

AAAGAAGACGATGTGAATATCTGGGGCGA(X5GCAGC^ 

ACCAGCAATTTATCGACGAACTCGGC(X^^ 

(^CTTTGAAGMGGCGTGCCGGTTGCA'n^ 
AAAACCGCATCA!!KX3AAQCCAAATCGC^ 
CXXXATCAACamtXTC^^ 
ACOT^6MCT(X^GCGCXjGCAACGAOT 
ACCGCATCGGACAGCPCACGATGCGCAAC^^ 
CAATAAGCAA 
SEQ ID 364 

IffEEGVFVALlX>KEYADFVEL^^ 
TLEliRRGNDYSILNTESPl^TyQPERLSMEKVEGAAra 

SEQ ID 365 



ACGMTOGCCAAACTC(&COTOT 
CGAC£T<^<XX£CT<a^^ 
Aj(XACCAACTACTC<^ 
TCGAAGTCTTCAITCC^^ 

CAAAGTCCACAACGCCACGGAAAGCGAGGCAGGCGATGC(^^ 
CATCOTACGCmCAAAGTTTTCCTC 

CGCAGGCCGTCCGTTTGATTACAGCCTGTTGAGC^^ 
CATTCCACCGCCAAACAAAGCCAATACGTO^ 
CGAGCGGCAACCAAACCATCAAATOGAAC^ 
CATCGCACGCGTGTTGAACC3CC 

SEQ ID 366 

MKAPELLLPAGGLERMRAAYDYGADAVYAGSPRYSLRARNNEFAK^^ 

TTNYWGVKFWQNIGVERIILSREI/3MEEIAEIRQECPDIELOT 

HPYADKVMEESNRPGEMMPI^ 

HSTAKQSQYVGHVTEIDm^ATIBVK!mPAVGDSIiH 

SEQ ID 367 

TTOTCGGACATATCGGOTCGCGGGCTGC^ 
ACGATGCGGATTATTCXXn'ACTCCGGCGGCT^ 

SEQ ID 368 

I^ISASRMYlTOXQYPFmAVCWNH^ 

SEQ ID 369 

ATG(HOTGOTCATGAAAAAATCOT 

TT 

SEQ ID 370 

mVHffilLDSLKNQSHYLINOTCQPSCRI^SAV^^ 

SEQ ID 371 

ATGAACGOGTTCAACCTCATCATCATO 

MTATCTGCCCGACGAACCCGATT^ 
OXIAGCGGCTGCTTTGGATOGCCCGTC^^ 
ATGGCGGATTTCCCCGAGGGCGCAGAGCTTGTGCCCAAC^^ 
AATGGGTTTTGGAAACTTACTATGCCGGCCX^^ 

TTGGACG0GGC(3TGG(KIGGAAGTGTTGCACAG 

SEQ ID 372 

MNAFNIiIIIGDKII]HGSRQDKHFAFFKSIiI£SKGIiKLNQVQ^ 

MADFPBGABbVPl^PimAGFSIREHYFFPGFP^^ 
LDAMAEVLHSU3GIiGAK1jKNRVN 

SEQ ID 373 

ASKmTAOXiAAiraGAAGAGW 
AGW6ATCTGGAAAAACAICTGGCGCG0CGT 
ACTCGTCAACCTCTACG^^

AGCCCCGGCGGTTOCCCGTCGTOTC^ 

ACTACATCGCGGCGGC(XXXjG^^ 

CAGGGTTAAMTOCGOX^GCMCAMCXO 

GTCAAAACCMGADGCGGCGGCOSCCTCAAAT^^ 

ACAGCGTGGCGtXaAfiACGTGGTCAAAGCTrcC^ 

TCTGCA6GC0GTCCGG 

SEQ ID 374 

MQYRIRRENEAPEAKNAGBTLWERDI^ 

SPGGSPVVSNTAFEEIRRLKAQHPGIPVY^^ 

VKTGRC&RLKFRQYPIWSGKVY^^ 

SEQ ID 375 

ATGCACAAAACCTCCCG<XKjACAAC^^ 
.CrGGTATTCCCTTAGAAATACTO " 

SEQ ID 376 

MHKTSRGQPAFCGFKCTVHSl^ 

SEQ ID 377 

ATGTTTGA!TGGCGAAGTAGAA<^ 
•nTCCTOCCGTCCGTTATGT 
6GCTTCCTG<XXaramX^^ 
AAC 

SEQ ID 378 

MFDGEVEADBSYFGGQRRGKCGRGAAGKVAVFIUiIiKRN^ 
K 

SEQ ID 379

GTGAAACtfK&CAGCAT^^ 

mATG(^TTGOGAACTTTTGGM 

GATAAAAGTTOTGTTCCATTT 

SEQ ID 380

VKPDSIVYTIX^RRYmDM | 

SEQ ID 381 

ATGATOTTCAAAATGAATTTAATITATATCCT 
ATMTTTIOCTGGTGGTTTGAGAATTATGGMttGM 
AGOTATTTTGATGGTAATGATGTAACAGGAMTCCT^ 
GATACTTGG 

SEQ ID 382 

MIIQNEFNLYPSNMbPEGFCYPEKyWIS^^ 
DTt? 

SEQ ID 383 

ATGTCCACOKCGCCGCTGT^ 

GTMTGGTATTGTTGATGATGCCTAGAGTTTGAOC^^ 

GGTGCAAC(XTrCGCGCCGTTCCCAC^ 

GT6AAGGCAAAGGTCA6GAAGCTGGCGGTTACGAGCAQ6C6CA 

CTGCCCAC6<X^GGTATAGCCCM6TTGGTC^ 

TTTAAACA6CGATAAAKIGACAATC6GATATTTTTOT 

AGTTCCTPACCOTGTCCAGCATCATCTC^AAC 

AT6TMTO^TGCO^TATGATCCC6AT6GGGATAW 

(OTGCCCACAATGCCAGT^^ 

AGCGACTGAAGGTTGGGCGCGATGCCGCACAGCCACGATGC^ 

AAACGGCGTTTGCCACGGAAAMGAA6TGATGACCXA 

AAATACGGCMGCCCCmGACAG^ 

SEQ ID 384 

MSTAAAVH^RQPAN(MQIH)KQNAAG 
RQCDGGQ^EEHGNAypLPKLRPIMADIHVGIKISAP 
FKQR'IIMIFFSQFPNMIRQAHQRGYAQGW 
H(nWC(^PCGRIGCH*QAL^ 
KYGKPQRQRNPR^TLKRWIIHCVLRFRRH

SEQ ID 385 

TTGGCAG(£TGCCAGCAT^ 
AAAAO^TTCmCGGC^^ 
SEQ ID 386 

SEQ ID 387 

ATGAAGATAACACACTGCAAATTi^GAAAGAACTACAGAAAG^ 
OCTCTTTTACC^ 

ACGCGGTC6CG(XMCGGCAfi6AAAAGCGGTTG^ 
AAAATCATGOCGGACGGTATTGTTTATGCCGATAGOCC^^ 
ACATTAACGGCATTGG(MCTTTTG(^TCA

cacACCGTCcam^ 

SEQ ID 388 

MKITOOOiKKEVQKEPLRSmEWARSAADIIXJI 
KIMPIXSIVYADSPGSRGKLDAGGFTRCRINRSKKPAD^^ 

SEQ ID 389 

ATGAATmttCAACCTGTTMTTTTW 

AAAATCAAATTATTGMTATCAAMTCGTGGATTATGGGGG^ 

TCTAAAAGAAAAAITACAAAATTCAGATTTO 

SEQ ID 390 

MNEHNLLIFCIiKnNVSIS 

SEQ ID 391 

GTGGGCATCMTGCCMTCCTAACTGTGCTGATGAAG^ 
CGGTTMTCCTTTTTTAACGG^^ 
AAMTTAAGTCCGGAAGCACAACTTGOTGTCG(^ 
ACAGCCCAAACTGCCCTTGCCGTA^ 
(XCGACCTATGCAGACTGTTGACGGGGAAATGGCAGGTC 
TTCAGCAAGTTTTGATTCAGTGCACAAAAC^^ 
AACGAAAACAACTATTTTAGAATCCATGA 
AAACTCATATCAGGAACTTAGACAAA 
SEQ ID 392 

VGIKANPNCADEAGKLIWENDPDKNWWANI^ 

TAQTALAVAEAAGTVWRGKKVELNPTKWDWVK^^ 

NiaMFRIHDNSRKQYIiDSl^^ 

SEQ ID 393 

TTGAC^GCAAAAATAAMTTATTATGMTATAAT 
GTGTTTCCATAGATGCAGMGMTTTGAGATO 

SEQ ID 394 

LTSKNKIIMNIINYTGDDIIISLTREELQLIiRSLVIKIYAGVC 

SEQ ID 395

ATGATTAAATTGGATTTAAAAAGCATAMTTTATAT^TAT^ 

AAGGCTCAra'AmTTTCACTTTC 

TAAAAGT(^TTGATAAMTATT^ 

SEQ ID 396 

MIKLDLKSINLYDIDFEKFTPEI^ 

SEQ ID 397 

GTGCAGATGGCACAAGGCTATATCX3TCCTCCAOT 

SEQ ID 398 

VQMAQGYIVlJQLQljQTPIiNHLTQ^^ 

SEQ ID 399 

TTGTOGGATACAAATAAATATCC(&CCTA(^^ 

GTGCCTTTGOSGTAAAAGACGGCAT^ 

CGGTAAAAAAGTAGAACTTAACCCGGCCAMTGGGATTGG(^A 

CCTAAATCTATAACGTCXsAAAGGAAMGCTAATGCTGCAAC^ 

TAAAAGATTGGAAMCAATACAACCAAATAAAAAGGGGGAAATTAATTT^^ 

AGTTAGTAGTCCATCTTffimGGATm 

TTTGTTACTAGAAGTGTTGATGGAAAAACCTTAAOTAATG^ 

SEQ ID 400 

LSDTNKYPmSAQQTUJGIHDIiGNLSPA^ 

EKSITSKGKMIAAWPK^^ 

FVTRSVDGKTLTNGHLKIK 

SEQ ID 401 

TTGTATCCX3ACAAAAATATTTGACGCGCOT 

SEQ ID 402 

LYPTKIFDAPIVSSIPIiLENVGF^lTK 

SEQ ID 403 

ATGAATATACTATOATAAATAATCAAAACT^ 
TTGAAMTGTATCTGGTGTTAGAAAACATGAAGTAGATAATO 

SEQ ID 404 

MNILSINNQHSTISM$DE^^ 

SEQ ID 405 

GTGGGCATCMTGCCAATCCTAACTGTGCGGATGAAG^ 

CGGTTMTCCTTTTTTAACGGGTTTT^ 

AAAATIMGTCCGGAAGCACMCWGCTGCCCK^ 

ACT«mAACTGCCCTTGCC(^^ 

TTCCXACCATGCATACTTTG^ 

AGAACAAA!ITAGTAGTGGGCATGCTTATAACAAGCATGTCA 
OCAACAAATATGAMGMHTAOt^^
GTAAAAAATATTATGATGATFEA 

SEQ ID 406

VGINANPNCADEAGKLIHEDD^ 
. TAQTAUVAmGTVWGGKKVEL^ 
PTNMKELPRGRTAYWD^^ 

SEQ ID 407 

GTGTOTCTTCATATTOX^(^ 
CAGACGGCATTTTGATCATCAA6 

SEQ ID 408 

VLSSYFEOTSAKIVQTOIKL^^ 

SEQ ID 409 

GGG^CGGATTOCCCGAGGATTTCATCGAlTO 

TCCACTTTAACCGTCATO 

(XXKCGGMTCAGCGAAAACGCGCTG^^ 

ATGCCGCAACTGCGTCTCTIT]^ 

TGACmmXXXBCOXX^^ 

OXX3TQCCGAA<&TTC^^ 

(XXXXOTAAATCATTC^^ 

TTTGC(XJGCTCGTAAGCCAGCACATCATO 

TAAOTGMCCACGCGTCCGGCAACATMGCGT^^ 

CAGGCGGCGTTTCCGTTTGACTTGCGTG^ 

SEQ ID 410 

GRGtfEDFIDRPIDFRPVOTF^ 
IJPQl&LFIttGIHPKIRQBHHRHQ 
M*IILRARQIRTAQGI^FRIiRS^^ 
QAAPAPDIiRVLKFXTVCVRFHKIGFRIS*^ 

SEQ ID 411

<n^CATGATTCnXGGCCGGTCTCCC^^ 

GATGTCCTTGTrTG^^ jj 
SEQ ID 412 

LHDSPAGYPM&VCAFLSARPD^ jj 

SEQ ID 413 

ATGAMTTMTTGATTTTGAAGGGAAmAGTTAAAAT^ 

TTGAGATAATTCATGC^rTTGAAAA 

SEQ ID 414 

MKLIDFEGNIjVKISU)KDBLniQAIVGEnSGVCVlXSDF 

SEQ ID 415 

ATGAGATTAAAATTAAAAAATTATGATTGTTCTGACTTT^ 

GTGCTC&TTATTmATGTT^^ 
ATTGATAAAl^TATATW&AAAT^^ 

SEQ ID 416 

MRLKLKNYTCSDFDI^FPQDKLH^ 

SEQ ID 417 

ATtmCTCGACAATATCGGCaCCAACAGAAAA^^ 
TGGACATT 

SEQ ID 418 

MALDNI^QQKKNHNIiQIiQVPKQIIiKLIIiM 

SEQ ID 419 

TTGCTC^yiATCXXX^TTAff^ 
TCAAAAATCGGGTTTGCGTO 
ACACffl&CACGAAGTACACAOTCC^^ 
CAT(XXXjCAGACGGATATGACGGG(X^CAAGG 

GOTIAACCCTATGGATGATATTCC^^ 

ACAGQCGCTCACXAGACOT 

CCGCCAGACAATGG6CT6ATGCCCATCC(Mra 

ATGGGATTGGGTTAAAAATACCXjGCTATAA 

MTGCTGCAACCTAKX^AAGTTCGTTAAT^^ 

GAACTGCMOTATGAAGAGGCAGATAGACTAGCTAAMT^^ 

AGAAAAAAMTCACAATTTGCAACTACAGGTATT^^ 

SEQ ID 420 

LLQIPISHMWLDARLRIH^ 

HPAIX^WPQGGGYPEPQGARDIYSYHlKGTSTKTKIinVPQAP 

TAAQQTLQGINDLGNLSPEAQL^^ 

mnPKLV^LNEQNlMIAAQDPRLSLAI^ 

SEQ ID 421 

TfGATCraCACTGCCATAACC^^ 
SEQ ID 422

LIKTMTACCQGKQNBF^SKIHDAVCSGNrVAANPfi 

SEQ ID 423 

TGGCCGOTCTGCCAGA(£TGCC^^ 

AraOTCTTTATCTACAOT 
TGACGACCAACCCGC^CGACA(n , GCCTO 
CGTOTCGGAACGATAC(mCA(^ 
CCCAAAACCMTGCGm(^GCTGOT 
AACCATACGGCAATCATACGGGTMCTCCGCCCCATC^ 
SEQ ID 424 

MRARLLIPILFSVFILSACOTLTGIPSHGGGW
TT&SO^PAI^TQSK^SRSSL^ 
PKTNAFmYKENYALWl^PYiCVSKGIKPTEG 
SEQ ID 425 
TTGAT6ATTGACXATA66G1XXjGCAGATATAGTCGCA 

SEQ ID 426 

IMIHRVGRYSRIKASDSHbLyGDYSGLNLQF 

SEQ ID 427 

TTGGAATCAGGGAITCCXGTCAGCTTCGGCG^ 

COmTCAATGCCGCCCGCAAA 

GGCAGGTTCGTTOACGTOCGCGCGGTQ 

SEQ ID 428 

LESGIPVSFGGGVPVANGRNLVTRQAAIWIGXS^ 

SEQ ID 429 

ATGGTCAATCATC^TCAGCAG^ 
AGGGCAAAGGC^WXKXTO^TGAC^^ 
CGTTGGTGGCTnEAGCATCACGTCGCAGTT^ 
CAMTATTGAATGOTTO 
ATCACGGTCGCCATCATGCCCATGTAGTCGG 
TCACGATTTCG<X^TTTCGCCGA(^ 
SEQ ID 430 
MVNHQSAVDQCAFVFAGNDAFERALrc^ 
QILNGFRTGVSFSDLIiH^ 

SEQ ID 431 

ATGACACAGCAAATCAAATACAAACGCGTATTACT^^ 
AAATCGTGAAAATGGGTGTGC^GTCG 
&ACCGTGATGMCGCCTTGGCACTC 
CMTATTTGGAAGAA(MCAAAGTCGTGATT™ 
C<aCmCGTCGAC(mTGTAC^^ 
" CTTTGCCCTCTGCCGCGMCGC^ 

SEQ ID 432 

MTQQIKYKRVLLKLSGESIJ&SDPFGI 
QYLSEQCVVIPAAGKNPFFTTOAAAIfi 

SEQ ID 433 

TTGCTCTGTCATGGQATAOCTTGOT 
ACGGCCTTTATATTCGGTCGTCT 
SEQ ID 434 

LLCHGIPCFLKCRRSELGRI^KWS^ 

SEQ ID 435 

GTGGAAAATCCTTTCCGTATmmAAATOT 
OCTTGGTTGAAGCTGMGGCAACTTCGACAM^ 
CAACGGTMTGTCGGCGCAT^ 
TCTGTTGMGAACTGAGCGAACTGGTTGAA5^ 
• ACATCCACGGCGCATTGGCGACCGAAGGCGTTTTGG^^ 
AGTAGATGOTGAAACCGTOAAAAAGAACGCCAC^ 
GAAATCACTCTGAACGGCCAAGCAS^^ 
AJ^OXGCTGTCGACTACGCAGCCGAACT 

SEQ ID 436 

VENPFRISQKSRRFKMAEITAKMVADLIl^
SVEELSELVESERKMIARLGENMSVRRFQVIOT 
EITIilGQAFVMNPDQfPVAQPAK^ 
SEQ ID 437 

ATGTCTCAMTTACTATGCGTC^ 
TAAAOTGGAAAAAACC^ 
CGAAGAAGCGACCttXXHXXX^ATOCCT^^ 
GAAMTGCTGCCGAAAGCGGmCAGCAA^ 
SEQ ID 438

MSQimCMlEAG^FGHQTR^ 
ENAAESGFSKKHLE^RDVEKI^ 
SEQ ID 439 
TTGGGCAGCGOTACG(nTIOT 
TTCACGCCGTCTGGGCTGTTGT^^ 
GGCCTTTCATGTTTTTGA^ 
TKnTCCAGGCGTTTGATGGATTGCTTM 
GCCTGGCGTTTGGTACCTACGAACAATACCCT 
TGCGCGC^CCGAAAATCH'ATTGAGCCATTTT^^ 
GGGTTAAAGAGCACACATTCAATCGCA 

SEQ ID 440 

LGSGYGFI£RLWAFQ!CT©ASAVQAIX3 

FFQAFDGLLNGFVVGQHTAQPAV^^ 

GLKSTHSIA 

SEQ ID 441 

ATGTGGCACATCSTCGCCATCGOT 
C«nOTCGCCGTMCCGTCCGCC(mm 

SEQ ID 442 

MWHIVAIGYLFVAWSAAQPSIARALIYL^^ 

SEQ ID 443 

TTGTTAACATTCAAACAGtSACGCACACAATGTC 

TTTTGGGTGGTTCTG^^ 

ACAGOjGCACAAAACCCTGMTCC^^ 

SEQ ID 444 

LLTFTCQDAHNVAHRRHRLSF^ - 

SEQ ID 445 

TTGTGTGCGTCCTGT'rreAATGTO 
GGCGGCGTTCGCAMT^^ 
ATTGCGTTTGGGCGGCTTTGTCCGCGCGGCGC^ 

SEQ ID 446 

LCASCUMIKmiATFYRNPERPS 

RERKRGRAKLHVPQFAGRKIR^ 
SEQ ID 447 

AGTCACTATTTTAt&TKAATAT^^ 
SEQ ID 448 

SEQ ID 449

ATGOCGTCTGAACGGCTAAAAACACAATATCCCCGC^^ . 

CGTGCACGCGGATTTTGOGCCCTGCAA^ 
TTCXSATATAAGGCAGATAGAGTTTTGCGCCGCGTI^^ 
OriTOAGCAGGCGffl^TP^ 
GCCTTTCAGGA'ITGCGGTAGAATGTTGO 
CC60GCAGC06AGTATTG06CGCQCTT 
SEQ ID 450 
HPSERLK^YPRFKYKT^ 
FDIRQIEFCAAFLRRADXMQTQFLAHR^ 
PRSBVLRAL 

SEQ ID 451

GTCTTTTTAGCanTCAGACGGCAOT . 
GAACCGTT6GATGTGCATCA 
SEQ ID 452 
VFIAVQTAFHAEMPSH£II£0TO 

SEQ ID 453 

ATGATTTTGCAACAGGTGTTTATCCGTT^^ 

SEQ ID 454

MILQQVPIRFRRADAAYHVNPISKRGICGSK 

SEQ ID 455 

TTGGATGTGCATCATGAA&CGGGTTTGCMCX^^ 
CAACGCGTTTOGCGTTTQCGGTTGTCGATC^ 
AStt&CACGGCGGAAAKX^AAACACOT 
OmXX&TCGAACGGCTG^^
ACAGCXRAAftXTGACC^ 
SEQ ID 456
LDVHHETGI^VCDGETWKL^ 
RAIERLGAEKDGVLRMHMLRKTCSVRDT^

SEQ ID 457 

TTGTCAGCATATCXXX3TTAAGTTTTO 

CTGTACACAACCGTATCGCGCACAC^ 

T(XmaTCCCGCGCAAOm 

OAACGCGGAATCTGCGGTTO^ 

ACCCTATCOGGATGGQCXX^^ 

SEQ ID 458 

LSAYRVKFPKFEmSHLHMPCQPAGQDFPW 

SEQ ID 459 

(TTCAATCTAGTGGATTTTACTG^ 
AAAAATCAAAATTGTTTATA5AT 

SEQ ID 460 

VNWDFTGKCKVFIiSPAKPG^ 

SEQ ID 461 

GTGCTCGGACACACACaTTG^ 

SEQ ID 462 

VLGHTPLPVSPAFMQAFTOQmRAEDACFLGVHBG 

SEQ ID 463 

TGGCGTTGGAATCGTCAAACGCGTGGMCM 

!KXXX^CAGTTTCATGTCAGCCG^ 

TGCCGTTAG^PGCAAAAACGCTATGATGTGTTTAAM 

ACGAAAACCAACCXBTO^^ 

(XACGTCGAA(mTCAi^^ 

(^TOCmCCCTGCTGCAAAAATCCGGCATra 

MGTCTACX^CAGGCTTCCGTCGGCGOXXXXm 

AAAACCAAAGMGAACGCTTTGCCTCCCTGCTGGAATACT 
GCCXXGTGCTGCAGTTTGGTACGGAGATTGTCGCCCACGCC6A 

GTCTTCGATATT 

SEQ ID 464 

MAEATDVVLVGGGIMSATIiGVLLKEI^PSWEITIJER^ 

(RYI^KRYDVFKTQKLFQWEFSM 
GALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAK^ 
KTKERRFASLItfYYPEANP^^ 
VLDI 
SEQ ID 465 
AT(XmACACCGCCGCCC<^ 

SEQ ID 466 

MANTAARTMPSEENYTNTAAYITIAAPWFHILPH 

SEQ ID 467 

A!TCACCAAATraTTTTC^ 

ATCOTATATCMCGTCaTCCCGGCACX^ 

CACGATGACCCGCCGCAA^^ 

ATCAAAC6AD6CATCCAC6MGG0SC6GCG<^^ 

A(^TGCX3ACGCAACAACACCC^^ 

GCAACCa^TCCTGATTT^ 

AICTACGAATGCCCCGAAAT^^ 

AAAACCCGAAACACACCGTCAAAATCGCG^ 

GATTA(X!TTC6TT6ACIAGC6AAAGCATCGAGAAAAA^^ 

GcmmtmrPACGcccffra^ 

AGTTTGACTTGAAATGCGCTGCCCCCGTK^^ 
(£AAGAAGTCGAMTCAAAGCA^ 
GCAG(?PTTCGTCATCGGCGGCGTAT^^ 
(XmGGQWTCOTTCT^ 

SEQ ID 468 

MTKFIFVTGGVVSSLGKGIAAASIAM

ikrrihegaagydvaiveigotgdieslpfleair^ 

IYBCPMfflDQGIMinTEQi^ 

AAVRYARENNVWIiGICLGMQI^ 

AGLVIGGVSAGRERLVETIELPNHPWFTACQFHPEFTSNPRK^ 
SEQ ID 469

ATKXXZPCTGAMCAOCt^ 

GGCCGACATTAGAATGTCAAfl 

TGCTGCTGCOmCGAA^ 

TCCGCCGTCATCAACGGCAMCGTGTGCAM 

MTOTAOCGCaAACAGCCTATTAT^ 
SEQ ID 470 

MPSETPDRPPYPACRPCOTPHRPNPPFYKLYPISCLFQOT 

SAVINGKRV(?iPINLDRSDNMOT 
SEQ ID 471

ATGAAMTATOTAAAATATAGAAGATGTTAAAGTC 

GCCmAGATGCCCAAACCCm^ 

CGAOCTGTATGCCGAGTTGTCCAM 

SEQ ID 472 

MKIFENIEDVKAIRKKTGi^IDFWG^^ 
SEQ ID 473 

ATGCAATATAAAAACAGGCTTGACCCXJGCACATTAC^ 

SEQ ID 474 

l^YKNRLDPAHYENKAKRNLPRNQKKIiKEVL 

SEQ ID 475

TT(HTTACGACTACCAC(?rcMOT 

CCGAO&CAAGCCGCTCAM 

AraCCmTCGACGTAACCCACGCCTO 

OTOTACAAGAATTC^^ 

AAOTATGATCAACCAAGGCAAACGCCACCT^ 

CTACGAAATCCTCACCGTCA^ 

SEQ ID 476

LFTTTTSTSKAAIPPPUffl^ 
VVQEFCEHGIGRGFHEAPQVLH^^ 

SEQ ID 477

GATGCAAIATTCAAAAAACAACGCTTCAGGTGG^ 

SEQ ID 478

DAI FKKQRFRWQN* FTI SFIYIS5ANTAAR^SEENYWPAAOTIAAPWEtLPH 
SEQ ID 479

GTGTTCrcCCGCCmCTTATCTGG^ 

ACAAGTCGGCTTATGTGTTTGATATGTACGGCTTCT 

TTTGTATAGCTACAGCACCAC^ 
SEQ ID 480

VFSRLTYLGAKKAKDAQYTVY^ 

SEQ ID 481

ATGCCGTCTGAACCGCCTTCAGACGGCATT^ 
TCATCTGTATCACCGC(^TCGGC^ 
CTGGCTCO!AAAOGGC60CG1^ 
GCCGCACCCAAGAAAAACCGGGTC^ 
CAAACGCCCKXXATCCC6AACCCCGAAAAGAAACA 
CAOXXXXmAACCCCATAAAGA^ 
SEQ ID 482

MPSEPPSDGIAI^KSTIKHAKKPNKP^ 

SEQ ID 483

GCCGAACCAOXKXXKXmACX^TGACM 

OXXSACMCCGCCACCT^^ 

GCCTCCCTGCCGCGCACTGCGGGCGACA^ 

GCAATGTAACCTACCOTATGS^^ 

C 

SEQ ID 484 

MKPKPlfPVRTIiIAAVLSHAIXX^ 
ASLPRTAGDIAGI7IWOTSKVRAT 

SEQ ID 485 

TTOCCCAGCTC(3CGCATOTWt^ 
CTTTATGGGGTTTQGCCGG^^ 
GGGTTCGGGATGGGGGGCGTTTGTGGGCAGTACGGGGGOT 
ACCCGGTTTTTCTTGK5GTGCGGCAG 
CG&OGGCGCCG^^ 
. GCCGATGGCGGTGATAC^TGAGCAATAOGGCGCCT 
TCTGAAGGOSGTTCAGACG^ 






GGGGTGAGGATGCCCATAACGTAGGTTACATTGCCOT 

TOTCCGAGCAGCAGCAGGTGGCGGTTGTAGCC^^ 

ACOT^TCCXJiTTGCGC^ 

flCAQGGIGCGGflO^^ 



SEGGSIXSIAESALim^ 
LSEQQQVAVVAIBGDLGBVAFam 

TCGOCGCCGTCG^^ 

CGGCATTOCACCTCCGGCAATTCCGCCMCCn^ 

AMGACACCGACGTTCTCCTCAACGTCCC^ 

KimJVLLNVPmmiQENHILLIH^ 

^ATCCGOT 

TTGTCAAAAACGGCGJX&TGA^ 

6TATTATCTGCAACASAACAGGCTGACAAACGTACC3G^^ 
SEQ ID 490

S |SreGS(maTMTCGGmftacCTm 

TCGTTGCCAOKmTC^^ 

CGGO^immmWSTCAGTGTQCamCACM 

(KXOTGTKGACCCGGGCamACTCW^^ 

CCGATTmATTTC&ATOGOTTTGTAC^ 

AACGCTTGCCGATATGOmAWGT^^ 

GCGGACGGCMCCMTCGCGCGGCGAGATGGTCTTGGTGCTTTATC^^ 

CGACCAAfiCAGQCQGCGGAGCTTlXXGCCAAGATTACAGGTGAGGQCAAAA 

AVCDPGAKLAREVREAGFTWVPWGASAVMAALSVAGVAESDFYFNGFV^ 



SEQ ID 493

A^^<m(^<^^ 
ACTATTTCXSTCAAATTCGTCTGCCXICGAATO 
CCT(mMCTCTACCTCTI^^ 
<ITOOCCGCGCGGCG^ 
SEQ ID 494 

FTPEGGIAVHPPAMYGKAGTEFEAIiABKRIf HfflAQ 
SEQ ID 495 

ATOTGGTTTCmaCTGTCmTCm 
TGACAAAACACCGCCTGCAGCCMTACTTATAAAGGCTGGAGC^^ 

AATOBICCGAAGTCTATTTCACTTACAACCA^ 

GT4CmG(3ATCCCMCCTOTATaUlAACM 

mTCOTATTCCfSAAAAACCGGMTGGCAGATGC^ 

TOaMITGTraXCTCCCTCCCTTATCCCmA^^ 

GGGTGTGTTCTCCCGCC 

S SroA^FHttMaajyvps(K!ismssiQHPW^ 

HASBVYFTmGStmPNSNU^AERSTTHTLSLQGRSK 
HKIiFGSLGYAKSKLSGDIKIJjSTQPPKVIAGVDYEARAKKGVCSPA 

S |£gaMaS^^ 
CTTTTCAAAATATGGGCAAAAOX^AOTATGCCGAAACCGGC^^ 
(XXXiAACGTATTGCAATATCGffiTTGCCOTT^ 
GOaramTCAKffiTTTlGGAAW 
(XCTGATCAATTK&TCATCCGAAAAATCMGAAAATGGT^ 
AWAAACOJCGAAGATACCGCATTGTTGCAATACACGGGOGGCACGA^ 
AAAAA0CMTTG(XCGAGGGAAAAGAAACCGTTATCGOCGCCC , rcC^^ 
CCAAtXXGCGCGATATGAAAGGCTWATCGGCGAACTGAAAAAGCAG^^ 
TmTCAGGATTOCtmWTITCGGCGGCG^ 
TAAACATCGAATCATACAGCGGCAGCATi
CG(XXXMTTGTGGGTAAMGG^ 
CGAAAAAGCKXSTCSm^ 
GCGTCTATCGGCGTTCCGAACGAAAAAACCG^ 

SEQ ID 498

MEltfWLESYEKGVSAEIDITQYNSVSD^ 

ATAIIVLEHFAKTLELVLPRTQIKHV^^ 

KKQLREGKETVIAALPLT^HIFALTVNIj^ 

PGVCCNPLNIESySGSIGLPVPSTEVKLR^ 

ACIGVPNEKTGEALKVFVVK^ 

SEQ ID 499 

ATGCXXSTCTCAAAACATOTK^^ 

GGGTTTATTCTTTT&TC^ 

GATCGATTTG 

SEQ ID 500 

MPSE^RRHraCSPF^SNVHPmFATWHHKSTA^ 

SEQ ID 501 

ATAAAATTCAACAAAATCAAACGGTTGCGCGM 

SEQ ID 502 

IKFNIQKIUiRBnCVTVHLHIVKKRK^K^Q 
SEQ ID 503 

AT6AACACAAACCAACCTCKXGTTTACGAOC0GTO 

AATGGGCGGGCAGCOTrrO^CT^ 

GGCGGCAGCXjGCAGGACACGGCATTCTGTAI^^ 

CA(K3GTPC6CCGGAAAmTCGACrr^ 

TTCAAGGCAAAGACGTGCTGTACCGCATGACGGGGC^^ 

SEQ ID 504 

MNTl^PAVYDPLTRAIflWI^ 

QGSPEKIEWMAKLGOTFHGNIXMiIjFAAVVGH^ •! . 

SEQ ID 505

TTGCCGCCAAGATTACAGGTGAGGGCAAAAAGGCTTTGTACG^ 
GTCATGCGGTACAGCACGTCTTTGCCTTGMCA 

cairacrcGAmmccGGC 

ATACAGAAT<XXX3TGTCCTGC^ 
TT(?rca(^CCGMCAGGCIXXX^ 

ATAmTGTTTGCym^AAC^ 
AGACGGCATCAAAACATATGGCAATCCGCCT^^ 

SEQ ID 506 

LPPRLQVRAKRIOTWCRGK^ 
IQHAVSCRCIMVALRTI^^ 
IFCLQAEDLLKNSLRWAE^IVSPRnEIPIJPYlM^ 
SEQ ID 507 

ATGAMTTCATCGACGMGCAAAMTCGAAGTCGOT 
AAGGCGGCAGCGTCTGGGCAGMGCCGACGMAACACC^^ 
AGGTGCGGACGACATTGTCCTCAAAATGCCC^^ 
AAAGGOKOTGGGCAACATC^ 
TCGGCTTCTTAGGCATGCCCAACGCCGGCAAATC^ 
CATCGACGAAAACCACACKTTCXSTCATGGC^ 
CAWTTCTCGATCTCGCAC^^ 
TGAACAAACTCGATATGCTCGACGMGAAGAAGCCCGGGCGC^ 
GCGCCTCTTCCAAATCAGCGCGCTGACA 
GCGGCAAATGTTGAAAFEA'P 

SEQ ID 508 

MKPIDEAnEVMGKGGNOTSFRREKFVPRMPD^ 
KGGLGNIHFICSSVH^^ 
HVVDLAPFDETVNPAEEALAIINE^^ 
AANVEIIBQQPKTDTGVFKPE 

SEQ ID 509 

AimCAAAACMTAGATTTATTTTCA(S^^ 

CAAACTPTGGAGAAAAACCTTTTGGCGACATT^^ 

GGGATTTGAAGATACGC(XGGAACATTG^ 

ACTTTCCOTATOTTTTAGAAACCOT 

CmCAACATCXCTm^AClT^ 

ACGTTTGGCAGCCTTAAAAAAAGCTOGCCC^^ 

TTGtfTOTAAAtf^CT^ 

TAT0CGT0CCTGTMTA6AAGCCATCGCGGAAAATA 
SEQ ID 510

HOTIDLFSGI(&IRLGFEKYGOT 
TPRIIIiETLKQLGYTVYFKVLOTLD 
LWNGVRRLTGREMLR^ 

SEQ ID 511 

ATGACGCTTGAAGAACAACAAGCCAAAGAAGCATTGGACTC 

AGTTGGATTTTTTAAATTTAGATAlOT 

TCAAAAAAATGCMTACGX£TGAAAAACTAGCOT^ 

GAAGCATTGGCTTATGTQ3GCAATACA^ 

TOATGCCTTGTTTGATGC^^ 

OXXSAAAAMGMCA^ 

TCTTTAGACGAAGAGTTGGCAGA^ 

AAAGCCGMTTCAAAACATC^AACTGAGGATGATTTGAT 

TGMTTTCCTtXXXITTMTGAAGCCAA^ 

SEQ ID 512 

mEEQQAKEALDGIIKKSRVHLYKPIQIAEILYHDR 
EAIiAYVGM)RYSFQI^EPLNl»FWLEPG 
SLDEELAEDIVSSISABRIVIV^^ 
SEQ ID 513

TTGACGGGTMGGOTGTTGTAGATGGCGGTCATATTC^ 

SEQ ID 514 

LTGKGVVDGGHIHQVSLEMSNSNVI SAHGEI 
SEQ ID 515 

GTGCGTATSTACGTCTGCGGCA^ 
ATGTGCGCAACATCACCGA(^TCGACGACAAAATOT 

TACGCCGTGCGCGAGTTTGCCGCTTAC^ 

GGAAA6CGGCGAMGCAG(XXjAACCGGC6TGGGAAAGCCO^^ 

CGGQjGCGGCGCGGATPTGCAGTTCCCGC^ 

AGCCACGTCAAATACTGGCTGCACAACGGCT^
TCCGTTTCTTCATCCTGCGCG^

TGAGTTTGATTTGTCTGAAAACGCCMCGGCTAC^ 
AAAACCAATGACGCACA(XTO3^ 
AAGAAATCX5AAGAOTGATTGCXXXjACGCAAACAG(X^ 
C6GCAC6ACTT0GC66CQCG6T 
SEQ ID 516 
VRKYVCGHTVYDyCHLG^ 
YAVREFMYGQISGKSm^ 
SHVKYllfIiRNGFIKVIX3EKHSK^ 
KTNBAHLAGCLKAIiGGIIGLLQRNP^ 

SEQ ID 517 

AAAAACTAAGGAMCCOTATGTCCCGCAACAA^ 

ACAAACATCCCGACAACGACTATTTCGTCAAA^ 

MTGGTGGAAAGCAAATCCCTGAAACTCTACOT 

ATCGAAGTGTT(X3GC6AGTTCACAC^^ 

CACAA 

SEQ ID 518 

KN*GNPMSRNNEELQGISI>I^^ 
IEVFGEFTPRGGIAVHPFAHYGKAGTEFEAIiARKRLFEHDAQ 

SEQ ID 519 

TPGTCGTCTTPGAA^ 

GACGAMTATACAACCGCAGCATGGACACCCGATTC^ 
SEQ ID 520 

LSSLKADFDYQKTKVAAVNNKGSFPKDYS^^ 

SEQ ID 521 

ATGCCMTAGCCGTGCATCCCXjTGCGGATTTO 

CCTTTTTTffTAGGCGGA(XXX 

OX^TPCGGTOCTGCCTGC 

SEQ ID 522 

MPIAVHPVRIFHI(2*QISLIiRPEFL^^ 

SEQ ID 523 

ATGCACGGCTATTGGC&TTGCGCXX3AG 
OTITK?nXX3TTGCGA 

CGAAGCGATGAAAAACGAAGGGCGCGACATCGTC^^ 
GAGCGCGAGTGGATAGGCAMGTCATCCATACGCTGGGCTGG^ 
ATCTCGGGTGGCGCATCGATTAT(^ATGGTTACGC^^ 
CTATGCTGCCGAA 




SEQ ID 524 

MHGYWHCAEKRGYSGVAVYSKRK^ 

SEQ ID 525

TTGTGGAGTATGA^ 

(KGTT6GAAAGGCT66CGG0GGGTGGTGAATATTTO 
SEQ ID 526 

SEQ ID 527 

TTGAGCAGGTCCCGGAGGGTAAAGGCGATGAGCAGCGM^^ 
TGGCXXXZAATCAGCAWCAGGGCAGACAAAI^ 

SEQ ID 528

LSRSRIWKAMSSEAGTLNMAQTTO^ 

SEQ ID 529

OU&CGGCAGCA0GGAAGCGTCX3CAA 

GCAWGGT(Xat3TCCGAAC^ 
GCTGGCAJ7rcGGGTA.T6AAGCTGATGC^^ 

SEQ ID 530

MKSFELEDLTLWLIRDADEAEHWIDRWAVSYFVVOMSE 
ALVVSEHGGVPHGWAQKQADLWfJARIiL 

SEQ ID 531 

GTGTKX^CACCACXIAATGCGGCACGCX^ 
TOGTCAfiGMOTCK^^ 

CTOVAOCTCGAMCTTTCCi™ 

AcwcccGccGcomrmccAA 

GCATCOt&GACAACATAAACAGCGCX^ 

ATTCGGCAIK^TAGTCATACTCCACMCTAAAGGGGCGTG^ j 

SEQ ID 532 

WGHHQCGTARTADALBSVFKIVGKmGTRRYEN™ 

LKLKTraCPRLFEQVPEGKGDEQRSCTAQ^ 
IRQHSRTPQI^GRDPRIFHLYKRVRTRLWEQVRA 

SEQ ID 533 

ATGAMGTAGGTTTCGTCGGCTCGCGC^ 

(XXKXXSCCmCCTGATT^ 

CTTCCAAGCOTWmCA^ 

GGTCTCAAAAACGGCGTGAAAAACTAC^^ 

AAGCCGOTCCGGCGCGGGCGCGAAAAATATGCGCGAACT^ 

CAMGTGTCCGATTTCCTCCGCAGCGAAGAOT 

TCGMGGGCGtXETGGAAAC^^ 

TGAAAAAAGAOTGCCTGT^^ 

TTGCGTATCGTGTTCGGCAGCCTG 

SEQ ID 534 

MKVGFV^GHVGSVIIJQHHK 
GIiKNGVKNHGGNCTVSUCJiALGG^ 
WK<K3VEmiLGRSGOTTVI^ 
LRTVLGSL 

SEQ ID 535 

AT(»AACCATTACACATGCTTCCTATT^^ 

!I^GCGTTAMGACCAGOTMTGCGCCTGCM 

GAfiCGATAfiCGQCCGCCATCAAAAAGGOT^^ 

TATGGOUHSTCAACA^ 

ACCAAACCCTGCAAGGCCATCMTTGOT 

0&CC(XX3T<X»TGCC^^ 

(ECGCTATCCCT^^ 

CGATP 

SEQ ID 536 

MKPLHMLPIAALVGSIFGNFVIiAA^ 

YGNFNSSRLSIDP ELVRNI EIAKGAD5FNTC SGALGGGVNYQTLQ^ 
RGIPDPSKHKYHNFLGKIAYQINDKHRIGASFNGQQGHNYTI 

SEQ ID 537 

TWnTCTTAAACAC^TGGAATTC^COGCOT 
SEQ ID 538

LPJjNRWNSTAIJJiQTPIPKEKFDmyiFPIRR 

SEQ ID 539 

ATGGAAGAAAAAAACGAmCACC^O}^ 

CCTTTTTGATTTTGTTCGTGCCG 

(mCTGQCGGGOOTPTGGCGTAC^^ 

TOCTTGCCAACmGT^^ 

SEQ ID 540 

MEEKNDYTDAVSDNRNGQEIE^ 
GTOTWiVREIAWSVLVMIAAVIGIiAW 

SEQ ID 541 

GTG<XX£AACGCCMCAGGACMTCTGC^ 
TCAAGTCGCTGTCGTACGAACTG(X^ 
GCACCTCGACAGCCAAGCGATCTTCCy^GT 
AAGGGCXnXSATGACGCTTTTGGAC^C^ 
CAACGGTAACGGTCTGCCACAGTGCGAC^^ 
CGCGOTCGTTATCGATGTGGGCATCAACCGT^^ 
CCX&TGACGATTGCCACATT^^ 
SEQ ID 542 
VAQKQQDNIiHTPCIA^ 
KGVMTLLEAYGIDPKGmWVGAS^^ 
PtfTIATLHEHTLHAASIflDA 

SEQ ID 543 

(rrGTTTTTGCGAAACTTOTT 
AAAATAAATTTGCCCCGAAA 

SEQ ID 544 

VHiRNFFTVDQIX3ffiIRPPYKQV^ 

SEQ ID 545 

ATGCCGAAAATAAATTTGCCCCGAAATAAMT^^ 
TCCCTGAACTGTATATTCXH^ 

SEQ ID 546

MPKIlJLPRNKIRIiFGMKYFFKSKTO 

SEQ ID 547 

TT^TGGTTGACGCTTGGTCT(3TCCCTGATO^ 

TOCTCCACAGGGGGCACOTACTTATCTGGCGCAGGCG^ 

TAATOTITTGGAGMCCGTGCOGCX^^ 

AAGGGCGGCGATTGGGTCGTGGATAAGATTGTCGGTGC^ 

TCCGTGCGGCAGAGGGCGGAAAA 

SEQ ID 548 

IJlTOAWSVPDFESnCPPEALAAIUjALLPRPLVFTNGCra 
KjGGDWVVDKIVGAAETIiARGGQVPS ipflhqtsttetlakiraaeggk 

SEQ ID 549 

atgacggttttgaagccttcgcattggcgggtgttggc^ 
ggcagca6atgcxx3(kx5catatacgcggg 
ttttcagacgg(^ttgaagca(^gtgcgcgtcc^ 
agggggcggcaggggcggaagtggtcgcaccgtttgggrc 

(^TGCCGGC^CGCTTTGGGGreTTTOG 

AACGGTTGCCGTGGTCGGTATCGGCATCMTTT^^ 

TTGCTOGAAACATTGCWGCGGAACTGGGOGCGGTGTT^^ 

TOTKmX^CGOmACCGTGmm 

GCCCGACAACAGGTCGGTfflCCGTG^ 

AACAGCTCGCCCGAAAAATCGAGTGGCTCXXOT 
CCGOTCAGCC(mCGOTGC^^ 
GAATCQCKBCOCflXXmO^^ 
TGATGATGC^CGGCCXSTTO^ 
TACCGTGC(KXrPGGCGGACAACCTCGl^ 
SEQ ID 550 

RGRQGRKWSHRLGECLMFSFtWD^ 
bLmiLAEIAmEQYAEE^^ 
GSAFYRDLSPLGAEWAEKADGNVRIVGCAVCGESKKAQVK^ 
ESLAVRTANLNRPAGKRYPFPTTO 

SEQ ID 551 

ATGAAGATAACACACTGCAAAIITAAA 

CACTGTTTTACCGTAAUTCCOC^^ 

ACGCGGTCXXX3GCGCGGCAGGAAAAGCGGTTGTCTO 
AAAATCATGCCXjGACGGTATTGTTTATQCC^

CACACCGTCCCGGCAGCTAAA^ 

SEQ ID 552 

mTHOCLKKEVQKEPIiKFVP^ 
KIMPDGmW)SPGSRGKSD^mCRIl®SEEP^ 

SEQ ID 553

GTOTAAAATCCGCCTnAAAAACCGCCG^^ 

(xxhotcaag(&gcagggttatcaa<^^ 

tattgtcggcatcx&catcg^ 

gccgaaatcaaattcamtgot^ 

(^gaaaaccatcg«:gaacacgtcggg^ 

juocatcgto^acaaggacacga 

ttacxxnatggcggacgcgotto 

GACA3Xrr<nTIGGGCGGC^ 

SEQ ID 554 

VLKSAFm^AMMNTTSOT 



GKTIGEHVGUSFYTIjGQKKGI^IGGAGEPWFVAA 
DICLGGGIIQTTDKPVim 

SEQ ID 555 

T*nX^XXX5AAGGCGGGGAATCGGAACACGCTT 

(XXOTTCAAACATTGAGt^^ 

GCCGGAAAACCX3GGCCG!KXS^^ 

GCGGAGaGTTGCGCTTGAAAmGAAGCGGAACAG^^ 

AOGGOGCGGOGGATGCCTTArroGCOBOGCAGG^^ 

OTGCATTGAGTTTGGGTGTCGGCAACAGCCGGGAAAA^^ 
AGGACGATTCTTTGAGCGTGTCGCXjTATGACGGTTO 

a i 

SEQ ID 556 

LPPKACailOTLiKKYAFFQHGAASVRIC 

AEQLRLKKEAEQKKLREKKQREEKARBTO 



SEQ ID 557 

ATGTATGTCTOGTATATGCCGACTCXAAGTC^^ 
TCAATAAAAAITPAAAAATAAATGCXjGATTTTTTG 

SEQ ID 558 

HYVSYMPTPSVKWK^IQC^^ 

SEQ ID 559 

TTCTTCAMCACGGttGGC^^ 
TTAATTTGTTTAITTTTCATOGCATO 

SEQ ID 560 

LPKHGRLLSKfflEAPVH&CRPRFVRRNBSLPTO 

SEQ ID 561 

TTGAACAACCGTCXX3ATTTGGTGCGGACAGG 

TTTTAMTGTATTCAMCGACAAAACTCCCTOT 

TTTTCCGGCCGGTTTCCCTGTT^^ 

SEQ ID 562 

U^NRPIWCGQGCHSGVPERGFGDAKAVKPLR^ 

SEQ ID 563 

TTCGCATTCKSGCGGAATATOaAAAAAGGC^ 

ftjQGGA2ADCGGCC6GftAAA(X?rTGOCGO^^ 

CGTT 

SEQ ID 564 

WGGILKKAAKMPSEGFRRHRGPKRANRH^ 

SEQ ID 565 

ATGCTCACCCTGTACGGCXSAAACTT^^ 

j^GACGG(X£AMTGGG3CGCGAAGTGn^ 

AW AAAGACGGCTTCAMGTGCTGOTTATTGCACCG^ 

CGGOTATTGTTAAACACCGCQjTTTCC^^ 
ACX^U^CCCAAGCCAGCACGCC^ 
SEQ ID 566

KLTLYGETFPSRLIiI^ 

IKDGFKVIiPYCT©IiIACRRLLDAG(^ 

TKAQASTPTVGQPFWHSAEY 

ACACCACTTGGGGCGCGTTTTC^^ 

G(^CAAATCCTTGATATTTTCGT^ 

CCTTTTAC6CAA6CA6CGAT(^riTO 

(MCTGCTGAOG^ 

^ HJAiflM^ 
GQIU)IFVFDKimKAHWIAPAASTVI 

AACGCCTGAAGGCGGATACCACGGCGGTTCAC^ 

OAAGC(K7TOATCATATCTATAAA(WX:CC^ 

A^TTCGACAAAGAATTGCCACACGAAACOGGC^ 



KFDKELPHETGNKAV^^ 

COTCATGAmTCCC^ 
C6AAGTCAAAGGCGGOTGCCTTI\K^ 
ACCCKaAMOTAAAMCGCCAAAGC^ 

SEQ ID 572 

MKKI4LAAVMMAGLAGA 1 

TIAFKIC^Q^QI£VKTAPMSAJOTGHHHGEAHQH 

TATACGCGCCTTTGGGGATGAAAACGGTGTTC^ 

SEQ ID 574 

&SDAMPSESRRRAFGVSIX3IQ 

AAGOKOTATACGGAAACCGTTTT 

SEQ ID 576 

MNIILNGGPAEIJJCTSVADLIA^ 

SEQ ID 577
ATOIGGCMMCATCAAATI^ 
ATGACGGCAACCGGCCGGTTGCGTATCAAAAAAIAAAfi 

SEQ ID 578 

MWHIIKPIfflSBSGGNPPENLRNHHIjKNSPRlSK^ 

ACGGCAACGMTTGAMCGCGAMTCGCCCGCTGCff^^ 
TCT0G(X^GGAAGACATG6ACACCGCCGACCTT^^ 

G^TCGAmGAACAACGCC^ 
GTCGGATGGA 

SEQ ID 580 

MITPPIJCSLLKFYAVVPTAD^ 
yiAS^FQTTTKQOTAPQGLDKLREyV^ 

S MGA^^ 
OCGCXyiATGTOCOGCCGCGATGCT^ 
CAC(XnCACGATGATGCA(mA^ 
CXHTOXIGCGCCGATGAMTCGCCGAAC^^ 
ACGCTTTCGACGAACTC^^ 
CCAATOXXm«mCCAfiCACCT^^ 
AICG(XXXX3AMGAAMCCACGTCTTCGTO 
CCGOOTCGAA(mCATCC^^ 

AGACAAGAT 
SEQ ID 582

RHRADEIAEREPQ1/jGRFSIX5IYIjPTEGQ1M 

IAPKmvmGA^IESBSQAPASVRSGimSALYAVHPAF^ 

RQD 

SEQ ID 583

CATCC 

SEQ ID 584

KIIYKHMiSNRLVGRRjGSRSK^ 

SEQ ID 585 

AflXSAACCTTCCTC^ 

CTGTCCGCCCTGTTCGGQ 

SEQ ID 586 

MEPSSXAAKKKGKSGIRKVINAF^ 
IjSALFQ 

SEQ ID 587 

GTGCTGATTTCG&AATO 

SEQ ID 588 

VLISAIDTAAPNQTSPMAAEDEGEEALPDI 

SEQ ID 589 

TTGGGCTGGCAGCGCGGTCTGGOGGCCCTGCTOTGG^^ 

TGTCGTTOGCAAATGCGGTTCAGT^^ 

GTGGGACGGCGAATCCTTTGTOTGGTG 

C£TGOTGC0GCTG^ 

GGCTCTGTTTACCGGAGAAACCGACGTGGCGAA^ 
GCCGGO^AGTGCGAA(^CATTTCCGCGCGTTT^^ 

AGGCTTCATCCTCTACCGCTTCCTO 

ACCCAATCTTTACAAAGGAACCOGTCA 1 

SEQ ID 590 

LGWQRGLAALU^VGGAIiFPAAAnG 

LAVl^SVEVFASSGTNAAPAVSIX2mX^ 

AGASANNISARFAEIPMVGVTLIG^^ 



SEQ ID 591 

ATGCCCTATMTCAAGCCOTGATOT 
TGTGCAraTCX^TGOGCimCGTO^ 
TGTGG(XXyK£GCGCCGOT 
" GTGTCCGCCAAAATCXEGTCT^ 

ACCTTQGTGCGGCAGTTTGTCQATGOOGCXXaTC 

STTTT 

AWPCAACCGCAAGGGGACCAGCGTCTTTTC^^ 
CGACX^GGCAACX3AAACACXKX}GCACCGTATTGA 
CAAAGCTACGCCTCCGTCCGA^ 
GCGGACAGGTGGACATCAKAGCGGGAAATTTGTC^^ 



GCCGGOTGACCGTCGGCGGCAGCX3CMTC 

SEQ ID 592 

HPYNQAPDFLCADMPTYSAVQARLL^^ 

VSAKMPSETKGKPVCSASDPIAGLDSAAKVAUMVEA1AKAAD 

KQALTNLESRPAPAGSfTVVLGNG^ 

QSYASVE^RlflfflPFMEWSS^ 

AGLTVGGSAI 

SEQ ID 593 

ATGCOTCTGAAAACCTrTCAt^CGGCA^ 

TGCOCTOTTPGCCGCACACGCCGAC^^ 

GCAGccTCaxxxnrcacc^ 

GATTTOTCOGGCTCCTAG^^ 

TOGTTTCGTCCTGC^ 

C(^C6ACGGT^CGCCTTTGGCX3GCGA(^^ 

COmAOCOTtmAAA^ 

TAACKXMGTCGTAGCG^^ 

CCAGACGG(mGOTAAACCATATCCTA!ITO^^ 
GGAATCGAGGCCGGCMTGGGGTCGGATGCAGj
CGCTTTATCGAATC(&TGC^ 
CCAAATGCCWKnTPCATMGCXj^^ 
CAGCCGCttCTGTACGGC^ 

SEQ ID 594 

MPSENLSIX3IVPDSASDCMADGQPGIDW 
DFFRLVAAVFHKOTGHTfttfl^^ 
RPTVAQNIXJHLAG^TGFEIGQGLFDGGIDK^ 
GIEAGNGVGCRTNJUA^ 



SEQ ID 595 

ATG(XX3TCTGAAAG(OT 

TTTCTGATGAACTGGCCTTATTT^ 

CCGTGGTAACGGCATACCGCA1XXGCCCCTTCT 

G6GCATCAAATGGGACGGTTTCACCTG(^^ 

GGCAAGGCAATCGGTTTGAGTOGOT^^ 

TCGTCAGCACGCTCGGCATO^^ 

SEQ ID 596 

MPSERPSIX5IALPFPRm)IPKHSLTLSYKPmPPFVm 



SEQ ID 597 

ATGMCTGGCCTTATTTMTCGATGCXCTACCC^AA 
TAA0GGCATACC3GCATCCXaCCCOTCT 
CAMTG6GAC6GTTTCACCTGCG6CGTM 
GCAATCGGTTTGAGCCGCTTICAM?PGT^ 
GCACGGTCGGCATTGC^GMTTGTTATITOT 
TTTGTTGGCGCQGOGGATTGAAAAOOGOG^^ 

SEQ ID 598 

MNWPyLlDAVPKFADAAKLTLELSVYGVVL^ 
AIGI^ItfQVFRYVEIiPQWAVAVPMGArc 

SEQ ID 599 

TTGGGCGMGGCCTGCTCTTAACGGCACAAATTTCOT 

GACGGTTTTATCTCGAAWXATCCGMTCGTO 

ATTTTCCCTGTGGGGCGTTGCCGAAATGGQ^ 

TGCATCGAGCTGCCGCAMC&ATO 

GGCAGCAAAITATTGAAMCTCGT^^ 

TGAACAAAAATGGGAACAC 

SEQ ID 600 

I/miiLTAQISMSVAASCVL^ 
CIELPQSmVLPGAVHIiFTRra 

SEQ ID 601 

TTGCAAACACTGCCGGAAC^ 
CTAGATT0C0GCTTTC6CQGGAA 

SEQ ID 602 

I^FERIHraSHPIiHHSRESG^^ 

SEQ ID 603 

TTGCATCCTGATTGGGTGTATCGCTTTO 

ACGAAGOXTGCOKXm 

GGGCA(mCCCTTmCCGGT^^ 

CTG(XXX&GATCA!KXXX3CT(&^ 

CGCKATACGGAAACAGG^^ 

CTCGTCTGTGGCAAAGGTTCGCTTCGGGCCGGC^^ 

CTGACGGGGAATMTGCGGTTCAGACGGCATCGC^ 

CATCOTAGCGGCATOGCTOTAreOC^^ 
GCGGCACGOTCGCTTTGCCGGCXSGGCGCAA 
TOCACflOTATGCCGCGCGCGTCA!^^ 
TOAGACGGCATTGCTTTCM 

SEQ ID 604 

IflPDWVYRFFSFIMPPFIFATFFMKPFAHttP^ 
LPBWRl^DyGFLABWm 
LTGM&VQTASEGKTC^ 
AARFALPAGAKKLSHLYRITEAVGRW^^ 

SEQ ID 605 

TTGTG(X5QCA6AACX^TGCATOTTG6 
GCGATMTCMOGGCAGGCGQACGGCGGTTTC^^ 
SEQ ID 606

LCGRTCIFGAFGQLVFF(31SMID(^ 
V 

SEQ ID 607 

TTC^iACAAAMTGGGMCACTGACATGGCOT 

CTGGCAAACCGOTGGCAAAAAGTCGGTATGGTCOT 

GAAGCAGAGGCMCAAGCOGGCMACTGTTGGAACGCGTC^^ 

GCCTGMTCCGGAAGTCATOTGCTGGAC^^ 

AACCCAC6AAATGG6GTTCGCACGCAAW 

OXX^TTTCTGGCAGGTATGGACTAC 

SEQ ID 608 

LNKNGNTDMALLSIRXL^ 
EAEAQAGKLLEHVGIjIjDI^ 
RQPLAGKDY 
SEQ ID 609 
ATGTTTGCTTGGGAraTO^ACA^ 

SEQ ID 610 

MFAKDVSTHRTTHKALPYSiPSEKGSDGLR 

SEQ ID 611 

ATGAMTTGMCGCCAAACTCJ^GCOT 

TTGCC<m^mAGAAAAAGGCGTTATCa3^ 

GGCCAMGAOTGCTCGGCAGCCC^ 

CCC(^CGCGCCGMGCCGTCX^TTT^^ 

TCAACAAAGGCAJXIACCGCCGACGCT^^ 

ATCXXXJUi'ICICGGTCCTGCCGMOT 

ATGGCX5AGAAAGTCAAACCGGAAGCATTGTTGG 
CCGAA 

SEQ ID 612 

MWjNAKLKALLASAAIAVGLTACOT 

PERAEAVDFADFY>KVA1£WSPKNXPIT^^ 

IMWVNGEIAAMKKDGRLmYEKm 

SEQ ID 613 

GT(XXX&TTTTTTTCAGCC 
GTTTT 
SEQ ID 614 
VRIFFSRRKYCKFKGKIlMiIVm 

SEQ ID 615 

ATGACGATTTGGAAGTTTCXX^GAMTTCAAAMCAGGCAAM 

SEQ ID 616 

MIWKFPKIQKQAKPDELDFRTRKNNRINKTJffl. 

SEQ ID 617 

TTGACAAGAAAAT(ECTATCC(£ATT^^ 
GCGC(nTOK5AAATCAAGTTCGTCCGGTra^ 

SEQ ID 618 

LTRKSV5AFHK3CGFFSAAENIi^^ 
SEQ ID 619 

TTGMCAAGCGTGCCGCAAAGCC^ 
TGCTGCTGCMTATGTTTCGGAATATACGCGCGTGCAff 
GCCGGTTGCCTATA3TITO^^ 
CXXXSAAAACGGGCGCGT<ny3GGATTO 
CGGCGOX&AAAATGCG<K^ 
CGAAAACGGC^ATAAACATTTGTOGCMGGCGATTTGCG^ 
TTGGCGGAAGGCGGTTTTTTCTTCTTGGAA^ 
ACAGGGTTACGCTGGGGAAOTATATGAAGCATTTGAAA 
SEQ ID 620 

iMRAAKPEKMFKFHIffiQTG^ 

PENGRVWDUH^SGAVAVTVALERPDAFVRASDIOT 

LAEGGPLIiLfflGFDQGMVRGVLAEN^ 

SEQ ID 621 

ATGAAKXaGTCGTAGTT^ 

TC^TCATCCTOTGATCGGC^ 

TCftTTGCGTGCGTCA^ 

TC&TGTGAAAAACATCAACGTGAT^^ 

GATACGGCGGGCAA0(XX^TGCGGCAAACCC7IXXXX^^ 

TGTTGGGCGCGATGCTCGGTTTCGCCG^ 
fflttt'WfflDBfflVIKHIlfflBnVPPUjISHIMi^^

ilCMTCraX^TTGGATAAAM(XjTTGC(X3MTTGAAC 



OTCCtXXXCGTTTTCAGACGGC^ 
SEQ ID 626




SEQ ID 627 



SEQ ID 628 

LERRIVEIjBIQT 



GCGGCACATAAATCGCGGCAA' 
CGCmTACTGCTI^^ 



££52255^^ 



- 1 1 TGTTTTGCAGCG6CATAC0CGCCACCGCGCC0 

XTTCOTGTTOICOrKmACCTATAMTCTmCAUTCAAA^ 

\3^I\j\ft'im.\-*»-oi«wi(v-\jv.«v-w«wju^«.«-.™-™ 

AACCCMTAACGGAAMTCCGCCMm^ 



BPITBHPPIIfiTIISiaSWSEA 

AAACACACTACGGGGAAMCGTCTTACTCMTGAGTCTGCGAAACAGACATT^^ 

SEQ ID 632 
CCATACTGTCGATGTCCGCGCCGTCAAQjA^^

ACGGCGGATCGGCTTCAGAOGGCATCTGTO 
GTXmC(mGGCTGGCGGm 

SEQ ID 634 

IiDEIGIBGQQKIjSAAHILVVGCGGIiGAAAIiPYLAAS 

CAGACGGaTTTOX^^ 

SEQ ID 636 

VMVPQQECVWWFGEGHDD^^ 

SEQ ID 637 

TTCAGCTACATTTAC^ 
ACCTOTGCACGM(^^ 
CGATCTGACGCTTGCCTGCGGGCTGTT^^ 
ACGGAMOXTICCGaGGCTCAAAGCG^ 

GACCA^^ 

TOCAGM3QQXTATCCXXyfflGTCQQCOT 
GCCACGCCGATO^^ 

GCCTGACGXAGAAATCGCCCGCACCGAAGMCOT 

CTCGAGCOTAWn^^GC^ 

TGTCCGTGTTCGGCTTCCAC&TGX^^ 

AGAGCAAAAACMGCCGCCTTGTTGCGC^ 

ATCAAAGACGAATTTGGCGAAM 

ACGGCAAACC^CACAGC(XXATCAACATCGTO 

AAGCCGCGGCAACATCCAAGAAATCATQCTCGOT 

AAATACGATGTCCGC^TGCGCTTGTTCCAC^^ 

TTA0C6AGCAAGGC6AAGTCATCAC0G0CAAATACQCX3^^ 

T<^TG0CAAACT6ATGCAGGCATTGTCGGACGTA^ 

ClCAACCTXXmKXMTCCCCm 

•p(X5GCAGCGCGGTG6AAACCTTCSGC6AAGG(^ 

GAAMCCGACATCACCTTGGCGGAAMCT^ 

CTGCTGCAAAOTAAGAACTITr^ 

MGAACCCGACAATCC^CGCCCTGC^ 

SEQ ID 638

LRYnSnGNBFKIMQLHIIMPKDAALAADAEPLKQSLF 

TETVRRLKAGKAIKKSVQR^^ 



IiEPYASAQKFLDDIdatLQR^ 



KYDVRlKLPH£aiGGSVGR(X^ 

UJLGSRPASRKTIjARIQDLRAIPWVFSVIMQN^ 

U£TEELLRDNBSIi^ 

ATGATCAAAGTCXJPCTTTATOGCC^ 

GGGAATraOTOTAAACGGCGGTO^ 

AAAAGAATTT6ATCCCGTGATTA3XK^ 

GCGATCCGCGACTTTAACGAAAAACTGGCGATTTO^ 

TCGTCAAACCGC^CGGCATGCKCCm 

ACGCTACATTOCCGAAATXX^AC^^ 

GQOGGQOQCGGTOTOQCa^^ 

TGACCGAAGTCAACGTAACCAGCCCKSACCG^ 

SEQ ID 640

MMKVLPIADPMASFmKmryAMMR^^ 



SEQ ID 641 

AIKEAGGGtra^ 
AGGTAAATGCCGTC 

SEQ ID 642 

MOGERRTPRTCADNGNFTO^ 

CCGTTGCCGGATTGAGAAGCAGCGCAG^^ 
GAWSMCICCTMCJGCAQC^^ 
TGATGGAACX^AGCGCGCGTTTCGAA



CAGCAA 



SEQ ID 647

ATCAACATCGAACMfflOTACATCGAAGTCA' 



SEQ ID 650

LLTBRLPRWSGASARGRSRPLLLPA 

CNSVSFSSAjSVLTVSRAQETSH I 



LKVYAGAESAHAAQQPKVIiEIiK 
TQDFVLGIDDBPPAIiHIiVGFR 




(^CK3AAACW3GCGGCGCAACAGATGGTCGAG 
SEQ ID 661

ATCCGATTGOGACACGGTAACC^ 

accgaaacaggaiitcacgggctgcccg^ 

cct(»ccggcotc^^ 
mstkaagox^^ 
cttcc^cxmaaatatcgcccgccxx^^ 
ccggcttaaggatitxkggatttogacgggcaot 

GCCGA^^ 

SEQ ID 662

PDRRSIACGIIIMVQPPCVQIXSVHTVA^^ 
PA*GFADFDGHFRPRQRTARRIff^^ 



SEQ ID 663 

ATGAQATOACGAMTATCXOTATAACTCCATAAGAGAOT 

TTTCTTTTTATAAAGMTTCAATATAGGTATTGG^ 
TTGTATTTCTATACAAAATTCAAAAGTTAACATACTAAATATA^ 
GATAAAAATTTAATAAGAGAGMTTCAGTATCTACCG^ 
ATTATGAT 
SEQ ID 664 
MRMKmHSIFELEDnJmQ 

DKNURENSVSTOIISBWEIDK^^ I 

SEQ ID 665 

ATGGATTCCATTATCGAATTGCGQyiCC^ 

TGCAGMCGAGATTTGGCGCGMTCACGGMGTO 
CCCCAAGTCGAACTGGATATCGTATCAGGATTCX^ 

AccxacTGrmra^^ 

CGAGATGCTGGATTTACCCAAAAAAATCCTGATTCCG^ 

CTTCXXTATTGGACAGTCATGCCCT^ 

GTTATCTGAACAACTTT^^ 

SEQ ID 666 

MDSIIHiRHLRTLIAIiE 

PQVEU)IVSGFQADPVGLIjI^HRADLAIVSEAERQSSISFHPLPA 
LPYWIVMPyLBKGYVVHRQITADGl^^ 

SEQ ID 667 

TTCTTTGGAAATGACAGGAAAAATATGGTTITO 

GGGAATATATCGGGAAAATCGTTACGTCGOTAACGGCAAGCAGGGCa 

GC^GACCGTTACCGGTTGGAAAAAAGAACGGATTGCC^^ 

GCCGACA(&TTCGGTTTTGCGACAGAA 

AACGTMGCCTGTCGCCATCGGCAATCCCGAACGCGA^ 

(KK?TCAAATTTCC6AAGCCCAATACa 

GTTFrCGGGTTGGAAGTGTGCCATTTTC 

SEQ ID 668 

LFGfflJRKNMVIiRRDFIAWCD^ 

ADRU^ATEKRFGBQNLLNSGSLKQAKTLGAIi^ 

VFGLEVCHFDEHNPA 

SEQ ID 669

ATCGCACACTTCCMCCOGAAAACC^ 
TATTGGGOTCGGAAAJTPCAOCCGTC^^ 
T(£C(&TGGCGACAGGCTPAC^ 
TGTCGCAAAACCCAATCTGTCGGCGAGTTGG^^ 

SEQ ID 670 

MAHFQPHJUIRRFCQRAYAVTFRRVMPCRNE^ 

SEQ ID 671

TTGAGAAATA5CATAAAACTTTACOT 

SEQ ID 672

LRNIIKLYLILNNALSITANWVl^^ 

SEQ ID 673 

ATGGATAA3<^GAAATCAACMCXK3CCGOCXX^^ 

AAAAAGCCAAGGCTIXXGGTGCTGCCGTC^ 

QOTAftAGACCTGiAAfiGCCTGAACGGC(^ 
COKCCGCCTATATAMGGTGTTCC^^



mAMXTIGCTtai^ 
GCnTTTCCGAAGGGAAAAAAtT 



TCCCOTCTATACCMGCTGGAtA^ 
GTOGCAACCAftCATC 

LVmiOTO^LGAmWimVARILSFmAm 

CCCGA»TTGiaa^lliU^iiiw«u^ mnMMnn!l mM»n«^^ 

TCCGAGCTTA 

GTGGGAAGAACGCTGGTTCGCACCGCTTT 
CGGAWmTTGGCG^ 

S ^^?iCTASARSAYPKCTAESOTAim 
fft/JMsUXUA '« ^.-li -iKw >- ^.«^»»mFrm/*!T.i/!ir^pix3EDMjEKLRVQ^ 

'APLYEAVRTGnKRMIATDGQHGGTIiTFKPrDB 



RKFwcnsmoiH 

(^CGCOGAAi^CTGTCCGGTTTACCGT^^ * 
SEQ ID 682 



H&ESCPVYRKTKYHPKWLDAV 
M(XXX&CGAOTTGCajAA^
CMTGTGGfl'TGAACTCAAATGCAG^ 
WTCTCTACC^OCGCCTOT 
CCGAACCKXXSCGAAAGATTTG^^ 
GSTGAAAGATTCTTG(&AGOCTAAG 
SEQ ID 684

MPSEMCLKKTTOKDQFAIMIFIRTIIEIKLKSGKHE^ 

YAVGLIKKKAYVDDLTPEBMRBYRGTimOT 

MFEAHRPLYDWVLDinPAI^^ 

LKVTLTNFQAGKTQSRRAAFHPNH^ 

RLYDRIiPrVERPGATOGEDGEYLPmFLHPESVKEITO 

SEQ ID 685 

TTGCTGC^AACCCCCAAGCTGCGCGT^ 

GCGGCGMTTGATGGAAATGGTGTACAMGTCGMGCCGCT 

(XXXXXjCTTTGAAAACGATGCGAC^^ 

SEQ ID 686 

LLQTPKLRVTAVHNDSGSPAFCRVM 

SEQ ID 687 

ATGACTGATAATCGGGGGOTACACTGGTTGAAl^^ 
CAGTGCGGGCAGCC^TTAGAAAATGCAI^Tm 
TTGTATCXX3TTTGAATGGAATCGCGC^^ 
TTTATTTGCMGMGTCCGCCAGTTCGTGTAGTG 

SEQ ID 688 

MHMGFTLVEIJSmiliSVIJ^^ 
PICKK5ASSCSIX3LDYFKGNDKDCKLLK 

SEQ ID 689 

ATGAGGGCAGTCGAMTGAMCAAGCTATGAAGMTTGGTTT^ 

ATTTGCG(X5ACCAAGTMGOTG^ 

AGAMTCAAGAAAAACCTGATGTTTACCACCGATMTGTCGG 

GOGCGTTCCAAAGGTGCAGACTACCTTTACGCOT 

GGGAACAGCAAGGCGTTCAAGCCGTTGAGTTGGATGCCAM^ 

CCGTCTGCCTAACGGCAMGTGATCCAGAMGAGTACGAW 

GTGAIGATmCCTATTT^^ 1 1 

SEQ ID 690 

MRAVEMRQAMKNWAAI^^ 

ARSKGADYLYAYMRGFYKDPTRPS^^ 

VMIH1PAVMI1PIAYFI1KKBYHKDVH 

SEQ ID 691 

TTGMTATTAGGATGATTGATCXaAGCTGAT 
TTGAACGCGCAATTGGCGGAAAAGGCGGCAAGATG 

SEQ ID 692 

UTCRMIIXjADDCFAAFGGVGVGRVVYMV^ 

SEQ ID 693 

TTGGCGGAAAAGGCGGCAAGATGTGATTTTATTGAACAGGC^ 

ACCGTTroTCGCAGCCGAAAAGCAGATTGCCCAOT 

CGAACGTTTGCmCGaGCTC^ 

GACGTGC^ACGAOTTPC^ 

AGGAAAAGGCGCAGGGTTTGGCGGTTGAAAACGMCGTTTO 

TCAGTTTCAAAATCTTGCCAACACGAT^^ 

GAGTTGGTCAGGCAAAC^ATGATAAAGAATCGCGCGAGCGGC^ 

CGCI^CCGGCACGCAGMCAAGGTTCAGGGCAATTGGGGC^ 

AAMGAGGAAGACGGCGGCACGCGC(XX£T(^ 

aXHXX^TOX^TGAGtXX^^ 

TCGTCTTTATGOTATCCCT^^ 

GACmXXmOGGTGGCGW 

AGCGTCGGCAAAGGCATCGATCAGGCGCAAAACAGCTT^ 

CA(mAACAGCTTCAACGGGATOGGTCGAGCG 

SEQ ID 694 

LAEKAARCDFIBQAHAEIASEIA^ 

DVQNDLSDVGNRFAAAEKQIAYliQEKBAE^^ 

ELVRQim£SRERLTLQfEliKRLQGI>NA 

AADADEAARELAAYIASIEAm^ 

SVGKGIDQAQNSF^AFKQLAEGRGNLVGRAEKimG™ 

SEQ ID 695 

ATGOCGTCTGAAACGCATCCGCCTTTCCTATTCGGOT 
TGTTOCCTGCCTTCACGCCCMWGACGC^^ 
TGCCGACarmCGAGCGTC^ 
COXmGTCGCCAGCA(X^ACT^^ 
A^AGAOGAA^T*^^

GAAAAACGGTTGCTCAAJ^ 
HPSETHPPFLT^ 

QRV^OmUVLIiGKTAAFFF^VGKI 



laiAQAPAHliAQQGQLFFRSKAFLPDl^CQTFV^ 



GCMCGCGA^T^ 
03AAAAACGGCACMTCC6TTTTTC 

SEQ ID 700 

VTUtRFFBRBRVCRLFYGGtPPVIiIICBSGD 



AGCGTCCAACT 

S «^AVRS la «I^IG re iX 3 IEF 
VMWTRI6KQPFHIGFAKCGDRI07ETFKCRIW(I>PI>IflBQ8PT 

SEQ ID 704 

LPMPSHlYVKAIiKSGTIBrSSKRFHfiffPBPPQCRIiKPSV 

PKIGTlilTYRYRGFTRRGTPKFATFVRVB^ 

TGGTOGTG 



OTILDEGHTHSMQAKLIjEMGAASCRAA 
AAGACCAAGACGACAT 
CGTGCGCGAAAACAAATCGGCGATTTTGACCGG
aXJCTA^GGCATTTTCGCCAT^ 

SEQ ID 710

MIGLLIITHETIGEAYRKLAHHFFPGGL^ 
AVKGIFATTSAPEDLVCRRSGDAV 

SEQ ID 711 

ATGTPTTCCTC0OTGO3GC^^ 

GGGTCGCGCCGAAGATGTCOGTCATMTCAATAC^ 

SEQ ID 712

MFSCPAASSIXSIAAAPAHQIFGRGRIXJ^AFYRRLSDAFG^ 
AI5V7RQPARKKKMRQLAVGFAYGFVCDD 

SEQ ID 713

GTGGAGTCCGAOTT(n^ 
TTCTCCAAGGTGCTGMGOVCCGGGCGAA^^ 

SEQ ID 714 

VESKFVDDFDGLFEHTCSRVHYSGLTKTSTAIiPRLG 

SEQ ID 715 

ATG<X3GCTGATGA!T6CTCGCAGCCGCCAA^ 
GCGAA 
SEQ ID 716 

SEQ ID 717 

ATGACTATGTFGCTGCACGG^ 

CCGMGCCGAACGTTTCGATGCCGCCGT^ 

GClCT^CCGATGTTACCTTGTCGOGCGAArc 

GACGATGOTATTTGCGCGAACGCAAGCAOGATAT^^ 

TTCTGATTGCAAAfX^CCTTTCATC 

6GACATCCC(mXX3TCG!PCGGGCTCCACAAC^ 

GAATACCGCOaXXjCGCCCGCG^ 

CCGAAGACGTGAAACCGCTGCACAACCT^^ 

CGCGATTOTCAAAAAMTGMGGGCAAMGCGTAACG^ 

CTGGGCATGACCGGCATCCGCCTGTGC^^ 

(XOTAira^GTGCGCCAGTGCC^ 

GCI^CCGTCGGCAGTATTTTGAMCTGGTC^ 

GGACATCCTTCCCIOTGAAAAT^ 

GTATGGGGCTGCGCCGTTTTOXM^ 

CGAAGAGAAGTCGGAAAAGCTGATCAAACAGATCAACAGOGT^^ 

SEQ ID 718 

MSIVLHGVAAGKGIAVGCAHIilARGTEEVPQYDV^^ 

BYRRRARETOHKRDLNRLKKTAAATAIX^/Q 

LQfTCIRLCMEPVMFRTQMRAILRMW 

GHPSVLRMIjQHVIRTANRHISCEVSVCGE^ 

SEQ ID 719 

ATGAAACCGAAAATCCAAAGGCATGGAGAGATTW 

TCCGCX^GTTGGAGGAGGGTGGCAGCCTG^ 

CATTGCCCGGCTCATTGCGGAACACATCCC^^ 

ACmCAATATCCACGTCGCX^KXXS 

TCAACXAGTTCAAAGTCGATTO 

ATGCTGAMGAGGCGGGGGTGGMTGCCTGATTGCC^ 

SEQ ID 720 

MKPnQRHGEILSLVRRHQFMSVDELAAALDVTPQTI 

TNNIHVASWSARTDYmiTSGVVRPIi^ 

MLKEAGVEOJADAV 

SEQ ID 721 

GTCGCTGGACGC^^ 

TOCCGTACTGGGGCGGTWX3GCTGC^^ 

CGAAAMCGCCGTATCTCGCT^^ 

GCCCGAAGAGGOTGCACGGCGGCGGACGAAAI^^ 
AT6GGGCTGCCCAATACC6ACGATC3ACOGCCATA 
CCGCC(mX^TCCCGAACACG(mGC^^ 
0CX3A 
MGIFNTDDDRHIPQNAVCLDHHGT^^

0CBIC0BWC(»OMmG0CK^^ 

AMGATCGACATTG&XaGOCTATT^ 
TCCGAGTGGCTGGACGGCATAOGWm 

SEHLDGIBL 
SEQ ID 725 

SEQ ID 726 

LRFQTaFEKQCRLKDRIREHSPERyVPDRV 
TGGTTTTCAMTGCCGTCT^^ 

SEQ ID 728 

LVKIETGVLMJVQAAHCYPIjHPIGIIPAWGVFTDSVPQTMiVFKCR^ 

(OGCOOGACGACATCCG^^ 



MJTTOXXBCCMSCQCGAftCGCMGMQCTGGfcCG 

acgaaac^^^ 



q^TAGGCGCGGCCMTATCGGCTGGTA^^ 
TOXX^CCCACGCCKX^CCTCGCCAAA 

cgamSgccgagagcttccacgacgaaaccctgcccxx^ 

TjCKXJQACAAGCftAAAA^ 
SEQ ID 730 



TTCCTTCTTACCGCCTGCG<^^ 

OGCG^ACATOiCMT^^ 
GCAATAAIMTCTGmiraACCAAATCC^^ 
SEQ ID 732

(jVPHITI/GALFGHLSDRPLYEWiSBBYGIPI^^ 
CGOTGCCAAMQCTCAAfiOA!^^ 



GTTTGCCAACGGTTCAAATi 



■AAAGGC(mXXmm^^AAGMTAAfiCGTCGT^ 
TGCCGCCGGAAACACCTGCOGTCATATC^^
GCCGAACCGGTMTTGACTGCCGTGCCGTC^ 

O0CTTCTGCATAAAGTGOCAACATOT 
MTTCGGGATTTTTC&GCAGGGTTTO 
CGGGACGGOTCTCAAATCCGGTTCGCAC^^ 
ATAAAAAATAACCATTATCAA 

SEQ ID 734 

ERFVKLDKRPVFLVFVKHIVAPRFA^ 
ARIPVKHQRIGNLQMGIVGTRPARIL^^ 
AKmDCWVHLFRESPIAVLUiFRGLVDIWAGN^ 
PI£XKCQHLILPRLL*VNRQNHADTGNIV*DH^ 



SEQ ID 735 

TOCGOTCAAGGCCCTO^TOT 
GAATGACGGCGGAAAGATTCTATTTTTCCC^ 

SEQ ID 736 

LRFKALHVPKCHPraiGm^ 
SEQ ID 737 

TACOGCTTGAOTATTTAAAAfT 

SEQ ID 738 

MGGIiEAQSIYYFHRNGRPW^ 

SEQ ID 739 

GTCO^TCaXTK^CAATC^ 

(ECTGTGGGCCAGGCGTCAAGACGGTTCGGTAAAATGff 

AATCXmCCW^CCTGC^ 

CACGGCGAATGGCCGGAAAACMCACTTCTGCCGGCGTGG^ 

CCGGCGTAAACAAAGAMTCC^GGCAAAAAAC^^ 

CAAAGCCGACGCCGCCAACGCC&TCGAAACCAAGCACC^ ! j 

SEQ ID 740

VASASTIKGKYVQKVEVTNGVVTAQMASTGVNKEI^ 
HGEWPENNTSAGVASSDmO^ 

SEQ ID 741 

TTGGCCGAAGGTCAAAMT^GCCGTCACCGAGTAT^^ 
(XX3TTAC(K^CAUCGGCGTCOTACC^ 

SEQ ID 742 

LAEGQKSAVTEYYUffiGIWPKIM'SAG^^ 

SEQ ID 743 

TTCX3GCTTCAAGGCOTGCATOT 

SEQ ID 744 

LRFKAI£VPHCHPFKHGFYI/TCA^ 

SEQ ID 745 

ATGCAGGGCOTGAAGCGCAATW^TATAITATW 

TACCGOTGATTTATTTAAAATT 
SEQ ID 746 

MQGU3AQSIYYFBRNGRPRP 
SEQ ID 747 

<H«XXaC»GAC(X^ 

AGGCAAAAAAClCTCCCTGTGG(Xm 
AACMQKKIATCAACACCAAGCACCTCCCG^ 

SEQ ID 748 

VPTDPDSRIJIGNDEAILIAEGQKSAVTGY^ 
NNGIMROfliPSTCRDKHDAK 

SEQ ID 749 

GTOKXXXXX^TGTGTC^^ 
TTCCAACOmTCMTGCCAAACC^ 
SEQ ID 750 

SEQ ID 751 

GTMTGACCGAGGCTTACXX3GGCGGM 
TGGGGGTCGGCGGOGCGACGUJPATGGATTT^^ 

OTGATOTGcrKnrna^^ 
(XMCMCMTTTA^^
CMXrreGSGC ^^ 

CGTCAGTATCGGTACG^ 
ACGGTCATCATCACATCCGGC6TCGTOT 

GGTCAGGCTCGGCGCX^TTACGGCXJTP 

LVYIVISFRmVCRIGRWCVVDERFGrc 

SIX5GVFHAGI^VKSASRCKEAENDET^PKAWRDFKPCP 

RQmDHGSRGBU^AQQP^ 



SEQ ID 754 



m^CGAAACACaCGCGCCGATTT^^ 
CCGACATCAMGGCAMTATGTTCAMGC^ 
GCGTGAAAAOTTTCGGTAAMTGGTTCTGCGGAC^^ 

AATCTAAJUsAAACCGTTTO 

S ^T^raFQnSRCTVLPBSR(^QHFCR^ 
KHLPSTCRDBSSAVCIETPPTAPYOT^ 

COCTCTCGGOCAAGCGTGAAGCCGQWCX^AAMTC 

CAAAAWX&CACCAAGCACCTGCCGTCAAC^ 

AATCACGGCAAATGGCCGGAAfiACAACACTTCT^^ 

MHGKWPEIOTSAGVASPPPTSKANM^ 

GGGCGGOTTCGATC^^ 
TCGGTTGTTTSAAATATTGTTGGCGAAC&TT^^ 

SEQ ID 760

KSGENELRQM^ 

craxxmcGAc^^ 

AOCGOTACGACAAmTTACAGCACCaTTCO^^ 
ACCttGAAAAACAAAT^^ 
ACTGATCAACCCGTATTCCGCCOT 
CCGCACGACGGCAAGCCCCATTOmcra 

CJUAGCCGMMSCCA^ 

(yiAACGOTTGAAGGTCGTCCGAAAGAOG 

GACAT^TCGCATTGGCAGACATTGTCMCGAAT^ 

CCTTCACCA1OTGACCGOT 

CGCCATCAACAAATACGAACATTTAATGCAAC^^ 

GMftMyBTCOCCQAACAGGCGGGCTO 

CGGCGTATCCGAAGGTATCASXX^CTCCGCCX 

SEQ ID 762 

LMMPSEARFFRIiOTAFPRUCGIIRLYPLPftKH^^ 

IGYDmSTHSTENKQFSQDIY^^ 
Pffl)QCPHLQPEAUIKMKBWW3K 



EKV^GFBDFHKII^^ 
S A3^ 
TCGCAAACGGATACCGAAGTCATCGCACACAGCAT^^

TftCQCGCCGCGTCGCOTACCT^^ 

TOC(XGOTOTTCGAACCG^ 

ACTT(XGCAAAAATGCCAAAAGCGTGTI^^ 



TACGCCAAATCTTTGGGACACCGCCACAGC^^ 

(X5ACTAAA(X!ATTTACCA0GC^ 

TCCCGGCAGCGT6CAGCACGCTTTGAA0TO 

GAAGGTGOGTTGAAGCTAAM 

CCAACGACAGCCT(?P^^ 

GCGAAATCCGTAACCGTCGAA 

SOTOTEVIMSINHmQNGGRLFEAVQ^ 
^AStiKPGPHSHPMQmHEQPRAIAmAEVF^ 

BGAIiKLmTHHAEA^^ 
AKSVTVB 

SEQ ID 765

TTGAAAAAAAAGA3X3C(K5ACG 
GGJOTTCCCOGOTGACCG^ 

SEQ ID 766

LKKKMRTASVPHIiPFRIIl^ 

S 

AGCGTCTGTTTTTCGCCGTOC^^ | 
SEQ ID 768

SVCFSPSISMPKDFDLCKTLKAAKG 

TGCTGCTGCTGATCG^ 

GG(^GTGTTGGGAAAGTTCTCCCCGTGG 

HGATGTGOGAM 

ACCGGCGGGGCATOGGCTGTCGGTTTGGTTTATATO^^ 
CGCT^ATKXrTCTCGG^ 

TGGTGGTCMTATCQOCGCOCCGTTTCTG^ 
MKRFPDMVCTGGATS8DFTL!OT 

S |tggaL^^ 

nxXX3GTCAAAAAAAMGGCAATTACTTCOT 

MTCcx^TmAcma^^^ 

GCACAGGCATACOTCGACGCGCGAAATGCGTTGCC^^ 
AMTCAAATGCMCGGCAAACCGACATTbTiV^ l 'rI , 



cgcmxx^ca^ 

CCGTTGCAIT 

S ff^^TCCCCTCTCC^^ 
CCQCCGCCMCCr^^ 
TCCCGGCGtXXSGGQOCG!^^ 
CAAlGGCTGGCAGGATGTGTGCGCCCAAGC^^ 
GCAATCAAA6GCMGTTTGAA

CAAAGOTAOTCAAGCTC^ 
CTPGCCGGAAGCGGCAATCAGGGCCCCGT^^ 
CCCATCCOHTACCX^^ 
TGCCGGCAAACAGAAAACCACGGGATAC^^ 

SEQ ID 776 

VUCSPLPGYRNAARHHI^^ 
Q^DVCAQAFQTI^F^ 
AIKGRFBGSRFLPTC^ 
LAGSGNEGPVGAIiCTPLffiE^^ 

SEQ ID 777 

GTCCAAAACATOAAAGCGGCMAAGGC^^ 
GCCXX^GCT(mMX3TAAAGCCG^ 
SEQ ID 778 

SEQ ID 779 

GCCTAGGTTTOaCTraGACGGCATATGCG^ 

SEQ ID 780 

MISACPGFIRSRFGRSVMPSEPA(3U>S^^ 

SEQ ID 781 

AOOT^AAGGTCTGGTTCGGGGTCGTCCT^^ 
TCGGTTTGGTTGCCGTCGCGATGC^^ 
AG6ATG6GOTTG(HtGGTAT6GC6TTTCTT^^ 
AGamSGTTOGGCTGCO^ 
CTTGCCG(X^GGCT6C6TGCGCTGCA 
SEQ ID 782 

MPKVtfFGVVLASLMCLGIGYGIHSATC 

S^FGCRRDWRVDLSGRDAPESGE^ j 

SEQ ID 783 

GTGGCAGCGGTCAAGGATGGGOCTTGGTCGGTATGGCGTTTC^^ 

CGQCCGGCGCGGTAGCGGGGGTTTTGGCIQCCQCCGTCA^ 

TGCCOX^TCTGC^^ 

CTGCTCGGCG6MTTOCGGCTATACCGW 

TAACCCGTAAAGAGGAATTGAAA 

SEQ ID 784 

VMVKWPtfSVWRFDPWRAKVWR^ 
LLGGFFGVTIHPA^^ 

SEQ ID 785

J^GAAAATTCAATTTGAO^ 

AAGGCGGTOGGTCGGTAITOCCGTCAACGACA^^ 

AATCCTCAA»3GCGTGATGGTG(?rc^ 

CCGCGCXK&AAGCTGGTOT 

AGCTCGCGGCGAAAACCAmCXTTTACCG^ 

T(XWEl^GGAAAC<Xa^ 



GGCCAAAAACAAAGCCTTG!ITGGAAAAMCC6ATA(X^ 
CGCAAAGCATTACAGGCTCCTATTAACGCGCTTG^^ 

SEQ ID 786 

MRKFHLTALSVMLAL^ 
PRGXLWADSGFKETANEA^ 
SGVKBTAAKIMTOVEAI^mnAL^ 
RKALQAPINALAEDLAQLRGILGLK 

SEQ ID 787 

TTGACTCATCXjOGTITCCTTCMTATOT 



TACOGGTTWTGT 

SEQ ID 788 

LTHGVSFNIKAAGWASLHAV 

SEQ ID 789 

ATGAGTCAAAAOCAACCCGCACAA^ 
MCO^C(XmAAGCCAACACTCGC^ 

GGTAAAGCOTCAACCCCGACGGATTGACCOTA 
CCAACaTAAGCTGCAAAAMGCTCGTGC 
GCCGACGJVGGTOTATGGACGGGCGTGGC^^

CGCCGOTCAAGW^ 

CAAWoACAGCCATAJKXXjOT^^ 

TCTTGOC(XH3CG!KK3^ 

SEQ ID 790 

MSQl^PAQPTKRNIiFKTAI^^ 
(3(AFNPDGLTVTVGVGSSLPIMjRFGIi^ 
ADEVLWT6mNSU)EPEHAK^ 
GIiVFVCYQANIADCfflFV^^ 
SEQ ID 791 
TTGTCCCAA(&ATCC^^ 

SEQ ID 792 

LSQESAFSHAGCDEEIAAAEGADVFFQRFAVEEVLHEDEW 

SEQ ID 793 

TGCAGGAAGGCGATCACWCATCGGCTGC^ 
SEQ ID 794 

SEQ ID 795 

ATCKjMTATTGCGAACCTGAAGAAGCATCCGACCCCT 

SEQ ID 796 

MEYCI^EEASOFYATYRRANI^^^ 

SEQ ID 797 

mCAACCAOTGACCGACAA^ 

SEQ ID 798

KQPFDRQCXjFFVAVRPFAVSDACGH^ i 
SEQ ID 799

GTGCCGTCTGAAGCCGGTTTCAGGTTTCAGAC^^ 

SEQ ID 800 

VPSEAGFRFQTASQNKHPIRTANHVITPLPAMRIAIHL 

SEQ ID 801 

CAAACGCGATGCCGT^ 
TTAATTGACGTTGAAACCCTCt^ 
GCGTAACCGCCATTGTCGAAGAAAAAGACGCAAGTGCC^ 
CAGCMCMT(XX£MGGCGAATATTATO 
mCGCCAGCIXBCX^CTCGAACGCA^ 
GACAGGATGTCGTGATTGACGTCMCGTCGTCATCGAAG^^ 
TGCCCCCITCTCOCACCTCGA^ 
ATCAAAAACGCCGCCATCGGCAAAGG^ 
TGCACAAACACAAMCCGTCATCX3GCGACGAAff 
TATCGAAGACAACAAACTCGC^^ 
SEQ ID 802 

MPSETASIX^QENAMPQNTIM^ 
IJDVETLETLLEAAGNKVGIATD^ 
KRQIiAELERIFQTEQAQELIiRAGVTIiRDPA 
IKKAAIGKGTKANHliTyiGDAEVGSKTHFGAGTIlANYIX 
SEQ ID 803

(XX?ITIATTACCAAGACGGCAGCTACTCGAAAAAT^ 
OTGCAAAAACCAAAGTTOAAAACCGA 

SEQ ID 804 

MRKTFLIttlVSAALLSG^^ 

SEQ ID 805 

ATCCAAGAAAACCOjACCGTGTGGCTGTTCGAC^^ 

TCTCCGAATCCGCCGCGTCCGAC^^ 

TOGATCGATGCAATCCT^^ 

GCX^GCGCACTCGGTTTGGAAAAC^^ 

CCGMTGCTGCATTATGGTGGACXSACAG^ 

AAfXXJATAT(£CGCAACTGGCAC(OT 
SEQ ID 808



AGALGLENCFDALFGTDDFGLLYKPNPQAY 
"ICGACCGGGCAAACGATGCGfMGTGTCGACTJOTC 



. GATTATTGCCGCATOMjGACGG^^ 

GCGCTGATTATCCTGTTOXXXXSjTTCAAAGTCGG^ 



SEQ ID 811 

SEQ ID 812 _ „™™™ S KP™,PEmEHGAVSR^^ 

MDALHTIARNLT 



RGFATDRCRAARPPPIAVGKAAV^ 
C^TCfcTGCGQCM!TMMGM3QGTM3GOGA 



OTACGGTTTG ACGTiTl-riwiVAvavj 1 1 i^^^i"*— » 



QAVFFGFVAV 

SEQ ID 810 ,,, mnnrm — ..~,, P .. 1 . rrn «^^ 

T^GCMTCATGAACACGCGCACXOTCTACC^^ 
GCAAAAATACCAACTGCCCCTCGTTGTT^^

CttaAAGCCGACGAACCGOT^ 

MTACQAGTGACC£AAMCAG(X£GACCGAAT^^ 

CAAATATGATTCCGGCTGCGGCTGGCCGAGCTra 

GATTCGCACTTGGGACACGTC^^ 

GCGCGTOAAGGGCAAAGTGAAA 

SEQ ID 820 

MQTVHTGNIR4KHRTPFSLCWGCI^^ 

WXAGUTCPKLPVVTCH3CTIAQNIOT 

TENPSYEDVSYRHTGKAETVKV^ 

RKADEPLPGKTKAAPQGKGFDAATYKKP^^ 

DSHLGHVFPMPRIRGRIfl^ 

SEQ ID 821 

ATCXEGAAACOTACGGCCCt^TTTO 
AAGAAAAMCAGGAAACGCCGGCTCTCGAGGAGGCCC^ 

SEQ ID 822 

MPKFYGPEFQTTLPKnK^ 

SEQ ID 823 

ATGCCGTCT6AA0CAGGG6AAGCG6CTGAAAAAGCG(^^ 
CTCAAGAACACMGCTCGGTTGGGCTGCGCGra 
CGAAGAGCTGGAAACCGTGCT&OT^ 
GGCGCGTOAAAGAAGCCTTGTACGACCTGAT^^ 
CGTCTATCGGCAAACTCGCCAAATATTT^ 
TGTAACCGTCATTTCACAAA(X^C6G60GAITO 
CAGCTTCaTTTGATGGAA^ 
AAGTCAWaXTTTTGACGACGC^ 
CATCGG(Km3GGCGAAGGCATAGACGACC^^ 
SEQ ID 824 
MPSEAGEAAEKAVERVESAKI&TO 
GALKEAIiYDIiIKPLGKFIiVIiFGTKEFFVIML^ 

QLHUmKKVKRVLQmP^ j ' 

SEQ ID 825

ATGCCETCCGAAMCGGCAGACCGAA^ 

GCGCGGGMTGACGMTTTCGAGATTGCGGTGTO 

GATTAAAAATGCCGTC 

SEQ ID 826 

MPSENGRPNRHSHGSGNbGRGVWATTO SPRP PRACKRRAIKHAV 

SEQ ID 827 

ATGCCGTCTGMCCTTCAGACGGCATTWM 
AACACaOATCTCGAMTTO^^ 

SEQ ID 828 

MPSEPSDGIFNRPPFTGAGRARGNTO 

SEQ ID 829 

GTGCAGGCGGATTTAGCCTACGCCGCCGAACG^ 
CCATCttCCCCAGtfyTC^^ 

TATATCGGTGCGCGCGTCGCCTACGGACACGTC 
ACGGACATCTCCAAAAAAACAOTATCAAAAM 
CTACCACAACTGGGGACGCTTC^xAAAACACC^ 

SEQ ID 830 

VQADUYAAERia^PBPTAPGKNKISTVSDY^ 
nGARVAYGHVKiSIDSTKKITGTLTAYPSDADM 
SEQ ID 831 

TT 

SEQ ID 832 

LPSIAACMEEKRREEKRREEKRREEKVFCGLDSFSAPYSV 

SEQ ID 833

TTGCGCTTCAAGGOCCTGttTG^ 

GAACTOGATATAATCCGCCCTTCAAttTCAGTSAAM • • 

SEQ ID 834

LRFKALHVPHC^PKHGmTGAQSAPSFVNPTO 

SEQ ID 835 

ATGAAOTTCAAAACGACX3ACATTAT^ 
AGGTGCTGCTGCCCGGCAAAAAGCTGACCGAATOGAAC^^ 
ACCO1ACAGGGGCX5CGTTCGT0CAOT 
ACGCGCCTCAAGCOXrmATGC^ 
SEQ ID 836

tt nr-Timvun r omnrr ipBCRirsQMWCSFOTVBGMiSIilMDro



TBLKPLYMORC^BiSGRGDRVGViMBIjSHAFHVEIjMIj' 

SEQ ID 837 

RTGCAGGQCCrCGAAGCGCMTCGATA' 

SEQ ID 838 

HQGIiEWSnYFHBHGRPEPPCKPMDKPPGIiIESiVGHiniPEBIfn 

SEQ ID 839 



SEQ ID 841 




SEQ ID 844 

MELSLTOCFYFVPSFBWVWFAEVGG6SVE8CQF 



MTGCTTTCCAW 

SEQ ID 846 



TTCGraTTCXXXX^GC^ 
SEQ ID 848 



SEQ ID 849



SEQ ID 850 

LAEGQKSAVTGYCPNHGIWPENFVI PAKAGI 

SEQ ID 851


HaiETKHLPSTCRDEPTAT 



(SB 
<^GGAACA<mA(aCG^^

CGT(X5ACGACGTGTACGGCGACCXKIM 

SEQ ID 854 

HKLSEUWNEFAARHLSF 
ALLNFQQVCIDI/TOPVAGASLIJDEATAAAEAM 
IMSLVLLKSPAEL^ 
VFADALVSDGLKVVHEVFFI^^ 

GLKWWIOTEHGNVNIDDI^^ 

HTLTDIHSASAGQTSMAAA^ 

EPTESESKAELDRPIMLRSIRIIEVQKVITC 

SEQ ID 855

GGGACGCMTATATAA(KjAGT^ 

CAACAGGCATATTATCGACGGCGAGGATGOCGCT 

SEQ ID 856

GTQYIRSKYLFFY*RINLLKIPIjIjKPIKH^ 
HTVCRLV*K*KIGSIAHICB^IKmiTESTBAIKBH 

SEQ ID 857 

AACGOTGCGCGAATATA2TFG^ 
CATATCAT 

SEQ ID 858

NGCmPV*PCICIS*IMVNKNNNINQHVA!^ 

SEQ ID 859 

TflX&GCAAMTATCTTCATC^^ 

SEQ ID 860

LQQHIFILCRADGMLGQffSIJLKTPKKR ; 

SEQ ID 861

GTGCTTCATCACCTTA6GGAATCGTTCCCOT 
mCCATGAAAAAMTGTTTGTATTGTTCTCT 
GGCGAAGGCGAAGGCCATTGCGGAATACCGCGCGCA?^^ 

SEQ ID 862 

VLHHLRESPPLSRjGGATPYRFIiLIR^ 
SVGAPAPKGPAKPVIiPAVKK 

SEQ ID 863 

GTGGAAACGGATACATOGCAAATACTATTGAAAAACATAGC^^ 

GGGCGAACAAATCMGCTGAACCOTATTTGAT^^ 
TACG(XATCGA<^T<^^ 
A6CGTTOTTGAAAATCCTCAAAC(XMTC6AAA 
CXIGCAGCCCGCCTACGTTTGJUATTGATTT^^ 
TTAGGCGQCAATTTGAACAfiCGCCAT(X?^ 
M&TCGTCGGACATCCG^^ 
CGACAGCGATGATTTCCCCGACGCATTTCAC^ 
SEQ ID 864 

YGimAI^ELNAPEIPIMDGSSLPFiyiiLQDAGVVDQKAQK^ 
LGGHLNHAIVIGDTDVyffBGLRYPDEFVR^ 

SEQ ID 865 

GTGCXjGAAACTTATCGGGAAAAAAGGTTTCTTGAGATOT 
GCCGTACTGTCCGOGGOTCG(XGCCTTGTO 
SEQ ID 866 

SEQ ID 867 

GAATC 

SEQ ID 868 

LRPKALHVPHCHPFKHG7YLTGAQPAPSFAEQ 

SEQ ID 869 

AT6CAGGGCCTT6AAGCGCAATCGATAT 
TACCGCTTGATTTATTTAAAATT 

SEQ ID 870 

MQGIxBAQSmFHRKGRmPCKPUa)KPPGLim 
(XXGTCAACCTGCCGCGATAAACATGATGCCAAA
SEQ ID 872 

STCROTSSAK 

TCA1TCCCGCGAAAG0GGGAATC 

SEQ ID 876 

UAFSVHKHFPISSVWIHIimTIfl^^ 

TCGCGG^TGACGGGATrAAAGTTTCAAAATTTATTC 

SEQ ID 878 

MFGQKimiPAQIVGIHVaOTiaiSGm^ 

S toS!a^^ 
atcgggtaaaacggtttctt 

SEQ ID 880 

UIESa^ILSFBffliSGAT2TAPPS?PPJ«ESRPIGAETYHVKSFL 

AC^TTTCTATAAAAATACOCAMCCGT^ 
GCXS3GGAATGACGGCGGAGCG6TTTCTGTTGCI^^ 

lAPYKHTQTVIPTKAGIQHAKSKETVL^^ 
S GTOQ(S , rcCG0CTCOa(aTCAAA!GGC^ 

caamtoacaccaagcacctgccgtcaacctgccgc^ 

Stcwxxsgacagcc^ 
ttgccgggtattaoc 

S ^a^hjhgiwpkiotsagv^ 
ugh 

S 5goxK^ 
ggaacgcaamtctamgamccgtttfcccgataagtitc^ 

SEQ ID 890 

MPDGLYSGLmRTRKRSRCGmiOtKOT^ 
SEQ ID 891

ATGTGGATTTCGGTAAACACGCGCGGCGO^ 
AGCXEGOttCXXXXa^^ 
TCOTOTK5CATO 
SEQ ID 892 
VQJLCKFAQHNRSRT^ 
SVYPF 

SEQ ID 893 

TCCGCCTKftTmATC^^ 

SEQ ID 894 

Sk* PWSQAVCLYNTAFRKGVGISJVPLVQ 

SEQ ID 895 

GATTTTCCGACTCCCAACGCGTTTGT^^ 
CX^GCOT^ATCTITCGTGTC 
TTGGCTCAAAAATCEGTG^^ 
CCAACCCGATTGAGAAAGTGTGGGCGAATATTAAGCGCT 

SEQ ID 896 

LYGSRRLPCAQTDADGAQKKTAAH^ 
IAQKSVnSENARFRRMGALRCT^ 

SEQ ID 897 

TTGTTOGGCAAATGSGGGGGCGGATTGCGCJGC^ 
GACGACXXCGCCCGCCTTGCAAACCOTAAAAG^ 
SEQ ID 898 

LFGIWGGGLRACQIKTVFKRVMRyMQGIiE^ i 
SEQ ID 899 

SEQ ID 900

VASPADmGKYVQKVEVAKGVV^ j 
SEQ ID 901

ATGCTTTCAAG(WXn , AAACAATGAAATCAM 

CAAAAAATCACXXXOTACO^ 
(Xy^GGOmXJmCCGCCCAAATGA^ 
CGGTTACGCtf^CGAC^^ 
C(XmTTTCAMCAOTCCAAAA^ 
SEQ ID 902 
HLSSGVNNEIKGKKLSI^^ 
AKGVmQMNPSGVNNEIKDK^^ 

SEQ ID 903 

TT(^TAAAATACATTTGCACGGCCGOT 

SEQ ID 904 

UfiOHiaKaLGIJAGILKHRPKKSA^ 

SEQ ID 905 

SEQ ID 906

I/miFRRANLTGRANVFYSIIPAFKQTBASQ 

SEQ ID 907 

A^raTCGTTGCCGGGTATOCCTGAAT^ 

CAAAAGGOTCGTCACCGCCGAAAT^^ 

GGTTAAGCG0GACGC0GGC(X^^ 

TGCACGAAACACCACGCGCCGATTTCAAA^^ 

«rCTGT(^GCAC(K5AAAOT 

SEQ ID 908 

MNHRCKVLPBSWtMA(&^ 
CTKHHAPISNTSKK5AVAGYCPKHGTWPKNTO 

SEQ ID 909 

TTCGCCGAAGGTCAAAAATCAGCCGTTAC^^ 
MOTAAAGTCmAACGGCGTCGTCACCGC^^ 

GMTCA3OTTGCXX^TATOC^ 

SEQ ID 910 

LAEGQKSAVTEYYLNHGEOT^ 
ESSLPGIA 
SEQ ID 911

GCGGCGATTICTTTGTTGCCTT^^ 

TTTTGCCTCCGTTTTGC(KjT^^ 

TOTGG ( nX^AGCTGTTTCAC 

SEQ ID 912 

SUiPAADW^QIPLFPraOiL^^ 

SWFKLPH 
SEQ ID 913 

CGGCGGCA6AAGGOGCGGCCAAGC6(X3GGCCCAG^^ 

•nSACTGCCGTGCGACGGTTACGGCGAAACG^ 

GTTCA6ATGTCTGCCGAAAAATACTGTTCGG 

SEQ ID 914 

VQKSAm(^ASIMLGKAAKITHSFEXAGAEK 

SEQ ID 915 

ATCACCTCCCTGATOACACCCT^ 

CACCCTGCTOTCCGTCTO^ 

GTTCCACGGCAGGCAGCTTGCCOTACCG^^ 

AAACCACAGGCATACGCC!fCAA(XX^GGCT^TCG 

SEQ ID 916 

MTSLIHTIiSDGIELTVEIimRA^ 
LERQAHSYMPRLERHARATQLFPASSSLTSAKTFWGVCRKTO 

GGGTCGGCATCCTCMC^rcGTCGCTC^ 

AACGCTCGCQCTCCAAGAGATTT^^ 

AACCGCCGCGAAGCCAACGAACAGCTCATAAAACA(^GTTTGGC^ 

AACTCGGCGQCGCGCGCATGGOGAAMTGTTTGAGATGGAC^ 

OtfCTGTCCGACC^^ 

CCGTCT6AAACCGCA 

SEQ ID 918 

ffillWIIJmiLCSSLIFIfPFMLIASPFRre 

NRREANEQLIKQGLARKNEGYW 



SEQ ID 919 

GTGATGAMCTCATCATTCTCGACCGCGACGG 
CGCAGGCAG(K?PGCACCGTTGCCG!ICGCCACCAA^ 
CGGGGAAATCGACGGCATCTG(nVCTGCCCGCAT^ 
ACTTGGCTGGTC^^ 
CCGGACACACAttGGmTCGATA^ 
SEQ ID 920 

mVGBSUU)LQAIEAVG^ 

SEQ ID 921 

GTGTCCGGGCAATTCGTGTCCGTGTTGGGAGAGOTT^ 

TTTCCCCGCCTGCCTGAC(MSA 

TTCATCACOTl^^ 
SEQ ID 922 



clrqschrihaafdryplvggfde^ 

SEQ ID 923

ATGtH'TGCCTATGCTTTCCPATIT^ 

TGCraTOTTGACGGCGCAGT^ 
ACGACC&GCGCGAC^ 

GTA(X3GCATGTCGGATGAAMGCOC6ATATTGTOGT^^ 
GAAGATACTOTATTTTCCTCACCTTO 

TGACCATTTCGGOT^AATCTGATGTTO 

GAAAAACOCCATCCCGATTTGGAAAACGT^^ 

ATTOMTCCGAAAAtXX^ATCCGCGOT 
mTTTCX^mTCATCAGGTftCC^

CAGOGGCAGGATTTCCTCGAC^ 

G(n i TGGAAfiiOGCCCGAA(XX3GGAGTT6GTCX^ 

SEQ ID 924 

MVAYAFLFliFVTAAVLLIVRSHVTOT 

DSLQIPWRWAGAVIJMAFVSV^ 

EXJTVFSSPIiKVHTFGC&TWK^ 

EKRHPDLENVRQPMFVmTMKEHGPYim^ 

QRQDFLDLAFAGGVLMEAAGLEAKDG^^ 

SEQ ID 925

ATGATGFTTTCTPGGTTC 

TTCAACGAAATCCCCGTGCTGC^^ 

SEQ ID 926

Miff SWFKLFHIJFVISWFAGLFYliPRIFVNMAMro 
FHBIPVUMAAIjYLWPKPF 

SEQ ID 927

AT6CCGTTTGAAACCCGCCGGC6GC7TCAGGCG(XA 

GACGCAACCATAAAGGMCMTGATGAAGCTCAAAA 

CATCACACCGG(XX3TTTATGCGACGAAA^^ 

AGCCGGCTCGCTCAAACCAGAAAAMCAGTGC^^ 

AAmO^CCTGCTCGACC(mraX^TACGCCGC^ 

TCCCCCCGAAAGAATCCAACTTCGCCCC 

CTATOXraTCGACCGGCC^^ 

ACXnCGTCGGACTGGGCITCTGCACCGAT^ 

OTAAAAMCAGGGCGGACACCGCCTGTCGjA^ 

SEQ ID 928 

MPFETRRRLQAALPQQTATVFR^ 

SWiAQTRKNSAmUCHHADYADLLICGGTHL^^ 

LFPSTGHTRKGLELLADFFEQTEL^^ 

IjKKQGGHlUjSDPMRALNyNPSIjSHHIDRLTDML^ I l - 

SEQ ID 929 

ATGCCGTCTGAAAGCmTTC(XKCGG^ 

TCTTCGGCG<£GC(&TGAraTnK 

TTCCATGCTCTCCGGCGGOTK3A^ 

TCTTOTC6<^TCCTCGCCTOT 

CSATCCCAAACOmX^TCC^^ 

AGCCTGCTCAAATACTTTAAAGGCX^CGCGCCCAM 

6CAACGAGTTOKXXXXX?IK^TTGC^^ 

MTCGCCACCTCX5TTTTTA(X^ 

TCGQCGCAGGCAAAACCTATAAA(?m 
TAAAGGA 
SEQ ID 930 
MPSESItfGRRGMmiQTm^ 
AVAPVLYLLTYAyiFVGGDLTAKGIGSAV^ 
SLUCYFKSttPKVARSmm 
USCLLFPTGrnAItXTO^ 

SEQ ID 931 

TTGAGTGGGGWX3TTCGTC^^ 

TCAAAACCTTGCACGACATCAATCCGCTGC^^ 

GCGGCGCGGCGCGGCGTTTGTCAAA 

GACCTTGCCOJGGCG^ 

T^TGTTTTTTTCCACCATCM 

TGCXBAGCraXXX^ 

TTTGCCTGCCGTCCGGCGTTC 

SEQ ID 932 

LSGAVRQHGGKMSDKKYNVDEGEIAKFSRIADKWWD^ 

DLAGABPHSFDVVTCMEMEHVPDPAAIVRACAa 

FACRPAF 

SEQ ID 933 

ATGCGCGGCTTCCTGAGOXSmC^ 
TGTTTGAAGTGTTGAMCAGGMGAGCTGC 
OTGGQGGGCAAGCCGGCCTGCCTGGT^^ 
GATTTCCCTTIWAAATGGAAAACC^ 
TGAAGGACMTCTCGGCAACCATOTCCTO

AGTGC(^6AGaAAAGGCGTATTTCCC^^ 
CGAACGCGTTCXIGCAACCTGCTGTTG^ 

SEQ ID 934 

MRGFLSGYDLGEFVSLQGIAQGnWSKYFLTWSGRYV^ 

DFPLBCENPKTOAWWTE^^ 
SRGEKAYFPTAQRAKIBFWVSiailiDFHP 

SEQ ID 935 

ATGCC(mxmACOTTT^ 
G&AAATGAAAATCCMCAGGCGCGACAC^^ 
ACGCCTTTTTCMCGCCTCAT(XMOT 
TACCTGACXBCC^ 

TCGTCTT(^GACAGGACGG6CAGCAGCCGGGCGCA^ 

CGGCG05<XAACCGC^^ 

AGAJ^TAATTGCTGTTGGTAATCCCCTGCGQA 

TATCAACTCACTTOT^ 

CATflQCCTTCGTATCCXXEA^ 

GC(X3CCAQ&T(AGATCCAI^ 

OTCCATCATTTCCATGCAC(OT 



GGCGOTE 
SEQ ID 936 

HPSETLSMISVKRM^^ 
YLTAVKQKIVFKQVGVDDARRQKVAEIV^ 



HTLRIRHIQARLTAHPRQIjGRRD 
GGFQRHiRHVDTFDKERAAPRHAFRH)AAATAYI 

SEQ ID 937 

AWaACTACGCATTTTGTOGTTAC^ 

MCCTGTTCCGCCCTCmCGCG^ 

TICTGAACTGOGTTGCCTCGGAAAACCTGOT 

SEQ ID 938 

MTTHFVVTGVCGCX^TTMPSIjQKHLGQCPY 
OiYMKAGVLDSQIiEILEEI^GEYGVKIANOT 

SEQ ID 939 

TTGAra&GCATOTATtt^ 

AACGCACCATCCGCATCCGCATTGCCGGAAAACGGGCGT^^ 

GMGGAA^TGl^MQOT 

GfcGTTGCCCGCCGAAMTGCCM 

CCGTC 

SEQ ID 940 

LTEHWIEIERRFLIENBKWRQ^ 
ELPAESADFDRPDW^^ 

SEQ ID 941 

ATGGACAAAAOSGCAAA^^ 

(XXXSGAATATTGTTCAGGGA^ 

TA3«rCAG<XX^ 

Af^CCGTACCGCCGGATTCGAAACGGATATTGTA 

SEQ ID 942 

HDKTAranua^^ 

IiLGFALARFAVGYMVGEFGFRIQFAPAYWIVGRVQGM 

SEQ ID 943 

AT(KX&TCTGAAACCCMOT 
ATTTTGACCCCGTATTCGCCTTC^^ 
GCGGACTGAQGTGGATGAAQQCXSGCTTTGCX^T^^ 
CAflXa(nX^CA<&^ 

AGTQCGGACAAGGCAAAGCOGA 

SEQ ID 944 

MPSEHIPVPGFRRHIASKQ 



SEQ ID 945 

CGGCGGCGGCAATCATCCTCATTCTGATT^ 
CGTCAftCGRCGTACTGGTCAAAAlCTO
GACQCGCTGATCCGGAT6TTCXjGC(3AAAA^^ 

GAACTGCTCAGCGGCGGCACGO^AGA^ 

<£CTCATCA«£AAAAAC^ 

O^A(XXXj6CGAARCCGGCAGCAC^ 

OT<^GGCTGGTCGGCCGCCTOT6GAa 
GTC 

SEQ ID 946 

VbSFSHNLKKPDHFVLU£H^ 

EALIRMFGEKRAPFAPGVASLIFGFP^^ 

EUjSGCE^SDPPKEPARACTWAVJ&IP 

GI(XALADSMM}LGIPVIiWm^^ 

V 

SEQ ID 947 

TTCAAATATACGCTATGTGG^^ 

TmTCGCGGTMTGG^ 
SEQ ID 948 

SEQ ID 949

TT(?RmXKm3G0G(m , ATGAa 
CTTTC(KXX5TGCCTGTCa 

QGCTCTGTTC6GGGCGGACGGGCT6TTGTGG06C6GCC6G 
OCTCAACTGCCTGCGCSC^^ 

tcttcctgtattgtttttc 

ggcgctggtgtggct6gt6ttgggg6taacggcggcggca6g6t1^^ 
gtcgg^tatgtattgctggcat™ 

M&GTGAAACGTGGCAGGCAGTG^ 

ACTGCtfCTTTTTACOTTO 

CCGOTGTGGCAAAAGATGTTTTATCGGCCT^^ 

TCAAACCGGCGTT(XX3GCGCG(^^ 

TTTGGGGOGTGCGGGTGAGGACAATTATGCGCGGGC^ 

ACGTTA 

SEQ ID 950 

I^VAAYDGLAWRAVLSI^mKRIAWTVFQA^ 

AQVPAARLQTARTLGAta^ 

VGEYVLLAFSVAVLSVCCOT^ 

PFVAKDVLSAWDALPPDYGRAAAGLGANGFQ^ 

TIi 

SEQ ID 951 

TTGCGGTGTGCATTTTCCTGCTGTTGGACAA^ 

ATMTCAGAAGTTTGTCGAGTCGGGAGAATATGAA^ 

GTTCXXOTTGAAAAACGGC^ 

ATTATGGTCATCGCCCATCA^TTGCOT 

TCGACCTCGACGGCTGGCTGACCGGTTTC^ 

CCMCCC^ACCGGO^^ 

SEQ ID 952 

LRCAFSCCWTAKAiiKjRKKYNAN^ 
IMraHHDCGMQGliNAEEPljGRV^ 

SEQ ID 953

ATGAAGAAAATCXJGACTGTTCGGCXjGCACOT 
GCCCCTATCACAAAGACGCCGC^^ 
AACCTATACTTTTGATACCGTCCA^ 
GTGCGCGAAACCAATATCGCCGTCGCCATCXXSX^GG 
CCCCGATGCATMTGTGTCGTCAACGGAAAT^^ 
SEQ ID 954 

MmGLFGGTPDPIHNGHFHIA^ 
VRBTmVAHRQGDSLH^^ 

SEQ ID 955 

ATCMCGAACAAGMCTGCAAGACCTGCAAAAAAIXXn^^ 
(XAGAATGATTATCG0CAG0GG0GA£^6T^^ 
ATC^CGTOGmATGCAGGAGACCTO 
AAGCCGTGGCACGCTGCAGAC 
SEQ ID 956

KPWHAAD 

SEQ ID 957 

AGCTO&AAAATCCC^^ 
CTTCTCTCCMTGCGTCCGGTCTAT^^ 

OCGGGCAGGGGTOGACQGGCAGGGA!!^^ 
(XXKXXmTGCACAGGTTTTCCGCi^ 

SEQ ID 958 

SWKNAV* SGFRHHLPOGYPI&*CTNVY^^ 
PGR6LTGHDSHVPHPIMT6LPKV*AIiKNAKSSGMCQ^ 

SEQ ID 959 

ATCGCATACTCTG^^ 

A3CACCTTAG(K^rcGTTCCCTO^ 

TOiftGAAACACGCTnAOT 

SEQ ID 960 

MAYSADLRNKALIOISGLTKIRTRRRAAGST^ 

SEQ ID 961 

ATGaCMTTGAAOGCM 

ATAAAGACATACGCAGGO^ 

GCTTGCCTGGCAGTTGGC(XjCAA^ 

AAACCTATACGCTGAAGCTCAMTCGGTGCAGATC^ 
SEQ ID 962

MTFERSGNAYKIVSTIKVPLYNIRFre^ 
GGVETEVVKyRVRRGDDTVTYFFAPSLNNIPAQ 

SEQ ID 963 

<H*mCGG(TK7^ 
ACCMCGTCCATOGCCGCTGTCTC^^ 
TMTCAMXTOXy^GCSAAff 
TTGTTCGTTCATTGTTTTCCTAAC^ 
SEQ ID 964 

LFVHCPPNGMPSDGIIASPY 

SEQ ID 965 

TTGAACAmcamrn^ 

^WAOGCGGCGCGGGCGTG^ 
CTCCGTCGAGCTO^^ 

SEQ ID 966 

IiJITVLAVGTKMPRWVDEAVAEYAKRFGRDAAYAFKEIKPE^ 

RI^aTLPHQIVmi/FBQLmVSILHNHP^ 

SEQ ID 967 

ATGGCACGAATOTATTCTGCGT^ 

CGTGGAOXXmCCAG^ 

CGTGCAGGGATACOTCCGCAA 

SEQ ID 968

MARHVFCTOjNKEAEQ^ 

SEQ ID 969 

ATGCAACA<£ACCHTIACGACT^ 

CmTGCCGGTTCAATm 

CGTTACCCTGCTGATGGTATTC^CATTCGCT 

GCCGTCTTTTTMCGATGTCCGCATO 

T^ATTCCCGCACTOXXri^ 

ACmCACTCTmTCTCGCm 

SEQ ID 970 

MQHDVYDYTAmSKN^ 
AmTMSAIARRTRUmi^ 

SEQ ID 971 

GTCTATTCGGTM^ 
GGTTTGCATGGTGTTTCGCCT^^ 

SEQ ID 972 

VySVTPDSSRLHSFLCPYWASVLYAQNKMPSENKCFQT 

SEQ ID 973 

Araraoci^qra^ 

(XaTCAATCCa^^ 
TTACGCCXXJTGAGGTAGATGCGGGATTCAT^^ 
TTCCTaTKXXECGC^^
TGATOCAGMCASX^CAACAACAATCOC 

SEQ ID 974 

MLEHTPWUWttlSSFDPPT^^ 
FI^PREIAEVSSTMIKGLVGPEXjWMBTVKRyVPP^^ 

SEQ ID 975

TTGTATATTAJTCAACGGTTTGAAAATTi^^ 
.miff 

SEQ ID 976 

LYI IRGLICraHniGinUnCETjSPQGYIAFR^ 

SEQ ID 977 

TTGTITnMTTGACTATAAAACAAATGGOT 
TATTCACACOCATT 

SEQ ID 978 

SEQ ID 979 

GTGTTOGlOSCXmAOCOGQGCG^ 
CAGCCGGCAACGCGGCAACCTOTT^^ 
GCWTCAGAATCCGCATAAAACCTGCXAAAA^ 

SEQ ID 980 

VFDSQNPGGKIVCRIAVKn^HNPIjRYlfi^AAIQIiGW 

SEQ ID 981

TATCCAACGOTO^^

GACAMTCCACTTCGGTTATCCGCAAAT^ 
AACFTO(XACCGTOCCGCGCCrGCCGCGQ^ 
GGCTfTOjCCCAAGCTAT^ 
ATATTTTOraCAAmTGCCGTCTGAA^^ 

SEQ ID 982 

SLEV>MECICNAFAAIffAFRPI£re 

GFAQGIGFGKCEIFGCVPRHSIABPDAQVFRTB^ j 

SEQ ID 983

GT6TGTTATCTTCATATTTCGAGGGTMCATATCTOT 

ATMTTCTGTTGCCAGTGGAAATAAATOT 

TGCACTAGGTOTAAAGCA^ 

AAAACAACGMCACACTGTTAGATGGACCGGCATATOTGOT 

AGCAAAATGGAATQGCACTK3GC^AGAATCA6AAQCT(^ 

AGAGGCAGATTTTGCGGTGGGKSTGGGGAA^ 

GAAATAAAAGATGGTGTGAGCMTTCTGTGGCA(XXK^ 

AAATTGTAGGTGTTGATGATCGTGTCAATGATTTTGATC^ 

TC(XAOTCGTCAAT^ 

CTACAAGTGATAAGGTGCTTATGGAACAGTTGATGCAAGA^ 

SEQ ID 984 

VCYLHISRVTYLLIWYSPKKYAVAI£^ 
KTTNTLLDGPAYYADAIAVGYQAEAGKGGAIAliGK 
EIKIXWSNSVAPCTGSVASENNVVSVAYKETPQS 
SHPVNDRfWVKVGG^^ 

SEQ ID 985 

ATGAAGATMCACACTGCAAATTAAAf3AAAGMGTACAGA^ 

ACGCGGTCGOGQCGCGGCAGGAAAAGCGGTTGTCTTCG^ 
AAAATCATCXXmCGGTATTGmATGCCGATA 
ACATTMCGGCATTGGGGATTTTTGGMTCAGGC^U^^ 
CACACCGTCCC(XX)AGCTAAAAArcCI^ 

SEQ ID 986 

^THCKLKKEVQKEPLRSFVPEVTARSA^ 
HKPDGmADSPGSRGK^^ 

SEQ ID 987 

GTCAACGTTCAGACGGCATTCCGMTTra 

(XACACACCCCATCCTCGACAACCm 

GCTTGAAAGOGGCAGCATCGCCTAOCGAfiGCAAAAA^^ 

CGCGACCTCCTGATGTTCGGCC^^ 

TGACCGAACTCTCOXXXWACAA 

GCTCATGCAGATOTGCAAAAGOT 

GCXXSCCCTGACCG^^ 
SEQ ID 988

miilFWPYHQGRPTAECRRIVNGAIEKF^ 
AMflOTEEVFTraHIKHj^^ 

SEQ ID 989 

ATGACGCMGAACGTTTACCC^^ 
CCGTCCGCCTCTG(X3^ 
GCAGGGAACCXX^ 
AACCTATMCClTOffi^ 
(XXjACATCA'IG(X£AAA<£G^ 
CGTCATCGTCCGCGAACTG 
SEQ ID 990 

ffl,LSF(»GEIN(m,T^^ 

SEQ ID 991 

MtXTGTCTGAAAAAAMM 

ACTOGTCTTGCACCTGCGCCmCC^ 

TTTGGGTTTCGAITCGC^ATC 

GGCOXmCTGCTGCTO^ 

GCATGATAGACCCCGAAGAATTTACCGCCGCGCAGGC^^ 

TTGGCACGAi£GCTAC(XX?I!CQGAOT^ 

ACGOX&CCCXramS^^ 

GCATCCTCTOnomGAC^ 

AAAA 

SEQ ID 992 

MPSEKNIGmGSSRPIJlVAFM^VSCIIiFM^ 

GGSUiLmURQGGRDI^ 
TATAWGPVSFFGLIMSLA&&ffSP!^ 

SEQ ID 993 

ACCrGCGCX3CGCCATACGTTTO3T^^ 

CTCCCTTCX3ACCAAAT(mTO 

CGTQ(XrPGCAATC6ATCATCA(X^^ 

GTCGGGATTGCCGCCGGTATGGACAATCGCGGAATGCCC^^ 

(XGTTTTTAAAGCCGACGGGGCAGGACAGAC^^ 

TCATATCCAAAAATTCO^^ 

CAAACCTTTCCAACCCACCGTCGTTC^^ 

GCAGCTCmAJK&TGGGCM^ 

SEQ ID 994 

TCAKH1?CVRTRQQCQQFPSRAPTD1M 



QTPPTHRRSHIiI£INAH^ 

SEQ ID 995 

ATGAAACAACTCCGCGAC^CA^ 

GCGATGTCGATGCCACTACGGAAGCGGCAACGCAGACCCGC^ 

OTTATCCMTGCX^CGCGGCGGACAGCAATGTGCCGAA^ 

(XXXraXXmAOOX^^ 

AGTOTPCGAAGCAACCTGTCAGGTCT^^ 

GCCCTTGAAGGCTOT 

SEQ ID 996

MKQLRDNKAQGSALFTLVSGIVI^^ 

GFHM©AKG(^im)QBI^ 

ALEGFNAKPAKGGNAGLSDDEVKAAVDYMANQSGAKF 

SEQ ID 997 

ATGCCCGAAACAGGTTTTCGGG^ 

CCG(&CGCG<X^GCAATATGCAGG(^TC^^ 

(XX^ACmCAGCCTCAATCATAAAA^ 

CTGACXXOTGAGTTCTGCGCCCATTA^ 

CCGCKTFGTACGATTCA^ 

SEQ ID 998 

1/ITEFCAHYEGRIiMNIH^ 

SEQ ID 999 

TOTTTCKX^TCAGCOTGTCGGTO 
TGACGGGCGCGTCGATGGCGGTGGCGG(X^TGATTA 

OCTGCTCCTGCCTQCaK^^ " *" — * ~ 
GTGCCOT^TGGG^^

TGCTGGGGCGGTACGA^ 

(EGGACGGCCXHSTT^ 

ATGOKXiACAGGCTGCGCCAAAGCCTO 

TTTTT«3TGTGTOGGTACGG^ 

SEQ ID 1000 

I^VSLSVGIADFRWSDWSLSDSQQVMFimPRTPAIVL 

VPLVGIIFGGVVEAWFVAYI^^ 

MGDRIiRQSLPAVALLGASLVI^^ 
SEQ ID 1001 

TTGGCCAGGTTGAGGGAAMAGGTTT^^ 
SEQ ID 1002 

LARLREKGIiAVINRREKRVGVGDSN^ 

SEQ ID 1003 

ATGAAAOGGOGGCATTCGATGOCGTCTGAAACA^ 

TGCCCCATCCIOT5CCC^ . 

SEQ ID 1004 

HKRRHSlffSETRMQTASCVSDGIARYGRYRGNQGKR^ 

SEQ ID 1005 

TTGCGTTTCACCGCATOACCCTC^^ 

CETCCOTIACXS^^ 

GTCCGTCGATAAAAA(^ 

ATOGGCAGCCGCGCCGCCAAGGCGTT^^ 

AAATCTTCGGCAAACAGGCGGAAGCCGACAAGCK^ 

CGCXXmAGATGTO^ 

TTCXMTACTICAMGAGAAAAATCCCG^ 

CCGCATGGAAAAAAGGACAGGTCGTTTACCTC^^ 

SEQ ID 1006 

LRLTALTLCTVIALGACSPQ^ 
IGSRAAKAPIMEIAPTIEMTADTAraXESAKER^ 

SEQ ID 1007 

AAATTATCAGGACAGATGCOGTC 

SEQ ID 1008 

VQRmVlOOTGCLQM^GAKPPVGQTGTAI^R^ 

SEQ ID 1009 

ATCGCTTTCCACCATCACGCCXA 

ATCATCAGCTK3TCX3OTAOCCCTGCCQCAC^^ 

CGGTATGGACMrcGCGGAAlKKXXIGGCCTTO 

GACK3GGGCAGGACA6ACCGCTTGCCMTTCGC6GT^ 

TCGGTAGAGGCAGGCATGOCCATATTATTCAGGGACA^ 

CCAOETOTTCTC^^ 

0»ACAC<XmSATMTGA^ 

TGGGCAATCGGCGGCAACAACTCTPTAAC^^ 

ACCTTGCACAAACMGTGCAACATATTACTACAMGC^^ 

SEQ ID 1010 

mafhhhahdimvlpq^vgnvumfhlpgw 

dc^gqtacqfavdlafgrpr^ 

rtrpdndqqppvpavnqigkfl^ 

SEQ ID 1011 

ATGAAOCCXXXSmmXOTCAtKX^^ 

AAGQCAGCAAAATCCGTAOCAACATOGTTACGOT^^ 

GTCOCAATTOCTGACGCTGOGCGGC^^^ 

GTTAAAGTCGTTTCCGTACAAAAAGGCGCGG(7PTC 

ACT«3GGa3TGCGOTmCAGC(X^^ 

AGATTACXSAAGCCXSGCAAAGGCT 

CtXMTCCTATTG&G^ 

AAACCACACMTCCAACACCAATTTGGCCT 

CGGCTAC(KyySGCMTGTAAAAGGCCCC^ 

CATCAGGAAATCAAACCGCMGCGTTP^^ 

GTCTGACCAACOCGAOCAAAACC(^ 

CAAAAC(X3TTTCAAGCAGCAGCCTC^^ 

CTGCAAACCCACGGCAAACGCGGCATCATCTC^ 

ACTOXXKEAGACCATCA^^ 

CTACCGCACCGOCGGCCTGACCGCC^^ 

ACTT66ACGGCCT0OTTG0CTA€CGCTT^^ 
(XmTTGGAAAACGTT^^
CTACTATCCGCACAGCCAACGCPGffl 
SEQ ID 1012 

VKVVSVQKGAGSASAGIGATNGWIAKTV^ 
RIVLSHMKDQHRGtt^ 

HQEIKPQAmSQraEDKKDATEEDKKKKRENER^ 

TWASIAYRFKNPNLKIGWRGRyVQKATGSII^ 

SEQ ID 1013 

TTGTTMTCCACTATACAJrTTCCGM 

SEQ ID 1014 

LLIHYTFPTKPVNKKQR^^ 

SEQ ID 1015 

ATCACaTCCGTCCTTTACACGAC^ 
GCTGTTGGTAATimmGA^ 

SEQ ID 1016

SEQ ID 1017 

ATGCOTCTGAAGGCGGTTTGCCGT?K^ 

TGGCCGCTGTATOOTGAAAATCftTTT^^ 

0G6CGGCCG(MCC0GGCA6GACGATCCCAGaG6(X^ 

SEQ ID 1018 

MPSBG(2£F(PASCPYIi!TND^ 

SEQ ID 1019 

ATGGCAGCAAAAGACGTACAATTCGGCAATGAAGTCCGC^^ 

ACCGOCTiroX^^ 

CAACGACOTAGCCGGCGACGGTACGACTACCGCCACCOT 

AAAGCCGTTGCWCTTTGGTT^ 

TCGCCGAAGCGATQGAAAAAGTG3GCAAAGAAfiGC6TGA^ 

GC^AGCCGCCCGCTGTTGATTATCGCT^ 

ACC(^a3CAAAG(^TaTO^(^(^ 

C6AAATC<^AAAGAAAA(^AICXX3^ 

AAACTGC^GAGCGCGTTGCCAAACTGGCAG 

6CGCAGCCGTTGAAGAAGGCGTGGTTGCAGGCGGCGGCGTAGCOT 

GC6CGCCGTT6AGTCTCKGCTGC6CCA^ 

TACGGOGACATGATCGGJ&TC 

CTGAAGAAAAACCGGCTGTGC^^ 

SEQ ID 1020 

MAAKDVQFGNEVRQKHVNGMNIL^ 
f^VAALVEELKNlAKPCIffSKEIAQTO 
ASRPUJIAKDVEGEftUTLVVNHIRGIIJem 
KLQERVAKLAO^VIKW^^ 
YCMtGMGVLDPAKVTRSAI*^^ 
SEQ ID 1021 

/K£ATCA(&CCGGCG^^ 
TGCCTTPGCCTTCmC^ 
GGCATTGCCGGTGTGCAGGTTTTCCAAAGCGGC^ 

CGATTTGTTGGCGGATTTCGGCAAi^^ 
AGTCGCTmramGJ^GGCm 
GTTTTCAGGATGCCGCGGATCTOTTC^^ 
GffPCGCGGATGTTGCTCATTTTITO 

GCGGAMTCGAGCCGACTTGGGCGATTTCT^^ 

TCATC00QGO6GTAAOGTATTTCATO0CTTC 

C(£GCCCATATTTTCAAACTTGTOT 

AAGOTACGCGCACGGOTTTGCCW 

TCATAAAACCGTTTATGCXXaTC 

SEQ ID 1022 

UfiEGFRKHCliHHAAHTAHSmGHSRPPPIffiF^ 

GIAGVQVFQSGTGAQQGtfAAACNHAFFNGCAGS^ 

SRFFQRQADFFOfflm^reDVLQHRFAAVAEAGGP^ 

TflJHVQLVFQRFAVFNGNHAFFA^ 

RAHIFlOiVFQFDFFGim^ 
SEQ ID 1023

(XmATTTCCGATAGATTCCTGC^ 

SEQ ID 1024 

IiLIBnSRHPKNEERQBPIGKTAOTPSPPRKRESRFVHFRIire 

SEQ ID 1025

SEQ ID 1026 

LKTKNFLSKYKlffiSIiQMQLPEHSBKHRKSKTET 

SEQ ID 1027

TTCGATAAAAAATTTTTCOT^ 

SEQ ID 1028 

UJKKFFVFKPSPLAIG^^ 
SEQ ID 1029 

ATTTCATACGCMGGAAAATGTTGAAAACAAAAAGAAC 

SEQ ID 1030 

IJKHPAIHSGKTARHIBF^ 

SEQ ID 1031 

TTGAAAATACCGGATTATCTCC^ 

SEQ ID 1032 

mPDYIiQSISPCMIRRIiPAQiaa^ 

SEQ ID 1033

TTGAACCCCGCCACTTGGA^ 

TGTTGAAACATTCAAACTCGGCTATAAT^ 

CGCCTGCGGCAAAAAAGAAGCCG^^ 

(HCGACATC(X5ACGCTCOT^^ 

AGGCCCAGGMGTGATGATGAMTTCCTGCAGGAGC^ 

CAMGACGGCGTGAAGACCACTGCTTCOGGTCT^^ 

GOTACCGTAWCQACMCIWjCAA^ 

ACATOXKSTCCAAOC^ 

GCAGCCGGATCAAGTC6ACATCAAAAAAGTAAAT 

SEQ ID 1034 

LNPATOTSVLRGGRIKFYFGRFNPFQ^ 

VDIGRSLKQHREQGAEI0IiKVFTDAM^ 

GTVFDSSKAHGGPATFPLSQVIPGOTEGVRIMEGGEAT 

SEQ ID 1035 

GTGTKSACGGGCAACATATCATCTGOGOGGAAAG^^ 

SEQ ID 1036 

vi/imssARKw\^mTi^^>mvHrm3 k 

SEQ ID 1037
ATGACCGMCCXSACTKXX^ 

CGCGTCGCCACGCCTAAAGTGCTGATG^ 

TCGGATTCWmCGTATCOXXK^^ 

GGATGCCTPTATCMGGGGQCAACCCAAAAOCG(X^^ 

CTCGCCGOTACGCCTTATATGOX^ 

GCAAACAGCGCGCCATCGTGTTCGGCGAT^^ 

(TO&TGCGTCCGCC^ 

SEQ ID 1038 

HTEPTSRRRFLKTCTAAGAGI^ 
mTPKVIMGLRGCT^^ 

LDASALTUmiLPPPDIiPIFPBSGVADlSE 

SEQ ID 1039 



CTCTACGGTGTCCGGTCGGGTTTGCXjGAGGAAC^^ 

SEQ ID 1040 

HIVWFFHAKVGHCQTPFQKTPGYPGW 

SEQ ID 1041 

AJGTmCCTAmCGGOTCTm 
GGTCX^GCCGTCGGCTTGGGCMTATTTGGC^^ 
(JCTCGATTATGCCATCGGCCA^ 
TATTAOGCAGT6ATTATOG(3TT^^ 
TOOCAACAGCA6CTTCGAACK3C^^

tatcgcctocgacca™tcaaccaggc^^ 
gcggcgatcaggacmgctgmcatcgggcgc^^ 

TGATGGACAMTKXTCAACACCTACGGCATTC 
GAGGAGAAAAA 

SEQ ID 1042 

WVAAY^IFPSLSICFGDWTYSSYI^KETO 
AAIQDKLNIGRVNATIiVCIPMG 
LKIFGWGMSAMJMPglJJ^^ 
SEQ ID 1043 

ATCAOT&CTTCXXXX^^ 

SEQ ID 1044 

HSTSAIVMMVAmVIWCKIiIiLSLm 

TCCAACCCGCmCCCATACAGC^^ 

TGTCGATATOACCATCCAAAOGGG^^ 

ATCGTCCGAGCGCAAACTnGGATGCG 

SEQ ID 1046 

KKHAPnUiLTASSAALLSAG^ 
IVRAQTLDASGKVEREWQP 

SEQ ID 1047
ATGACCCTGTTTTGCGAACAAGTCCCC^ 
axXOTTCAACCCGC^^ 

ATCGACCGCATTCAGAAAATTGCCGCGC^^ 
AATTCGGCATCGCCTAMCCGACGCGCTCGAAGOT 
GGTCGAAGCCTGCGAACGCAT1TTGATTTTGGTO 
GATTTGGGTGCGTATGCCH^GCGGTOAAAM 
TTGTCAAACACOTimGAGAAAM 
(XJCTCTGACOQCEAACATC 
GCCAGTATCGCTAGCMTTACMCA^ 
CCTGCCTA 
SEQ ID 1048

IDRlQKIAARLGKTAPVSliRVNKOTAKTH^ 

DLGAYAIUVQKLMGTBRIjKIJLEPGRSLV 

ASMASIOTTRNRAAEVLVIX^^ 

SEQ ID 1049

ATGAAATACGGCtHftTTTTTO 

cottgcagcttcaaggcaaaccgcm 

SEQ ID 1050 

MKYGVFFAAATAIJiLSACGYKGGLyiiPKEGDTARFGVIQTGLQ 

GGGCAQGTAGAGGCXXSCCTTTGTAAOCGCA^ 
GCATCTTAAACAAAAAACACGCAAAAAGCTA 

SEQ ID 1052 

MGCCGGAGACGGCACGCAAATCATCGTCAACCGCCAC^^ 
CGACAGCCGOGATTTTTACGAOGTOT^ 

SEQ ID 1055 

GTGATGTTCGATGGCAAAACCGCTTCGGCGGTAGGC^ 

TOCAAAAACCGTO^CAGG 

OTGCG(^TAAAACTCTCGGC^ 

(XGACAGCTAAAAAGGGGGATTTGGTCAACAT^ 






SEQ ID 1056

VMFDGM^VGGKAFAKVGEIF^ 
PTGKKGDLVMHGIKGNFRHIV 

SEQ ID 1057 

TTTCAAAGAAAAAGAAA(»GA^^ 
TCCGCCTGCACATCCGGCAAAA^ 
SEQ ID 1058 

SEQ ID 1059 

ATtfEACCGACACATC^^ 
GCA6AATGCCCGTATTGGAAT0CGTAAGC0GTG(X^ 
GTTGTTCGGCAAAGGC&ACOCCGCGC^^ 
CXmrPimCAGCGATGCXSACTTG^ 

TCGTPTTCAAAAAACCTGTCGCOT 

GAAACTTTAOGGCXffATTGAC^^ 
GMTTTGCCGTTTACCTGCTGGATTC^ 

SEQ ID 1060 

KHUIIEmGDPILSLVETFKM 
RAYVSDPTWDNHRGIFEGAGFEVGTYPYYDPATV 
SFSKNLSLYGERVKIiSVVCPNKEEADLVF^ 
EFAVYUiMGR10AGI^SWITyVADALABVI£ 

SEQ ID 1061

ATGATTGGGGA^TGTTTGTCGGATACATOT^^ 
CCAIAAGGGATATGTTTGGAAAA 
SEQ ID 1062 

MIGIElFVmFDIVDV^ 

SEQ ID 1063

ATGGCACAATTAGCTCTATATCXj^^ 
ACAC«K5CTACTraATGCCGCTTCGCA^ 
CEGCACCATAGCCGCAT^^ 
AACOGCGTCTTGGACGATGAATTGCTXKaACAAA 

SEQ ID 1064 

MAQLPLYRTAEIGNFTVCTPKVI^ 

SEQ ID 1065 

ATGCCCCTGTTAGACJ^TTTO^^ 

GCATCCCCAATAAAGMATCCTCCCX^GAAA 

GGGCKXCGCACCGGTTTTTATATGAGCOT 

GAGTTGAACGAATACCAATGCGGCACTTATCTGA^ 

ATGAAAGCCTGCTGAACGCT 

SEQ ID 1066 

MPLLDSFKVDHTRMHAPAVKVA^ 

SEQ ID 1067 

ATGCTGTCTGAACMGGGTFIX^GACGGC^^ 
AAGCGGGTTTTTTATCTCXAACA^ 

SEQ ID 1068 

MLSEQGFQTAFAF SVIIRGCTGAGFKPASSFPIFSVPYRF^ 

SEQ ID 1069 

ATGGGGCAAMCCTGACTGCGCCTGACG^ 

CGAAAGGCAAAAATTTCCXiCCATCA^ 

GCCGGTATTGGTGATTGGGCAAGTGGA«Km 

GTGGATGAGCGCGTTACGCTGGTGW^ 

ACAAGGTGGACMCGTACCGGGCCTGGAA^ 

CGGCGAAAACCTCCAAGC^^ 

GOKXXaAAATGGGCGCAGCTTGCGGTTGAOT 

TC^GCGAGCAGGCGGCTTTGAAIPGCGGAAAT^ 

CGACAAACTAGCGAAGGCGGACACAATCGGCATTGATA 

CCCGTAGGCCAttGOCTGACTG^^ 

ACCAACACGTTTTCGCCAACTAC^ 

GCTCtXHTTAGAAACCATTAOT 

CKXXXETCGMGCGCGCCTGCGCGC^^ 

AAJCCTGTTCGACAAAATGGGCATCC^ 
ATCCTGCAAMCCGCAGCCTGGCG^^ 
(mOTm)CAGCAACMCCCCM

CCAAATCGAGCTGCGCOT 

AACGTCTCGCCCGAGCAM 

OCQCAACAAAAAOGCCAAIC^ 

" _ * * — ^ATGGCGWAGTGGJCGGCJG 



GATTATTGGATGTACGOT 
VDERmVOTKSG^ 

^IGIDTmSIin^^ 



PVGHSLT&APEQLDLQDVLGRLKPHL^ 



NVSPI$RKYAOT^ 
WnGSKLVMQ^HDELVIiBVVETK^ 

SEQ ID 1071 

SEQ ID 1072 

LI^AFVGTKYmGLLIJSNHSKRVFYFUff 

CCCGGTTOGC^^ 
ACTTCCT 
SEQ ID 1074 



ACTGCO&CC(£AGCGT^^ 

SEQ ID 1076

TACGG 

SEQ ID 1078 

LKKDIilGMRWPn^PFIYPHY^ 

SEQ ID 1079

TTGCCAf^TGCXCTCTGAAGCTTTra 
TWXXXXXOTATCCGA(5ACGGCG^ 

ttcggamcggmgaagccatgccgt™ 

OTTCGGTCTTQ^^ 
TGGAGGGCAAGGCA 
SEQ ID 1080 

ASVLPARVLEIVGNS LEDLADARERFTAYRRSGFAI EHHGMEGKA 

S ATCCA^ 
SEQ ID 1082 

SEQ ID 1083

CTGTTTCCCGA^ 

0CGA0C!AAAWATCAAT6ACAAAAATCTAGCX^ 

CAATATTTCACTCCTTAAAAGA^ 

AATCCGC^AAGGAAATTGAAGTGGCXXGCGTGTO 

ACGGCMCAAACGGACGGCAACCCTATTTGCCAA^^ 

TCGCACXIGGCGACGATACCGATATTTCCAAGTT^^ 

HPVKEIEVARVIiQHIGLQSGSTTETAVRFI^ 

TTATCAGACGATGGAGTAGGCACAA66TGGCA6AAAAAGGGGTT 

SEQ ID 1086 

LPRIAVI^IiLSAASPPCPDIMilHYRFPRIAJHGLKI^ 

^ATtaAAAC^^ 
GGGCGGGACACACTTATOCGG^TCMCTACCAATA^ 
CGGCAAAT^TGACGAM(OTCGATGAT^ 
GTOATTTOGTGCGGAGGGAAGCAATCCC^
(X3CGTTTQCACAAATTTGTC 

CTCAACTTAGGflCAGCGAAAAAA^ 

TTMTGGGGTATTGCAAACAGGCAACCXX^ATAW 

AGCACATCAAAATGGGAAATAOT^^ 

AATGTTTCmATCCGAGACAGC^ 

AATTGATACTTACCAGCAACATC^^ 

CAGTACCGCTACTTGGAAAOT 

AAAGTCAIOTAGAWaGC^^ 

AACOTAmXXmiTCGCGGCG^ 

CACCGTTACCACTACAGGCAATAAAGA 

&KX£GCTCAATCTGMTTACCAA 

(XSACAOCGCACGCCTACAATCATTTAGGAA^ 

•aTTCAGGOXXiACAMKXS^^ 

ACACGTTOGACT^C^ 

ATTOAAATCTCACAjSGACTT^ 

AGCAACATTTAATCAAGCCAJM 

AACGTAAGCCATTCCGCACTCMC^^ 

AAGACAGCGAATGGACGCTGCCGTCGGGCACGGAATPAGGC^ 

GGC^TGCmHTOXXX^TTCGCGCC^ 

C(X^ATCTC(mCTCTTCa^ 

TCACGCTAffTG^^ 

(£GCCTGCATAATTCGGTCAAAG^^ 

AGGAAGAGAAAAAACGGG^ 

GCAACCGaGCCCCAACO^ 

Gj^OGCCTOTTGCXEA^ 

GTATGCAGAAAAACCTCGGCAGCGGGCGCGTCGGC^ 

GCMTACGGCATCGGCAGGITCGACATCGGCATCAG^ 

GCAAGATACCGOGCAGGTTTCGGCGQATICGGCATO ' 

CM!TCAAOCGCTAOCGCGCGGGC!ATO^ 

GCXKXTPCAATAOCGCCmTTGGCm 

CAATTGGMGCGCAGCACAGCGCGGGCATCAAATTA^^ 

SEQ ID 1088 

MmDKRTTETHRKAPKTGRIRFSPAYM 



VNLGSEKIKHSFYGFLPTGG^^ 

NVSIiSmREFVYHAAG<ft^^ 

KVItDQQADDQGK^ 

TRSDWTGI/TSCTEKTITDDKVIASLSKTO 
HVSHSMJJGHVSIADKAVFHFENSRFTGRI^ 

mQSLDAIJAAGRNATEKAESVAE 
I^AHffiRMAVWTSGIHDTOJYI^ 
ARYRAGFGGFGIBPHIGATRyFVQKADYRYENVNIATTC 
QLRAQHSAGIKLGYRB 

SEQ ID 1089

GCCGAAAACQG0GGGGTTCCGACC6AAAAGGAACGGGGATAAAG 

SEQ ID 1090

MPSEVRRFGFQMALPACVKRYSUTARHKTLPRI PPPVGFYRRKRRGSDRKGTGIK 

SEQ ID 1091 



GASTmATCAACTCGCCGGAGCAA^ 

GGCGCTAAAAACO&CCGGAGGC^^ 

CATTTGCTA 

SEQ ID 1092

IXJGRGi^FIUiARQflfLTVRFGVrcRFCSE 

m 

SEQ ID 1093 

GTGT00G0COTACG6TAG0CGATATTCC0GAGTO 

SEQ ID 1094 

VSAVTODIPBSNGKGEYKVFTAYWGKATAGNl^EK 

SEQ ID 1095 

ATCTCGGATAAOGTTOCAACGATT^ 
CGCCCGAOXXX^CGTGGCAACCTAT^^ 






ATTGTTACCGOTATTGCCGGAAC^ 



CTGAMCGGACGCTCTTGTGGjAGGCCGG^^ 
fTGCATCGGAAM 

SEQ ID 1096

U0H1iI£EAGWQGE£BGIiFIiARTRHVNMjK^ 

SEQ ID 1098 

mFQFSVFDFFVFT(3RGliS7IiKmvm 

SEQ ID 1099 

ATGACGAATTmGGTOC^^ 
MTCCAGCCCCGTCGGTACGGAAACTTA^ 

SEQ ID 1100 

JACTmMCCCA^TTC^ 

S aSi^ 

aactctcc6tcatoccagggaa6tgggaot 
SEQ ID 1104 



ATOOXM 

CGCACAAAACCCAAAMCTCGGCX^GAAAAM 

CAGQGCGQGTTCTAACGGCTm^ 

GCCTACGGCAACTCAACGCCAACCGCMCACTO 

ATTACCAAACCAMTCCGCMGCGACTATGra 

(CTCTTTGGTTTGGATGCCra 

CCCTCGCCCAGCAGATACCTTACCTACAAAGQ 

ACCGCATCGGCTGGATTTTTGAAGACTC^^ 

GAGCTACCtXXSCCG^^ 

(XXJAAAMTTACGATACAAACGGCATCA^ 

TTTOKXmGCAATACGC^ 

GCmTCGAAAACCGGTCGAMT^ 

AAAGACGATCCGAAATAC^CCACCGCCATCCGGGGGCAGATT^ 

AePTOITCG^ 

AAACTGGGAATTGGGCCTGGCGGGCAGCGGCAAAGCTGG^ 

AACAATCCX^GATACGCCCCGCTrTCAGACGGCAC^ 

TCGACAraTO»a^CCCAAA(^ 

TGGCXTTATGCCAAACACAGCAAGGCCTATAOGCTGTO 

CTTGGGAATCTTTACGCAGCATO^ 

GAAGTTC 

AYGHPNANRNT^ 
PSPSRYI/FVKATGVARSQPDPQEWVN^^ 
PKNYDTNGINGEVYHSFRHIRQNTAQ^ 
KDDPKYTTMRGQIPHLGSERAHAGFSYGTC 

WPMHSWffTLRHiSMflffi 
ATGCAGACGACAAAAGAGGGAATGCACAC 
SEQ ID 1108

C t RQSHCRQFERKITHTVHRPSFNE^ 

SEQ ID 1109 

ATCMTX^CTACACGCAjGCAGCTTCMGGTAAAAA 

CCGAATTCCGCATTTGGCWmC^ 

CAACKXXJTGATOCXJCGAATTO 

OOAAAGGCTTGATGATGA^^ 

AAACCCTGCGCGTCGGCGACCGCGATT^^ 

(%GCGGC<&CCTSCTCGAA^^ 

MTATTGAGGCAAACTGGATAGGCA^ 

CGGAITACGCAamCCACTATmCGTCG^ 

(XMCGCCMTTmCACOT^ 

SEQ ID 1110 

MNDYTQQLQGKKDYLKTL^ 



NIEAJ^GNIRIARLSAEEFl^^ 

SEQ ID 1111 

GTCmmAGCaCAAAAACC^ 

MGAGGOTCAG^^ 

(^^TGGGCGCAAAATCAAGAAA(X3GTCAA!K7K^^ 

CGCTACAAAAAAGMGCCAAGCCCGCCAAATTT^ 

GCCGCGACTTAGGCAAT(KX^CCGAACGW 

(XX^TTACCTTCGACACCCXX^ 
AACTGCCX^TCMTCTGATTGCCATCG 
CG^XXGAAGGCCGTCTGAT^^ 
GTCAGC6GC(?I^TGGGCAATAAT^ 

<jKX5AAATCC(XjAGCGGAA 

SEQ ID 1112

VEFSTKiraLQEQQAC&QLFVCADKW 

KmEAXPAXFSQAVFH^^ 

GITFDTGGISLKPGLN^^ 

SEQ ID 1113 

GTOKXATCOX^^ 

GCAGACGCT(mOaTGCCGCCCCGCAAM 

CXXKEGCXXXX^TAC^^ 

TTTATCH!<XTGCAGC6ACTCCGGGGCTACTT^ 

OTTCGGCAAAACGCCGOTCCTGCGTTCTO 

TACTIOCTCTTATACGCCGCCGAAAAGCGGTT^ 

AACAGCGATTACGGGCCGGATGTGGATATTCAAMCGCA^ 

TOmGCACCCGCAGCAGO^^ 

AGATACCAACCACCTTACTAAACAA 

SEQ ID 1114 

VSIPTATPLPAGEVTljSSDNGNIENIirrAGAGSASDAPSRSKRSIiDAAPQOT 

FICCSDSGATSYAEITKQDIMKFGAW^ 
KSDYGPDVDIQHATITGPTFSC^TSG^ 

SEQ ID 1115 

GrcCGGTTCGGCCAC^ 

SEQ ID 1116 

VRPGYPPPAQAARAAASGSKGRALYPIAYSIimy 

SEQ ID 1117 

CTGA3TPTTCT 
SEQ ID 1118 

VPAFVCKVSTFVFYS(&TKTC^ 

SEQ ID 1119 

TOCCOTACTAraOTACTGTCTGCG^ 
GCGGTGTTTTT 

SEQ ID 1120 

LpmYCLRUlRLVIJFVNPLyDAPRPPRV^^ 

SEQ ID 1121 

GTC<3CTCAAAG(XA606AfiACATO 

GTCOSCCeCACTCC^^ 

ACACGtmttmQCCGCCC^ 

CGAAAQCTGOGACCGGCGOT 

TCGCTACCCGACAATTGOXX^^ 
MTAATAGCTTGTC^^
GCTOmTCCGam^ 
TOCGGAftACGOCCO^^ 
GCAMACCCOm^ 

TCGAmCGCGCm^^ 

COX^TAC^CGCXXSACCC^ 

ACGCCGCAAGOmttK^^ 

SEQ ID 1122 

VAQSQRDIPFAVTLR^ 



MYPAAAALQAVLRRG^^ 

IIRQIiGIQIRQRSDQ^^ 

TPQAPRAADAAHHIVD 

ATG(XK^GGCGTTGCGCK3CCXAGCGTGCGGCTO 

CXXXXmTCTTGGAACACG^ 

G(^CGGGClTrGCTGGCGAAATTGGCGGGMGG 

ancACGAaTrccGar^ 

GCA6GG3TCGATTAT6AG6CC^G66CG6AAAAGCTGACTGC^^ 
' ACGATCOTK^^ 

(^CGCTACCGCGMCAAGGCXX^GCGGAACX^AA^ 
GOTTCTGCCGAAM^^ 
TGCOX^TOTACGCCCACCmACTAC^ 
ATCGTGCCGACGTTC^CTACCGCTTC^ 
SEQ ID 1124

ARAIIiWKIAGRPAEAVARYRALHRDNAMJERVbl^ 



qrYREQGRAKRN^^ 
IVPTIiNYRPGKTESNVPYAKRRNSEVrVSAEWRP 

S a2otgc^at^^ 
ototccaaaaagcgaacgctt^ 

SEQ ID 1126 

MAAVYAGYTYTQPGGES^ 

OTCTOAAJmWKm^ 

SEQ ID 1128 

VLNSTFIUiIETVVGSFQTAFR^ 

SEQ ID 1129 

ACGGCGCGCTGCAATCGGGATGGCAAAAAGGGCT^ 

CGCCAAATCCACGMGCCATCCGGCGCGGCGACAlOT 

OTATCCCCTATO^CCTO^ 

MGCMXXX»CCGi^^ 

AAAATCGCCCAAACCGGTAAAGTATGCOTACCCGAGCCGOT 

GCGCAAGCCCGAACGGCAGGACMGGCGGGGAAACGCCGC^^ 

AATATCGCX^TGCGGCTGGAM^ 

•TCTGAAAACCTCCGC^ 

a^TCTTCGACCAAGC^ 

SEQ ID 1130 

LPRIAMPYFALFD^^ 

RDnffiAIRRGDTYQINYTTRLHLQAYGNPVSLYR^ 

AOARTACXXK^PHPFEA^^ 

KTLQGAEEHLSNALRGVPAAVIA 
SEQ ID 1131

ATGGCMTOGGGGCAAATOT^ 

SEQ ID 1132 

MAIRGKCFDYTPLIfiGQWILCRESGGKSVKL 

SEQ ID 1133

ATGGCAGACCGCCAGTTCmCC^ 

GCACGGTCCCGCTGCCCGAAAACT 

CGGOCAGCKIATCGFCAGtt^^ 

ACCACTACOSAACGCGGA^ 

QCTTCTGCGAACCCAACGAATAT 

AcrcxxaxxmcATcra 
SEQ ID 1134 



HTHTCffiAPraiRSDDKK^^ 

SEQ ID 1135

TOCGTAAAATCGCCCAAGGCATACGCGGACGCGC 

TCGAmCCMGCCC^TCGGOXXXXOTAC^ 

GAACTTCCCCTK^^ 

GATGCGGTCXJGACTGCCCGCAGGCGTGTrC^ 

(XSGGTCGC^GTGATGGAAGCCGCTTOCGC^^ 

TACGGCAACCCTGCCGAAGCCGAAGCAGGCG^ 

CGCTTTCGACACGCTCGACCAGGTCATCG^^ 
GGCGAAACOTACATCMCCGCGAAAACTTCGM 
AAGTX&TTTAICTCX3AAACCGACATC 

SEQ ID 1136

LRmQGIRGRM)ELTOIVAEGGK , FKDIi^ 
DAVGLPAGVFNVVDGPGABIGNMiSAHPQVDHV 
YGNPAEAEAGAPEJfiSWiIEERAVKAVAEKVERAVKQGAKL 
GmiKRENFKAM^FHAGmSGIGGAIX^ 

SEQ ID 1137 

ATGAAACMTTGGCCATGTACATCAACGGA)^ 

OXSACGTTGACCGC^^ 

SEQ ID 1138 

MKQLAMHlKRFE^FNGEWRDVIiNPSTro 

SEQ ID 1139 

TOKHTPATTCCCAAATAAAmCCCT 

SEQ ID 1140 

LVYSQINYRTSPFTRGFWVSSQIRLTCFLP 

SEQ ID 1141 

TTGTTTCATTTCGGGCCCTCCAGT^^ 
GGTTTTOTTmAjS 

SEQ ID 1142 

LFHFGPSSFW(KSFGLFPNKLPYKFLYTO 

SEQ ID 1143 

GTGGAGGGCA(X}TACCGWCGATAOTTG^ 

TCGCGGCGGCAAAACGGTCA^ 
CCAAACCTGCOEK^^ 

AGCCGCGCTGAACATCAGCGAAAGGACGCTGCAACG^ 

SEQ ID 1144 

VEGSYRFOTLSNGISIHGG^ 

GGKNVGETAYLCG 

SEQ ID 1145 

AT6AGCGGGGTGGCTGOTATCTCCGCAACAAAAACAGAAT 
CCGGCAGTCiCGGCGTAGAGTTTGACC(^TA 

SEQ ID 1146 

HSGVAVYLRlOaffilAIASQDA^ 

SEQ ID 1147 

TOATAAATCATAmTCGCG^ 
ACGCTOTTTGAGCGAAA^^ 
SEQ ID 1148



ATGAGCCCATCCCCC^ 

(XK3GCAGCGGCAAMCCACGCTGATGAGGCTGATO 

CCGCCGCCGTATGGGOGTATTGTTCCAA^ 

AAATTATGTTGTACGACGAACCGTTTA^ 

GGC6GGClXmGAC«XXXXGTGGCATA^ 

LVIMIiNAVGIAG^^ 
GGIiADGPVAYRYPAQTSLQQDIiLG 

AA 

SEQ ID 1152 

VYFKAYREDGQPSK^ 

^ATGAATTO 

CGCCGATATTTTGGOTATATGGTCGCGGCT^^ 

AAAACGACCGGACAGCTCGAAGCGATGMCGTGATOGC^ 

TCGCGGGCAlTITCGGCGCGTATTTG(?rCGOT 

AOaOTTTGGCGGTCGATmATAT^^ 

KTTGQLEAMNVMAVNPVARVVAPRFWAGW 
TILAVDFILTASMFTD 

OTCAAATCCGCAGGCGTATTGGTCGGG^^ 

GCGCAAATCCTGACTTCGQGAC^^ 

AAAACCTGATCGGIAAATOATGACCA 

SEQ ID 1156 

VDVYRLTDCLKTKRfiTSDI^- _ _ . 

AQILTSGUjGBQTCGLQQGG^ 

S atcaaaJ^^ 

TCCTCAAAAGCGGCGACGCGGCTTCTGCACG^ 
CGCGCAAAAACMGCGTTGGCCAMGMT^^ 
GGOSGCAAGGAAATC^^ 
GCACGAGCCTC(miACC^ 

OSEIVVIU^IPGQKPVNMDFm 

•IftjTKX^TACACGCTMO^ 

AAATTCmGCKmCGCmm^ 

CCAATGCGCTGATGAAGGAGGATTTrTTC^^ 

GAATraCO&TCAOT^ 

OXmGTCAG(^TTOXXmM 

ATACGCCTGCGGATTTGACGGGGGCATTGACCTTC^^ 

6A(^6GCAA(XmXX3C(W5CmT^^ 

KFF(»RIJLRVGRCT^ 
EFTDQVFQNHCFTGYG! 
SEQ ID 1161

ATGCATACAGAACTCMAAACG^ 
TCGATTTCGGAGGTGTCAAACGTGCO^ 
OiAiOGTACGAAATCAAAGACTGGCTGAAATCA 

SEQ ID 1162

SEQ ID 1163

ATGQCAGGCAACACTTT^ 

TCCAATI^CCTCGAtf^^ 

COOTOTAATCCGCAA^ 

AAAAAAAMTCCGGTTOAAGGC^^ 
CGTCGGCGCGAAACT(£ATATCG^ 
GTCGAAATCGGCGCAGGTTTT(^CAGCGTAA^ 
GGCMGACATCTGCOTCMTATCQCCATCAA^ 

TCGGACAAG 

SEQ ID 1164 

HAGmTGQIFTVTTFGESH(a6Ii«:iIDGCP 
GGGRSSARETAARVAAGAVAKKWIiKEKFGTEITAYVTQVGEK^ 

SDR 

SEQ ID 1165

TCAAAGAACTCTCTOTACO^^
GGCGTGTCGCWTCGATGCCGTGTTGGCCTTAGTO 
CGACA(X^AMTCTCGGTC1TO^ 
CCGTCGC^GAGCTOmGCaKm 
TCGAAACCTTOACACCGAATCC(^ 
AC«aCGCTKmTTGCK^ 
(XEGTTKXXaCCGGOGCACCAT^ 
TCGCCGTGCCXCTTTCCTATTTC^ 
CGGCAAAATCCATTTTTGGCTCGGACl^^ 
AGTCTGACATPGAAAGGCGGAAAA 

SEQ ID 1166

mQBmiimSFTAVGIFTOliAVLVSTQAINLLGI^ 

PWAHjRSREYAEIIiKQKQELSLV^ 

PVSHRRTISTRQIiIGSSHPQHQABUBM 

SLTLKGGK 

SEQ ID 1167 

TTGTTTCACACAAACCGAAAATGCC^^ 
TACAACCTTTGCTCCATCAA 

SEQ ID 1168 

LFimn«CRL!OTNFQTAFVP*NTP^^ 

SEQ ID 1169 

ATGAAAAAAACGXXITACQCCATCCT 

ACXMGCC6ACCGCTACATTTTCGCCOT 

CAGCAATATCTTGCOTTGGAC^^ 

AATAAAAACAOTTGGGCGACACGTTTCXXf^ 

TTTATCCGCCCMGAATATCGTTT^ 

CATCGACAAATACAGCTACA(X3C(£ 

AGTGCCGAAACCGGCGCG(K^GAGCCCGCCGTTC^ 

SEQ ID 1170 

mTAYMLLLIGFASAPAFAETRPADPYEGYmVSKFNI^ 

NKNTI/OTASJKMNS^^ 
SAETGAAEPAVHEDSVSFPQAEAAGEAETQPGTQP 

SEQ ID 1171 

ATGCCGTCT6AAAATCXHjTGTTC(K3ACG^ 
OTACGAAGTCAAfXGCAGCGTATTCGTCCT^ 
CAATTOTACCTTGTC(XXXCGTGCGAA^ 
(XCmCTCGAT(XmCATTTT^ 
SEQ ID 1172 



PAliDADIFNHTOIQIiSTOTD^ 

SEQ ID 1173

ATGAAQCTCAO^TCC^ 
AGTTGGCAGGOTCATACI^TTGTCGCXXX^ 
GCT(&(X£AGACGATTACACTO 
CTGGAAGOCTTGTGCGAA 
SEQ ID 1174

MKLTVRNYHLDCTGHVNHARY^ 
LBALSE 
SEQ ID 1175 

TTTCCAAGTCCTCGCCGTTGC^^ 

TTCGGCACACAGCnTOrcCW^ 

AA 

SEQ ID 1176 

MKKILTAAAVALIGIIMTVLIPDSKTAPAFSOT^ 
FGTQVYPTSVLIOTGBILKTYVG^ 

SEQ ID 1177 

TTCXXXXAGCGCGGTATCGATTTC^ 

AATGCCTSTCCGACGGCTTO 
MraTTATTTTTGTAGTCGTTTGCCGTT^^ 

ATTCGGACAAGGCTTC(AGTGCGCGGGCAAGG67^^ 

TGTAATCGTCTGGGTCAGCACMTGCGQC(^TO 

AGTATGAGGCCTGCCAACTCGTGCATCAGT^^ 

SEQ ID 1178 

LRQRGIDFLWAEIGFADISFEDFAFFAOT 

UGFSVQVGQGEGGRGPTVGDEIXK^ADQC^ 



SEQ ID 1179 

ATGCCGTCTGAAGCO^^ 
GCCGCCAAAGAGTAGGCKXjGAATGTCT^^ 
CCAACGTAATCGGACGGATTTCCGTGTAGCC^^ 
CCCACGGCAGATTTCACAGTTTGCCCCGAT^^ 
GCXAAAAAACCGCGCACCCGTCTGCCGAC^ 
SEQ ID 1180 

MPSEAQKTElTMFIiFTQIIRPDTFRRHff 
PTADFTVCPDAIttVAQNRIRKHIRPAFDIItf 

SEQ ID 1181 

ATGGAATCAAAATTCTmTCCTGCTGCTO 

C«IKXX£ATATCGGACGCGGGGT(M 

CJU^ATOTGATGATGGGGO^^ 

TGGCCGGGGCGCAGGGTGATTGTGATGGC^GGCGTAA 

(^GTGGTGAAAAAGAATCTGCCGGAAGGT 

SEQ ID 1182

MESKFFFIiLUtFAGSVLPPSYIffiGIGTO 
WPGRRVIVMAGVTVGRGSVVGAGAVWKDIPPYSIiM 

SEQ ID 1183 

AAACCMTTTTCAGACGGCATTTGTCC^^ 

SEQ ID 1184 

KPIFRRHI£IiKTRFFKRIHTKHPQV^AFHPSYNI^Iffi 

SEQ ID 1185 

TTGATTCCA!KMAOCTTCCTCA^ 

SEQ ID 1186 

IiIPSMLPQKHFUfflBmiDBTTENSEHAHiV 

SEQ ID 1187 

TOAAAGGAAATCAAATGAAACAAGGTAT^ 

AGGWGCTCCCTGTGCCACCCGT^ 

SEQ ID 1188 

ItfGl^QGIHPNYREVNV^^ 

SEQ ID 1189 

TTGTTTCATTTGATTTOT 

SEQ ID 1190 

IOTjISFQKSGHRGCTYASDKUSILAITOT 

SEQ ID 1191 

TTGTTAATCC&CTAIAT^ 

SEQ ID 1192 

LLIHnDKKSVCQILKSPAISlMAGUISCKAAISV 

SEQ ID 1193 

ATCGGATGTTTTAmTTCAAAATGTTOT 

TTAACATCTATTTAGGACAATTTCTAGGCTOT 

GATTTTCCTAGXCTCAAAGTTTTGCT^ 

AGT'TCTCGCGCTGACMTATTGGTGTCTTO - 
AAAAATTGGCAt^TOECTTC^^
OTATGGACTCrTCTCCX^ 

SEQ ID 1194 

KGCPMIQNVVTSIILY^AVDIjLTYPI^ 
SCGADNIGVFVPYFTTLNIANLIVAL^ 

SEQ ID 1195 

TOAAAACAACAGTTTTGATATGCT^^ 
TGATGC(XjTCTGAAGCAGTCTOTGCAOT 
CAAAACCCBCGAAAAACCGF^^ 
CTGGC^G(X^GCCXXmXXrPTC(^^ 
ACCX^ATGATGCOGCAOGCTTTGCWaGOT 
TATCGGCTGTATCGGGCTGATTCCGCn^ 
TCTTTCCTOTCGCTAOT 
TGATGTTGACGGCAffTCGCCTCGOT^ 

CTGTTTTCGACCGACT66GGGATTTO 

WTTCMGCGCQ6C(mCTCGA^^ 

TTACG(X?PGGCCC(XKA^^ 

ATTACCCGGAAAAACATACGCGGCA(X£AGGCOT 

CGCCCGTCGTCCXK^GTATGGAGGCATCGTTTT^^ 

CTACGGCACAmCCGCACCGC^^ 

MCAAAGACAGTAAGTTTCCACTGATACGGAAAATCGGGGAAAATAT 

SEQ ID 1196 

LKTTVLICnfGLCRARKKYYEKD^ 

LAGSPTPLVAHLFGQTDFGIPPVYLWVAAAFK^ 

SFUjGTGTOISLAAAYP^ 

LFSTOWGILGIVWMLAVLVLI^^ 

ITRKNHGRQAVTHWAAGVTLTV^^ 

UKDSKPAliIBKIGHJIIOTTD 

SEQ ID 1197

GT(XX^AAATCGCGTATAATT6CGCGATTAAACCCTA 
ACCGGTTTAAATTTAATCCACTA 

SEQ ID 1198

VPEIAYNCMKPIjYSGUiLI^EKA^ 
SEQ ID 1199 

AT^TGACCCTCTATT^^ 

TCGCCGTCATGAACCCGTACMCCAAGTCTC 

(XX£GA!IHXCGTTATGCGCGGCC^^ 

CGCGAAGCCATCGGCAACGGCCTGACGAT^^ 

ACCATTACGATGTCAAACTGGGCAAMGTGCCGCGC^ 

SEQ ID 1200 

MKTLYSGITCPFSHRCRFVLYE^ 
REAIGNGLTOiSPSFSKSKHLGEDF^ 

SEQ ID 1201 

GTGCXmATGTGAAAATQOCTGCAAAftCGGCGGG^ 

TC^GCX^ATTTGAGCAGCGGCGCGGCACTTT^ 

TGCTTTTGCTGAACGAA(XX£AAAG 

ACCAAATCGCGCTCGACCAGCA(X5GGGACTTGG1T^ 
CGAAGCGGCAGOGGTCGCTGAAGGGGCAGGTAATGCCG^ 

SEQ ID 1202 

VPECENACKffAGFASISDLFAHGFFGGCQCFD^ 

FFHTVQHQPPPAAHNGIAGHKLRMGEAFVDVFVD^ 

YGVASPCRTICTVOGFVALS 

SEQ ID 1203 

TTGCAAAACOCGCCGTTTTGCAGGCATT^^ 
GCCTCACGOTCACArcCTTGTCTG^ 
CAAGftTTGGATCAGCTTTTOCGCC 
GAGC&GTACCCXmWAOXX^^ 
SEQ ID 1204 

LQ^PFCRHFHIQAHHTEPIMPTSTKPYIIjRALC 
EEYRPDTPSHfTSAETAPRPAKKGLKLVK 

SEQ ID 1205 

TTGCATCGATTCATTOCCX^TAlft^ 
T^CCGCAGGCAGCACCGGTTTGGCGG(X^ 
AGACTTGAACGCXXrGWTTGQGCGCGOT 
(XXXX^GGAGAACAGCATACAGAACMTAC^ 
GAGEQHTEQ'IKHFFHGFSFRGCKQOTASaJOIU)



CAAGCTCCCtXXGmCGCCCnCTOPimG^ 
QAAAETPSVEAEAAESSAAB 



OTCTSAACCTGCGATAO^^ 
SEQ ID 1212 

WQTAFCPKGRKRNPFVOT^ 1 

COGGO&AGCO^^ 

SEQ ID 1214 

mbjtrrktcsavcfapqtaskpavsikhpsed; 

SEQ ID 1215 



0»CTACCmC^^ 
S m^K^FEVAMRRFmm 



SEQ ID 1218

LAAft 

^ JkTCTAOTAOOTCTATOAAGCA^ 



CAAACTCAACTACACCTTGCGCTAlAi i «*u*i i ruaiuw , — ,„„ , , ,™^ AimTO xxm3UX3mCACC^^ 



CK^AACAGCOTATGGGCA^^ 
SEQ ID 1220

MniPSHKHSLPLLAALVLAACS 

EGGAQWECYADSSRHDYT^^ 

RSPAWGVLGHYQSgSU^MIiDYYGKVADR^ 

VPDAGKNSVLRKAEIXJAIKRALOT 

VIGAQSRVC&QGIii^ 

LKQRKGTVPAR 

SEQ ID 1221 

TTGGAAAAGACAAAATGCCGTCTGAACA^ 

AccOTcccmGccan^^ 
SEQ ID 1222 

LEKTKCRLHRRLPELYAET AP PLECFRHNLYFS I ICRI/5ANRCIiCPFQT ALSKIVARPLLY 

SEQ ID 1223 

ATOTAACCGCCGAAAGGGAGAAGGCGCAGGAGAACAGC^ 
G^FTMCAAAAACCGGTACGGCGTOtt^ 

SEQ ID 1224 

MI/TAEREKAQENSIyrtodFFHVFP^ 

SEQ ID 1225 

ATGOXnTCGAACCTCT^^ 
GTACTATTICTACTGTCTGCGGCTTCGCC^^ 

SEQ ID 1226 

MPSEPLFRRHIJSLTHSRRmVPYYSGUM^ 
SEQ ID 1227 

TTGCCGCATACGAAAAAATGCCGTCTGAAACC^ 

GCAGCTK3GGCGGACTGCTGTTGCAAAGCC^ 

CGCGCGCCAAGGCTOCAGATAGCGCGGA^ 

CGGTTTGAAGCGTTTGMTA£^TCGTATTTGCOT 

TGTGTCATTO^TTGTATTCCT^ 

SEQ ID 1228 

LPHTKKCRLKPNRFSIXjISFGDLF^ 

RFEAFEYIWAVDVTOHFGLFLFAHSLDG^ ' 

SEQ ID 1229

ATGACAGAAWACCGTGGATAGCCGAAGCGAGAAGGCACATCGGTTO 
CX&CAMGTCTTGGTACTTTGAAGA 

GGCGGCAAfCAGG<mTA^ 
GTTACCGGTTGTCGGACGTTGCCGCCACX5GCG^ 

SEQ ID 1230 

KIEI^WIAEABBHIG^ 
GGHQGN^IIPFDPADIIXS™^ 

SEQ ID 1231 

GTGATGAGCCGACATCXIAGGTGCCAAACTCC^^ 

SEQ ID 1232 

VMSRHRGAKI^mi^IKPVIPGVPFIR 
SEQ ID 1233 tMWUWWWMWW 

SEQ ID 1234 

LTQMGrPAKAFRQPQIiYLPRPKPTSGPVVMERQ 
SEQ ID 1235 

ttgtogmcgct™gotgttatccccgc^^ 
gacmtgccgtn^amcgggcamggcagcg^ 

SEQ ID 1236 

LSERFSCYAKKLCQIGRIiIFVG^^ 

SEQ ID 1237 

TTGGAAACCfTGTCC^ 

SEQ ID 1238 

SEQ ID 1239 

ATGGGCTATTCCGGCXXXjGGC^ 

TGCGTCAGGCGCGGCAGAATATCGGCATGGTCn^ 

CAAAGCGCGCGTTGCCGAGTGCCTCGAMTC^^ 

AAGOXmG^TCCTCGCCGACG^ 

aXACGAAATGAGCGTCAOGACGATO 

CmCTGATTCGGGAGGAC 

SEQ ID 1240 

MGYSGAGKSTLLRLINl^PDTGKVNVCGQELT^ 
KPQVILADEPTSALDPATTRSV^ 
S atg<^



AACCTGCGCX5AAGTCCCAAMGGTGTGATTC 

aTC^TCGCCCTCCTO^^ 

SEQ ID 1242 

MADLTFEQAVSTIV(MEIVRAIiGETF^ 
NLR^OTEAAAA^^ 

SEQ ID 1243 

GTTTGAAACCGCXACAACATCAGGAGAMATA 

SEQ ID 1244 

MPSERQKPRHYPKNAIXSPCSRQffi 

SEQ ID 1246 

mTCHGCGCCCGMTCT^ 

GTGCCGATCGCGCCTTTGGGACTGTATCC^^ 

TCAftOGAOGGCm^ 

GCXOGCC&CGCCGACGTGGAOT 

q«j^AAACCGCCGACAAA6AwAGCCAA^ 

ATGAAGGCGCAGCCAAA 

SEQ ID 1248

VFCAPIiayPGKUCSLEE^ 
VKTADKDSQWLKDVTEAYNSDAFKAYAHKRFE^ 

SEQ ID 1249

ATGCCGATTATCAMCGCCOTACCATCATCCG 

C&CCCAATATCCGA 

SEQ ID 1250 

MPIIKRRTIIRLSHKTAQIOT 

SEQ ID 1251 

TTGTCOGCAOCGTTGGCGAGCAGCX^TC^^ 

TO6GTQTTQ(XSjACM1PTTCT 
CGGCATCAGCGGGGCGAGC(KjTTTC^ 
ACGGAmTACGAMGACC^CTATATGCCTTGGC^ 
CGATGCCX^CGCTOCCTCT^^ 

CmCGCTGTCGCTGTTCGGAAI^ 
SEQ ID 1252 

TOTKEEVYALADFVHPIGOT 
HTLSLFGIDTPDLAEWQQfiD 

SEQ ID 1253

ATGGCAAAAGTAAMGGCGGATl^GGCGCGGCTTGGAT^ 

^GGCGAACGCCGTOGCGCGCCGCT^ 

GAAMCCTmCCCCATCGMGAAGa 

TGCGCCTTTTAAGCCTGCCCGAATCCGTGC^ 

CT^CTGAAAAATTGGGTGTCAACGCC 
ATTACCGGCC? 

SEQ ID 1254 

ENLNPIEEAQGLKRLADEFGLWBTIAQAVGKSR^ 
LTEKLGVNAEIKTANHKKGHILH^^ 

SEQ ID 1255 

GTGG6AAATATAATTCTATTGACAT 

G0G0G 
SEQ ID 1256 

ViaTCIlMTFIiUlVRIVFLFSF^ 
SEQ ID 1257

GTG(£TAAGCGCGTGAGCC^ 
TGOTAAAGCX3GCAGGG0GCTGC 

SEQ ID 1258 

VGKHVSQCWSVISTIOTTSDM^ 

SEQ ID 1259 

AlOTX^TT&CGTGATTTGTTTGCTO^ 

AiGTGGEAAOtXX^TTOTQCTGATO 
AGGAAAACGAJPIOTAC^^ 
SEQ ID 1260 

RKTH/TRSRKNVNRIIFPT 

SEQ ID 1261 

ypGCCTCGOTTGCC^ 
ATAGCGGTACMATGTOnTTACAATATT 

SEQ ID 1262 

LPRIAVIiFVIiSW^PCPDL^^ 

SEQ ID 1263 

AAATGAATATCCCAAAGTTTCAAGCCGTTCCTC 

GTTOTCTGAAAGGGTGWTGGCAGTGAOT 

SEQ ID 1264 

K*ISQSITOFlilKPDRTmCRPMro^ 

SEQ ID 1265

TCATCGACAACGCGGC6GACGAGGCGTTGG6CGG 
COT^AAGAAGGCGTACCCGTi^ 
GTA«ira?PTACCAATGCGOTa^ 
GOGCCGTCAAAGACAGCGGCAOCGAAGTGCXX^GT^^ 

(XXXHGCCGCTfflTCTO 

OTACGTCAACCraTOCGACCTCG^ 

AGTGCAAAGCGACX&CGT(tfTO 

GCCGCCGTATCGGGCGATCOT^^ 

AAAAGAAAAMGGCAGCGGCGTCGCCATOTTO^ 

ACTCGCCCGCC^CAAAGCCACACMGCCATC^^ 

CCCTCTOTA(XXmCTTCCCCAAAC^^ 
CGCCCTmCCAmCGAACTOACGGCATOT 
AAAGACJmCaTraCCCCGACACCC(X^ 
GCCGCGCTTGGATGGAACtKXSAAGGCGAT^ 

VSVTNALSTRl^SVKRWKVHRIW 

AVPLFSCEHnSIXmDFSIGEGAAFALTWI^SCAHESYm 

MVSGDPLELWLNQNVDFGKKIAEIiAIRQAQARIKSV^ 

VGVDPHAIKIBPDI^GIjRYGKIMLSDADVDG 

KDTMIPDTRRIiLQVQIPEG 

SEQ ID 1267 

ATGOTTGCTTAAiRQOGQCWSGCXXSCTQCT 

ATAAACTACTACAATTTCGCGTTGATTTOT 

6 

SEQ ID 1268 



SEQ ID 1269 

ATCMGCAGAmTCATCCTACMTCCGCC^ 

OTTGCAATATGGCATCAATCOT 

SEQ ID 1270 

MKQIIILQSAVI£ICAAWAVWGFAGPLS^^ 

AOCTOATOXXSTTTTT^^ 

GTTCGAOTK^ 

ClGCCGG^mGCTGCOG^^ 
AGGCGCACGATGCCCaC

MDPILFVFHIIAGLAHAlFHlLVITIiQAPIPMAIiA^^ 
ACCGCX^GCATRTCCATCGCGTTCATCAGGAAAATC^ 



SEQ ID 1275 ™^»n^^wwira3TMO^^ 

SEQ ID 1277 




CJOTQCGRIPACCGftTAGMGCftCCCftW 
CTTGGQAMMCMTGQGCKtCOIGCGC 

I^RISGHRAPARYRRTPTP 



SEQ ID 1281 , „ ..^^^^...^^^^^^^^^^ -- - - 

SEQ ID 1282 _™ JlwmKfflWfiU ^^ 



VGElAQMLNKRTOSBtiKISVHEPEUGGIKVEVGD^^ 
SEQ ID 1 283 _^^^L«^mJS81MM^ 
TTCCMCQCOaCI^ 

SEQ ID 1284 „ wnimim ™ mn MxmreiffiFKPSSR^ 

SEQ ID 1285 ^.^vmy^.mfflffl^^ 
TGCCGAGOTGMTTGAAATCACMGTATTCX^^
OTACTGCATOXGCCTGCC^^ 
ACTGCGAACCGCGCAT6ACGACX3GTCGGCTCGCAAG^ 
CCACACCGCTGCCTATCCGAAAOT^ 

TAOOGCKX^TMGCCCGAGTCCCT 

CGTTGCCAAAGCCGCTAAGAAAAACMCT^^ 

GCXXaGCIXrrACCGTGAAGCTCAATAA 

TCAAAGCTATGGAAAAATGGCTGGCAAATCC^^ 

GAACGACCCGGACGACGTATGCTTTATGTCC^^ 

ATGGACraAAAGAGTTCTCCGACGAAGGTCAm 

ATGTCCACTTCGACCCGCAACTTCCCGM^^ 
GGCAGAGTTGGCGGCGATTTGCTCCAAACTGOT 
GAAATCGACAGCTACftAC^^ 
SEQ ID 1286
KLEAHUU&AERAMjGIPAIjPLjN^ 
TLLMFDSFHDVQEKAEKG^^ 
GSSRKSATNSVIVIHTGEDIPF^^ 
STAFRLPQAPAESKVGFTIAQKMVGIttCGIiP 
NRLLLPDTVGTGGBSHTRFPIGISFPAGSGLVAFAA^ 

GKSDIPVRLWVAPPTKKDAKELS^ 
BIDSYNEVAETVHV 
SEQ ID 1287

ATGCAGOTAATCCTCCTGAAATTAGC^ 

TCCATGGTTT6TCAGATGCAATKK!AAGGTGA 

TATTAMGAAGGCGACACGG1TACCTCTACCGGCOT 

MTACAACTTOSACC&XXX^^ 

GTCAGCGTGAGTTGATTATTGGAGACCGTCAGACAGOT 

ATCTTCTATTCCCAACGTGC^ 

TGTACX^TGGGTGAATTCTTCCGCGATCGTGGCG^ 

G05AAGCCTAiraX3GCGATGTGTOT 

TCTGADCGCGTTGCCGATTATCGAAACOCMGCG 

ATCCXjTCCTGCAATCAATGCCGGTATITCCGTATC 

C6(XX3T3XriX^CftATTTGCAaOT 

GGCTOGACCX^TTCGGCAATCAACMOG 

GCCGTCAATGCTTWGGTGCAATGTCCG^ 

SEQ ID 1288

MQLNPABISDLIKAKIENIjSVNAEVSTRGWISVTK 

HmTAPVEKIAPGVIARKSVDQPMQTGIi^^ 

CTMGEFFRDRGEDALIVYIBLSK^ 

IRPAII^GISVSRVGGMQTKVIKKI/SGGIRLAM 

AVNASGAMSDESEKTLEAAMKSFKSSYAYQA 

SEQ ID 1289

ATGGCAGTAGGAAAAGA£^TTCTOUXAAAATCCGCAG 

TGGCGCGTCOTATGCCGAAAAAGTGCGTATGGT^ 

GATTACETCTGATAAGGGTTTGTGT^^ 

AAAGGTCTGATQGCGTGTCAGAGCATTGGTCT^ 

AGAAACATGAAATO6ACAGAATCXIATCTGGTCT 

CAAATCACXXOT!CAGCTGGGAATACaX^ 

GAACAGGC^GCCCGCATC^TTGCCATGAAAGCTO 

SEQ ID 1290

MAVGKEILTKIRSVQIPPQKITKA^WCT 
KGLMACQSIGLNWASAVffijGOT 



SEQ ID 1291

A^GOCMGGCAAAATCGTACAAAmTCGGTGCXXn^ 

TCCAACAQOTTK&GTGATGGCGT^^ 

TACGTOGGACGCA1TCTCGATGTATO 

AOCGAATTGCTOGAAACGGQCATTAAAGTGAT^^ 

TCAACAACATCGCCAAAOX^ 

aTGGTGTATGQOCAAATG^^ 

TTGTTCTTCGTTGftCAACATCTACCGT^ 

TCCM^GCGTAmOTmCCCAMC^ 

CGTCGTATTGAGCCGmTATTG^ 

GGTGTACAGTCTACTCTGCAAAAATACAAftG^ 
TBLLETGIKVIIJLLC^^
LPFVDHIYRYTIAGTBVSAIiIiGRMPS&^ 



SEQ ID 1293 



03TCCGCAGTCC(mA 
SEQ ID 1294 
AIAQIiKTIiDYIKSHKK 

SEQ ID 1296 

yjFEIXJIiIiEatHGIilTORIKVl^CFHSRMRFISEa 



j«cgmaa«oooo8noca^^ 
otc«»3M<^cgcc^^ 

IGRIRALSRAVaQKYIESREKLGFPLlKK 

SEQ ID 1299 a wwMi^iai^aumnwff 
CGAG 



DKEKALYAAAQGIfiPRlMAVAEGBFQTALSEIASVKPQVD^ 

SEQ ID 1302 

SCPPVSVCGTVHFKLH^POTKEGIGCFVSVOTBHH 

SEQ ID 1303

G 
SEQ ID 1304

MLEPYRPIC 



SEQ ID 1305 

ATGCCGTCKJAAGGTOTT^ 

SEQ ID 1306

HPSEGCSIXsIAVNPIiIiHTDDGDQTF^^ 

SEQ ID 1307 

GTGGGCGA<&TGGAAA^ 

TTGGGCGGAAGAfTTAOCAATTTTG^ 
CGCCAACACGAAAIXmCAAG^^ 

CAGGATGAGGCGGCTGTTTACCTTGAGGCMTAC^^ 

SEQ ID 1308 

VGMEKDRSIIAMGAWLEILSE^^ 
RQHBIAQGIQKTABNDTLQSMGF^^ 

SEQ ID 1309 

TGGMCAGGCGATGGCGGMCTCGTCCCCGGCTTGTO 

GAAACGATGTTTATGCAiCXjTOCTGACCTO^ 
(X^TGCGTTTTTTCTTGGACAGGTTTGCOT 
CGCCTT6T6CG0CCAAGAGCTQCOT 
AACACATn^OCOm^ 
SEQ ID 1310 

EfMPMHVI/PSPSGVAEYGGRAFPIiLBSEHOGTAGniSRKAl^ 
KTFKHRLIRALTKIGREREKRRKRREQTIGKIIVPFQ 

SEQ ID 1311

ATGGACATOGIMTTGCGGCAG^ 

GTGAGGAAAACCGGGCGGC(^^ i 

SEQ ID 1312 

MDIWAADDOTMYLCVAAKSVE^ 

SEQ ID 1313 

mX^CCAATOXmGGO^^ 

AATTGGGCGMTACATTGCCGATTGOGACAAAGTC^ 

CATCGATTO™^ 

ATTTTCAAAATGTCCTGCGAATGGGTGGAACAATACAAG^^ 
fTATGCCGACCAATTATGCCTTTATGGCGMCGGGTT^ 

ATGCTTCAAAGATGGOGCAAA 

SEQ ID 1314 

IJPlCGGGGGmFIDV^ 

IFKMSCEWVEQYKDVMQYQDQDI^ 

HLQEWRKKLSARFLRKIY 

SEQ ID 1315 

TOOXnXXEGCCGACAAAGCCTATGOT 

TCGGAAAGGAGAAACGGATTftA^^ 

TTTTGATTGTCGA 

SEQ ID 1316 

LPSERQSVCFKDGAKSCLPDSYARPIDGAGR^ 

SEQ ID 1317
TTGGATATTTTGATreTOGATGACGGCTCGA^ 
CCTCTTTAAACATOGGGCTGGACGMTTGGCA^ 
GGAAAAAGAOCGCAGCATCATTGCGATGGGCGCGTGGTTG^ 
ttTGAAGACATTG^^ 

ACTATMGTTTOGTACGAAGCCGGCAAACTGGGC^ 
GACGGCGTGGAAAATCAAAGAAGAAATCAGGGCGGGGM 
AAA(XXTTCTCC(X3ACAGGATA^ 
AGCT6TTTG0CGCACC6CAATATAG6AAMTCTO 

SEQ ID 1318 

I^UVDDSSTDOTAIARRFQBQD^^ 

HEDIVAVFPFGNPIHNNTMIMRRSVIDGGL^ 

KALSGQDIGCLRLFIiYBYFIiSIiEKYSLTDIiliDFL™ 

SEQ ID 1319

AWPTQCACATCTTCAATO 
^CXXTCCGGGGCTTACGGAGCMGTCOT 
AGAAAfiCCCGGATTTGCC
RKPGFA 

GATTTACCGTGATGCGCGGACAAAT 



GATTTCCMGC8CGCCGCftTGA«<KjC6lOT 

SEQ ID 1322 




EFQjERHK4fiFKDareKHWjVHTIiK3SQIAVGRTL 
GGTJATGGTCOCGGCTCTOOGCGW: 

SEQ ID 1326 




FDIMAIjGKFGHBKKKEKERQRQAKLEKVYGESVILFK 
SEQ ID 1327 



SEQ ID 1328 
SEQ ID 1329 



1™= 
TCmTCGGGC^CTTTCCA^

ccacgtcgggcagaagctgtg^^ 
tcggtcgtcgcogcgctcgaac^ 

SEQ ID 1332 

VLHRLQKVVRHIAQTBIMPRFUfreSRRKHJGS 

SGBOTATOGQGAFLNGTRLPLRLVDKKLNEA^ 

SWAMiBPKLPEBHVGWIBESQ 

SEQ ID 1333 

TTGCCTAAGTCTTCCACCTACCG(X^ 
CTACGCATTTCTGCCTC(mT 
TCGCAOTCTGATACCTCCAGCACACOT 
TOTOCGOGCAGG&MACTOG^^ 

SEQ ID 1334 

LPKSSTmKQMPTACQPNUiRmAPESSTGILTOT 
SSAQDDSTSELIiRPL 

SEQ ID 1335 

ATGAAGTTGCCGTITAAGCGGC^CACTCTGTTTAA 

(XXaATCTGTCCCTG^ 

CTCTTCTTGGAAAAAAG(XX5AAGCAGCCG^ 

GCCGAAGTTGCAGC^TTTGAAA^ 

GTCOGGGTCATACOTACGCAGTGAATACA^ 

CGGCGGCACCAAA(X5CGGTGTGATTGAAACCAC^^ 

ACMTGCGGAGTACGGCGAATACGTTACCGGCCCTGAAGTGG^ 
AGAGGCTAATGTCAACTACGC^^ 

GTTGAOCAAGACAAAAAC ' 

SEQ ID 1336 

MKLPFKRHTLFlffSEAAQHFKASTTIY^ 

AEVAAffiiKBGAIWAHGPNVH^IVPRM 

TLTEAGYAPEMAYFECLHEMKLIVD^ 

VDQDKH 11 
SEQ ID 1337

ATGAACACTCAOCACCAOCCGKXICTGOT 
OTACGGCGCATATGTGCTGCACOT 
CTTGAATGTCATOXS^^ 
GC^GAGCAATGGTATTTCATCGATCCGACTGTTO 

SEQ ID 1338 

MNTHHHPPCPCGSCTSYAGCCRP^ 
AEQtfYFIDPTVPLPmQPCirc 

SEQ ID 1339 

GTGGTGGTGAGTGTT^ 

SEQ ID 1340 

VVVSVHPVI^CRUCGI^AWjKSLSVRASACBR 

SEQ ID 1341 

ATCTCAGCGGCAAACOTCCCGCCft^ 
GCGGCAAAATCGCTnt^AACCGGCGCATCGCG^ 
GATTAGCAAACGTCTGGCACAATGGGGCGGCG^
GTCAACAACOTGGCATCACXXXKXSACM 
GCATGATGIUW^GCGCGCCGGOOGCATCA^ 
TATGGCGCGe<^GTC(^GC(X^^ 

SEQ ID 1342 

KSAMfllPPPSSGNKQQTGIMPAQSFDYRKK^^ 

VNNAGITRDNLLMRMKEEEWDD^ 

AISRFfflflQDIADAV^ 

AAATCAAAGCCTTCAACCCOiAAGGC^ 
CTACGGCATGCA(nTIAK^XX5C^CC^ 
GACGCGCCCMCACACTCGACXTI^^ 
AAAAACAATOTACGraTCCAGTT^^ 

GACCAACTC^CCTGCGTGTTCXTrC^ 

AGTOATGGCGAAACTCGCCGGCGTGACCGA^ 

GCAAGGCACGATTTAOCCCGACGTAATCGAATCX^ 

GAGCCCTTCCGC^OTGTTCAAAGA 

GCGMGTGAAAAAAGAA3!ACG(XXjACTTG^ 
TGCXXaJATTCCTGC^^

CCATACTCACT(XITCGGCCX3rc 
SEQ ID 1344 

^DKILUiDFGSQVTRLIARKVREAH^ 

DAPlfTIjDVVJMSHGDKVSK^ 

DQLTCVFVIEGIjIiRIJIEGKMVMD^ 

EPIiRDLFKDEVREI/n^M 

PYSLIiGRVSNRIINEVKGINRVVYDVSGKPPAriEWB 
SEQ ID 1345 

(^TTCramCOSAAAOT^ 

AAACTGGCGCGMCAGTTTACCTATATGGTTTGGGGGAC^^ 

CTGATGACTTGGGTCTCCGTGATGA!^^ 

OOTOUUXmCCmttGTCGGra 

CCTCATCXSTCGTCCTGATGTTTC 

GAAACCX^TCAQGGACACCGCGTCGTCAAGCTG^^ 

CGGCA(M!TC!CCCGTTCAG 

C<mTOTGCAAATGTAC^^ 

aaaggcacgctcgcgccgcagcgt^^ 

aacgcgtcgccctggtcxx^cgttccggcag 

caaactcgmshe^^ 

tccgcotccamtcg^^ 

cxxxxgccaottgaaagacgcgccgatattatt^ 

(arcATCGTCGCXXamCCTta^ 
ACCATGCTGAGGAACATCTCCGGAAAAGATACCG^^ 

SEQ ID 1346 

MIEK.TPGIOTCED&RSFMRI^ 

liflWSVWISKIRKDMFAKMLTLSSRYHQETP^ 

ETHQGHRVVKLFl^AQAMRFDAVNRTIVRLSKKITQ^ 

KGTLAPQRVEt&ISFRNVBVE^ 

SALQAAHLQSLIDASPta^ 

TMLRNISGKDTAAVQTA , 

SEQ ID 1347 

TTGACFTCGTTCOGC^^ 
TOCaGTOXXmrPACCGAOA 
SEQ ID 1348 

LTSFRKKQPALFRLLFTSERQTMY^ 

SEQ ID 1349 

ATGTCTTTC(X^TTCTTCTTCCCCGGA 

AOTCTjGGCGATGATTMCGGCAGCGATGCC^ 

MCGCCTGCCGCCGTTG(XKXaACACAGCCT(^^ 

GCCGTAKXmGGCCT^^^ 

CCGGACfcAGTCGTGATTG^ 

{aTGmcccGOBca^^ 

CAAMCGCATCMCAAGGCCGCCGCGTGCAGCGCAC^^ 

SEQ ID 1350 

HSFAFFFPGQGSQSLGWOTAEHVIVIQrrro 
AVPQGVGAMAAILGLED^ 

SEQ ID 1351 

GTGGATGACGCGGATTTGCGGCTGCTTGATTTCAA^^ 
CGmGGCPO^GCTTCTT!^^ 
ATTCGGCACAAATCK^^ 
ATCGGCAAAATCCAATGCTTCG^ 
GCGGCGAGCATAATGGGCTGGGTGTTGACTGTTTGAC^^ 
CGATGACGTGTTCGGCAAAGCCGTTCATCA 
GTTTTACAAAAAGTKXX5GCGGAACGAMTM 
SEQ ID 1352 

VDDADLRLIiDFNGFECFGKF^ 
IGRIQCFGGDKROTAEAVS(2$(£RRFAOT 



SEQ ID 1353 

TTGGTGGTCX^^ 

TCTTXWTGGAAGCATOAOCACTTO 

^TTTOTGCAGGCATTCGAAGTMGCtt 

(^fiATCGGTTTOGGTTTCTTCGCGGAAA^ 

GCAATCAGAGXAGGCACGCtXXX^GCCGCOTTC 
GGTCGOTCAGCAAC^
TGRCCTTCXXjMCCGT^ 
TAAMTGTTGAGCGGCTTCGGAO^ 
AATGCGCGCTGGCATOTGTTCGTCA& 

SEQ ID 1354 

LVVGIQTMGSHRHWDITLI^^ 
QIGFGFFAESGFNffTAPC^VGSRIRQ^^ 
GOTSNGFHHMA(£F€EPPWtt^ 
HAHftHYVBQ 

SEQ ID 1355

AGTAAGCAAACTTGAAATCCCTACTT^ 

SEQ ID 1356

SKQT*HPYF13KAyCmS*SCLW3raSIQ 

SEQ ID 1357

ATGAAGCGGCAGGOTTCT^ 
ACGGCATC(&CAAAeGGCGTGCCGTGCGGGGAA 
TAAAACGGCGAAAAAGCGTTTTGT^^ 
CAJCTC 

SEQ ID 1358 

MKRQAFFKPMACAAFLSAVSLR^ 

HL 

SEQ ID 1359 

ATGCAGTAT(mAAATTTCCG«a 
AATTK^CCATATTGCAGCCGAAAACGAAAAAACCA^ 
GGCAACGCCGCHWATGCAGTrTCCTO 
ACTACGGCAAATGCCTACATTAAAAGCG^ 

TTCCGGTTCGCCOTACATPTC6ft3^^ 

■j^FIOTCCCCATC^^ 

TTCCGCTGSCTTTQGAC^^ 

SEQ ID 1360 

MQYAKIS^SYLPANRVSMM^ 

TCANAHKSGMAKNtoVIGJOT 

WIVPHQANRRI IESTAKHU5LSMDKVVLTVQDH<afTSAASI PLALDAGIRSGQ IMGQNLLLEGIGGGFAWGAVLLQY 

SEQ ID 1361 

MQCCGTCTGMGCftGCTTGGXTro 

AacxmaTtmsmc^^ 
SEQ ID 1362 

SEQ ID 1363

ATGCCGTCTGAAAGCCCAAGCTGCTTCAGACGG^ 
GAMTA3TmAGAGATATGTTCMTAGG(n^GGA^ 

SEQ ID 1364 

MPSBSPSCFRRHIiIXWS^ 

SEQ ID 1365

TOCCMTAATTCAAACMTTTI^GAAGGCGGGGATA^ 

CTAAAAATATTTCTACAATTCAAAA^ 

SEQ ID 1366 

LPIIQTILBG<mWOJUC^ 

SEQ ID 1367

TOATAAGAATGAAAACATCGACGGTCG^^ 

CCGTATCAAATTTTGAGAAAAAAGCCXX^ 

AACOSOTACGGCGTT^^ 
SEQ ID 1368 

LISMKTSTVVFCXjFPMAIHj^ 

SEQ ID 1369

AAAACAAACTCATTGTCTTTGTTTGT 

SEQ ID 1370 

IOJTCAFCVSGIiSKYHHHPNTM^^ 

SEQ ID 1371 

GTGTGCCGOTAAACGGCMCTTCATTC^^ 

TCGGATTOX^TTTTCCAOGAAAACAAAA^ 
CCACnOTTCCXJTGTAITICCGG^ 

SEQ ID 1372 

VCRUIGNPIB^FQRPGFAVIXMBGI^^ 






TACGCTGTCCAATAITCCGCGAGGTCGGACAGACCXX^ 

AVRADG 

iwrsRGGiir""^^^"'^"""""^'" ~" J """ ■— — - — — 

SEQ ID 1375 

AT6TTTTCATTCTTATCAAATCATTGGCTO 

TCTTCGAM 

SEQ ID 1376 

MFSFL 

GTCCGM 
SEQ ID 1378 



TAAAGMCAAAAAATACTCCTCCA1 
CAAGACCATACCCGGCA' 
CMTTTGCCGTCATCGGCAGCGAACTCGTCC^^ 



ldprkfbguujgimivikshgotd^ 

SEQ ID 1382 

SEQ ID 1383 



ttattccggcacgcatgctgcgctgttttc^^ , jM/XXtaMZ-^ 



ACATCGACGAACACAACCOCCTTCCra 
GVAVIiffiHKAKACMPEGIPYFSGITVE^^ 

GTGGAGGCAAACGGGATCGAACCGATGACCCCC^^ 

SEQ ID 1386 

I£LFVBFGFPIC*RS^STSQ^ 

SEQ ID 1388 

U^QIPCMKSIIIBHIljLICIjHSVmVSQP 
SEQ ID 1391

(HOTACCaGATTTTC 1 ^ 
(^TGGTGCGTCCMTGCAAAA^ 
G?PQ(K3GGAGATGTGGTGCGGGCGGTGT 
TGT 

SEQ ID 1392 

VYQIFCFIGISLVVGAFGIFSRGII^^ 
C 

SEQ ID 1393 

ATGttCCGTTCGCAC(»^ 
AAGCmAGQCGAA 

SEQ ID 1394 

KHRSHDALTAPALFVDS1TCKVHEPHHI 

SEQ ID 1395 

AT<nCAGACCCTAATTTGATT<^^ 

GGCGGAACAMATATCGTTTACiOTACCGGC^ 

GTOATGCTCGATGAMGCAGCCGTATCATC^^ 

CX^TTOTAGAAGACCAAATCC^ 

GTTT3AAAAGCAGT 

SEQ ID 1396 

HSDPHUDPBIFAAERQHLQGSFLI^^ 
ALVEDQILMSLPFSPRHGHCGM^ESANQDKPNPFAVLAGI^SS 

SEQ ID 1397 CQJCTWC M^^ 
AGATGGAMGGCTCG(£ATTGC^^ 
GACCGGGCGTTiraXXJGCXX^TT^ 
GGCAGGGAGATTGAGTTTTACAGTGCGS^TTCTGCT 
GCTATTOGAGCGCGACXEG^ 

SEQ ID 1398

VFRFMAGFYRPFGKVFNGKED^ 
GREIEFYSAVVLIiHTVTGKMHRHIDKTVV^ 

SEQ ID 1399

GAAACCTTOXIAAGAAGGGCGCAGT 
TTGCH'CGGCCCGTCCAGCCTGCTGCTGG^ 
TGGAGCAACGTTCGCGGCAGTGCGGCGAGACOTGATT^ 
TACGGASTO&CGTTGCCGA^^ 

SEQ ID 1400

LVGPSSLLIALMASGAN^ 

SEQ ID 1401 

TTCGCGCAGGATCAAGCaUU^^ 

SEQ ID 1402 

IAQDEGNAQSBKTPDDFCDVCGLKISAPIGGEQ 

SEQ ID 1403 

ATGCACffiTTOXXACCGCATCGG^ 
TCATTTTTTTCAGACGGCAGATAACCATP^^ 
TATGO6CCAA!KXX^0aUUiTTCGCGC0C^ 
ATCCGTGTGTK^TTCAACGTTTGC^^ 
ACCGCCCGTTGTTaTGCTGCM 
CTTCAGACGGCATAAACGGGCAGTTACAAAACATCCAC^^ 
TCAACAGCGCGCCCAAGCCCT^^ 
ATCGATAT0CCGAT6CATCCTGGCCG 
SEQ ID 1404 
MQVFHRIGKQRIIAVRRFNKNIiRIAAIJOT 
imiQRI«IQPAYmyPgffljQMRTRRF?RFH^ 
STARPSPRFSRnmPHPAIUPHSGGCAARPTASSRP^ 

SEQ ID 1405 

AtttXSCAGCATGAAAA* 
GATTCGGCAmCCTT^^ 

SEQ ID 1406 

HPQHEKCRIrfCPVQT^ 

SEQ ID 1407 

A^mTAAAAGAGKaCOT 

CAGCAACCGCTICOGCCGAAGOCGCGCTC^^ 

ATACAAAGCAAOCGGCGACGAAAACAAACOGTTCX^ 
GGTTGGCTCCmTGATTGMCACCMTCA'



MCGTCAMGCOGTACTGTAIOCMTGfiCCAACGCCTCCW 



CCGACCC&CCCGATGC&XSC^ 
(XXj&GCAGTGQCACATCAACXI^^ 

TOIMQMMMKmVFSVMFFmMIiVLYWVV^ 

ATCCftCATCM 

SEQ ID 1410 

RHEOTAIQHiaGSRBHSSVSGGFRK^ 

•AAA.TT 

IE1TGSTDKLDSFLBTO 
SEQ ID 1415 




GOTCCAA6CCTTC(^CCT 



SEQ ID 1416 

VPNSAVDMNSWAESSLLSP: 
ASKPSTYPTQVTSKSN&KLQ 
SEQ ID 1417 




CCC^CTTMTCTTXATCGGCGGTGCAAT^^ 

SEQ ID 1418 in^UAe.!**™^^ 
GIGLLMSKTCSFLTKHEEVE^^
PKFQRAGVNFLYIALFIVVGGS^ 

IGAGVPGPIJNPPISIiraQGU^VHAH^ 
PDLIF IGGAICVAIQATKIVFGRDR 
SEQ ID 1419 

ATGCCGCAGGTTGCGTGTTCCGATAGACG^^ 
AGCGTGCAATT 
SEQ ID 1420 

KPQVACSDRRGIRAGFI PDGRAASPKVCTALYRNRWEIKGKRAI 

SEQ ID 1421 

ATGMCTTCCTMTGTCCAA^ 
CGGTCAAAAAATACG<X$CATT^^ 

SEQ ID 1422 

MNHiLSKliLLGURFYQYC^ 

SEQ ID 1423 

ATGAAACGCACTTATCAACCTTCCGTT^^ 
6CCTOGCACTG 
SEQ ID 1424 
MKRTYQPSVTKft^THGFLVRSK^ ' 

SEQ ID 1425 

flK&CAAAAGCGGCAAATTGGAGTGTAATTCACXXj 

TTCCTCGTGCGCTCOU^ 

AAAAACGGATGATTTTWATCCGTTO 

AAAAAAACCGCCAAACGCOX^^ 

TCGACAGGGCTACCGCAAAACAGGC^ 

SEQ ID 1426 

LTKMIWSVIHGIJiyPFYFRNHHETfiLSOT 
KKTAKRANERNYMKRVIW3WFRM 

SEQ ID 1427

TTGCCTGCGGATTTCTGCCGCCTTTCTC 

GTTTTCITITO3AAAATATTATA j 
SEQ ID 1428 

IiPADPCRliSVDKFIflGIOTK!fRRVGIIGSLRFBPTRIQ^ 
SEQ ID 1429 

TTt^TACGAAAKXXXX&CGGCCG^ 

CCACCCTCCTCACX^ACCCTMGGG 
SEQ ID 1430 

LNTKGGTAGSQLFPYRFYFSGKUUoEHIAGKLVL^ 

SEQ ID 1431 

ATGACATTAGCAGAGTTTTGGCCGCTGTGCC^^ 
TGTA£X3GCAAGM(£AGTTTGCCTGCM^ 
GCGTGOGTTA 
SEQ ID 1432 
mAEFWPLCIflRlffiM^ 

SEQ ID 1433 

ATGG(XX5CGGTTGMGGTGCTGTCGAACCTG(XX^ 

GGATGCGCAGCGTGATGCGGAAQAAGCGCGTTACGAACAAA(^ 

ATTGCGGAAMCCCGGGGCAGAGTTACMCCCGTK^^ 

MGTCXSCTATAfi^TOGGACGACTACATTC 

TaGOraTCAAAGGAAAAGACCGTACGAT(^ 

ATGGACGACCJGOTXyAATCCCGCOT 

AAGACGAAGCCGCGCTGTTCATTG^^ 

TTTCGCGOTACCGCTTTGCAGGA^ 

TGCaCGGCATCAGGGCGGTG(XX3AA^ 
SEQ ID 1434 

MMVEGAVEPAEPStMGSEEMPVQEVLim 
IAENPGQSYNPFFLYGSTGLGKTHL^ 
MDDRIiKSRPSWGLTLELEPPELSffllAILQKKAEM 
KRTRNIARPRQVAHSLTKamSLPS^ 

SEQ ID 1435 

TCGGTMTCTGTTTTGTTAAAGAGCGTTGCGAAATT^ 
QCAGCGGCGAAGAACCGMCTAIEACGCCCGC^^ 
SEQ ID 1436 

SVICTVKERCEIIKYPFWjSKISUJISDIPCYTFQFVRI^ 
SEQ ID 1437 ^,,„„^^rr^^AA<3J^^



GACGftAGCCGC(XHGlTrGCCGGC6A' 



TCA r 



CGTTGCX^GTCATGCCCGAAGCGGACGA( 
(XXmxmTTACCaOTOTGACAGGCA' 



LTCCACfiOCTCXBCCTGTAC 



OT5GCGGCGATGGGCGAT 



SEQ ID 1439 



iLIGSGDRSl 



arcMaccAMcrc^^ 



lTCGTC(aftCG0CB o^ 



iTTTCJCGACATTITGCGCGC 



^AmTCTCMCGBICTGCKMQC 



ATTGTGATG 



CTATTPGAf] 
CCGATGCGGATT 

SEQ ID 1440 

HlilliQAERimOTIiQAyTGIVERLHml 

otisatpswqe^ktmi 1 sqvqysmavqd: 

PMRI 
SEQ ID 1441 



,YLMDVIiRHIHSDI1iQIiAF(3M 



TTOXXmT^^CA^CCGTCC^ 



iGA r 



lTCCCGCC ^^ 



lTCTTTGGG 



C(5»GTGCATCGTGT(X5TCCAATCTTO 

^CACGCGCGACA^^ 



CGCCGTi 



GCAAITCAG© rGACCGACGCC s^^rCTATCAGGTCAtoGGCOOGeifCAAOCICCT^ 



COGCCGATOCCGC(X3TTCW3CCGTaiAAATGAOGATra 
ACTGATGGCGCGOTITGACGAAGCCAACAAf 



0tCCtOKCAAM9GQCMCI6QR 



AATGGCACT^CAGACAATACCCACGCGTGGCTGA' 
CGAGGC 
SEQ ID 1442 



jVNRPDLKFPPSfP 



CACMTATCCC 



SEQ ID 1444 



IFFVHRIADVDAA? 



iVHfflWGBGRGGWFVSFFGRHiIHD^^ 



VVGKTORl*Cm3DDFPUJa(nDmi^I£HGGSBCQA^ 
SEQ ID 1445

CATCCMGGCGGAACTCTGTTTTGCCGAC^ 
(X3(X^^GTCATCGGTTTCMGCAfiGA 
ATTATGACCMCGGCTGCATCAACGTCAG 
SEQ ID 1446 

HPPMJ&3IIXX3iLTTAPVFAQ^ 
OTNGCINVSDAVYBKLRHYFVLOT 

SEQ ID 1447 

ATCCGCCCCGGTCTGACCAA^ 
CT Arc GACOTGAAAACACTT^^ 

SEQ ID 1448 

MRP(&m3DIYKIPGRPHY^^ 

SEQ ID 1449 

TOGCGGATACGTTTCG^^ 

CGGGCAAATTTGTCTOAT^^ 

ATTCGCGCACGCCGTACATACCTTOT 

CTTOirercGMcoracB^ 

GCTOGOT 

SEQ ID 1450 

IAIOTRFCBBDFADDBIiRUSFGCGRB 
LVVHirciGFRENRIVGRAVGF^ 

SEQ ID 1451 

ATGAAMTCAAACAAATCGTCAAACCGAGCT^^ 
GAACCCTATTCCCTAACGCTCGACAM^ 

SEQ ID 1452 

MnKQIVKPSLAVIMGIIiSACATIKQRQSRRHIE 

SEQ ID 1453 

TTGCCGAAGCCXX^GGCOXX^ 
GCT«^TCCATCCTTG(^ 

CCXjCCGCCGTCAAACAGACAGGCTTTAC^^ 

GCTCAATCC^AACGCGCGTAAAGTGOGCGATACGOT 

CTTCGGATTGCCGAAATTGCTGCGGAMTACGC^ 

SEQ ID 1454 

MGRGRGETRVAAVGAGVKMRI/TDTHCHLADSVli^ 

KTQTPPQRERQIQWSRQLAIAQTliRR^ 
LRIAEIAAEIRGTGftABIAAITERNftDSLIiRP 
SEQ ID 1455 

AATTCAAACACAT^ACCCGCCATGAO^ 

GCTGACCGGTATOCTACTGCCTATACGGATTCGATO 

CGTCCGTGCGTCAAACACAATITCTTGGT^^ 

AAGATGTTACGCAGGCGATGGCGAAATTCAGCTATC^^ 

AKTGCCAAACGCCCGGTCGTCTATOT^^ 

G6C6CTTATCCTTCCGGCGACCGCXIMTO 

A(XCTGTCGmCC^^ 

GAAAAACATTTTGTCCGAGATGGTOCGCTG^ 

TTTGACMC^CAGCGAAATTATCMGCCCCAATATC 

TAO»GCGA(X3GCTCGAra^(» 
GAAATATATTACGGCGGTCGAGAGTOGGAAACCTATTTCG^ 
GTGCCTTGTTGGAAGCATTGAACCA^ 
TCCGCCGCATATGCGTACGGACGGAAAGGCG 

SEQ ID 1456 

MQLS(^IIVQSUCAEX3VEYVFGYPG(» 
RPCVKHNFWTDINEfc^ 
GAYPSGDRQnmGMHGTYBANLAHQNADV^ 
FDNGSBIIKPQraQKLAEIT^SAIITSD^ 
BIYYG<^mFDSbPDFVK^ 
SEQ ID 1457 

SEQ ID 1458

LPDFHMQNPFLPDARLPFVRKSRIRRVUJV 
SEQ ID 1459

GGCTCTTCCGAAmAAGG<^GAC(^ 

SEQ ID 1460 

MRKFARASFEHmyTAMlW 

SEQ ID 1461 

GAACATTACCAGCCC<XXX&CAT^ 
ACCC 

SEQ ID 1462 

MRVRIYASCRQFADSPTSATCYNSSFKFLTPSSEKQNDHART^ 

SEQ ID 1463 

ATGTTCCmACCCTACWGGCMGCTGa 

CTGQ6AAC6C6AA6TC6CCACCTGCAA^^ 

GTCGACCJ^CC(n!CCTTGC^ 

AGCTGCTCAACGATTTGGACAAGCTGG^ 

GCAAGGTTTCGMGGCGATTACGCGAAATTCCTGC^ 

(XXXjACAAACCCGAATTGCAGGCATTT^^ 

TCAACG6aaOUmG6AAGTGTG(a 

TCTCXXEAAAAAAC^GTCATT^ 

GATITPCAGACGGCA 

C^GTTOAO^OTCC^ 
GACGGCATGGTATCGGCAGMTCCGCGAAiffACTGGG® 
ACXaAAGGTTTGC?rCAATGTTGACGAACCGTTO 
TGTCGAGCTGACTTTCGATGACIAMGG^ 
CCGCAAGAACTGATOACGCCTACGGCGCGGACACCG^ 
TGCGCCGTCTGTGGCGTACCGOTACGMTACCTGM 
CACCy^GCCAAAGTCAGCGACGACTACGGC^^ 
GCCGTttKXXAAGAAGTATTGGAAGCCGCOT 
CMflCGACGMG^ 
ftCTQXXIAAOGAMG^^ 
SEQ ID 1464 

MFPYPSGKLHMGHVRmiOT^ 

VD3TVLANEQVIIX3RGWRS6ALIEKRKIPMTO 

ADKPELQAFIAECKAGSVAEAIMAT^ 

DFQTAmVAAKLQSQGAGEPKTQYRUlDWGIS^QR 

DQiVSAKSAKYV^VDQnC^IHffilLHLLYA^ 

pQELIlfoYGAOTARIiFMMFAAPPEQSL^ 

AYAQmEAAVRM^IVPHICmWSEIiNGAK^ 

SEQ ID 1465 

TTGCCCTGCTGTCTGCGG^^ 

AAGCAACCCACAAAATCCGTOTAATGA^ 

TKXXXMATTCXCAGGAGCAA^ 

AACTmOTPTCT^ 

CGCACCACATOTOKHTCCAT^ 

GTITPTACHKAACCAAAATOCGGAfiCGC^^ 

ATCCAATTOCCCAAMCAACCATATC^^ 

CACCCAACTOTAAGGOTTTO^ 

TOGTAAAAAA6TCG6AGAATTATT6GCMTTTAI6M 

SEQ ID 1466 

LPccua^vLiFmua^ 

NFmAYHU^IRQLVLVP^ 
IQFPQNNHIKDKIRQQLQILRDQNMIEFIGRGLyKmQ 

SEQ ID 1467 

TCTmGAGAAtt^^ 

AGCAGOUjTCCTCCCTAAATC$^ 

GOUSGAIGAAACCTGCCCA^ 

CAAAACGGACAATMTCTTGC<MGCC(^ 

OTK&CAGCAGCATCAGTGCGG 

ACCCGAAGCCC^TMTGATGAITFTGTCTTGA^ 

CCCCTGCTCTTGGCAGTGCaTGCm 

SEQ ID 1468

QNGQ*CX^PVaQ*OUMQHK^ 
PlilAVPCKCTQSCVNRRQKDQiCRH 
SEQ ID 1469

SEQ ID 1470

imSVAVFPQCAYPKSLI^ 

SEQ ID 1471

GGAAAMCAItXIAAACGGCTGTTTTGATA 

SEQ ID 1472

MIDFDKPAEEAAnQSRIiKKS^ 

SEQ ID 1473

ATGATTTra3CAGACCC(£CTO 

mttttaccmgmtggckxxiactgtg™ 
ccatattttgaacaatattacttgggaaaaac^ 

ACGTTTCATTATGAAATGATGAAAGtt^ 

TCCCCTTACTTGAACGCTGCATACTATCG^ 

TTGCGAACTGGAAGAAGATTTTTTGAAT 

SEQ ID 1474 

MIFADPPYFLSNDGFSCQNGQMVSTO^ 

TFHYEMMKAQNNGK^ 

SEQ ID 1475 

ATGATTA'TTQGCGGaAroGGCGGTG^^ 
ATCG^TTATGAAGTCTMjTTTAA^^ 
AAGATTACTGCXXX&TAATCGCTT^^ 
AGAAAACAATATAC^GTTAGTTCACTCrm 

SEQ ID 1476 

laitKaGGARTtfTCiaFEEro^ 
RKQYTaVHSLGWRVGmVUlDMPTKPEY 

SEQ ID 1477 

ATGATTGGTTTACAAAACCGGAATATTAAAGATGTOT 

ATGAA .| 
SEQ ID 1478 

SEQ ID 1479 

TOACAGATGCTGC^TTCAGTGTC 

SEQ ID 1480 

LTDAAFSVAES AY7EQKVG LLYHNSRKMENGGEft 

SEQ ID 1481 

TOTTCAAAATACGCGGATTCCGCGAC 
G1TPAGGCGGTATTCK2AGAGTACQGTGTCTTATO 

SEQ ID 1482 

LPKIRGFBDTECSICQRIiFENGRVRQVCRQOT 

SEQ ID 1483 

GTCTCTOATGC^TGCaOTA^ 
GGCGT 

SEQ ID 1484 

VSYADAVNNARIENIiKRLLPTVMPVYEQSVm 

SEQ ID 1485 

TOCGTAQOCXSACTOGCAAACGQCAA 
OTACCO^TCAAACCMGCCG^ 

SEQ ID 1486 



SEQ ID 1487 

AT^TTTGCGCGCCTGATAACTGCATAACGAOT 

CTTT 
SEQ ID 1488 

KieOTHCmSFIRFQTI^^ 

SEQ ID 1489 

TGAAATATTGCMCTTTTAACOT 

SEQ ID 1490 

♦NIGTFNVGIVKSPFEASIiDGPADMLSGTQ 

SEQ ID 1491 

ATCAAACAGAAMTTTTTGTCCTTT^ 
TGGATTI^TTCmCGTTTGTGAG 
GCTQCACGGCACX^CACACTCGOCTATAOC^^ 
gXXXX3C(£AATCTGGTGAC^ 
CACGGCMA^^
GTTOCAGOGGnGQGQCTGOIG 



HGNTSAGEGPIRAVQDFTQQGKLLliNISQVPQG 



AGGCACTAT 
AAACAAGGGCATCGAAGCCATTGCG 

SEQ ID 1494 

MQGLHDIGAIDKKTKRGFDK^ 

TCJ&'rTCTGTAAGTTCOT 
S |S^Ara^ 

ACGGCACTTTCACGGAGAJACAGCCA 

SEQ ID 1498

MTAFAKNDRENITDKELTEIi^ 

GCGCG 
SEQ ID 1500 

MRIFKNQS^VKFAKKHKISDSEIJEAVERADN^ 

CCGCCTCTATCAO^CTG^ 

SEQ ID 1502 

MPEKXQfPAITAASALPLPCTHMHAPPKSASI^ 

SEQ ID 1504 

LWKMCATVYLVILGIIPRTGNTASBIiFIPCHFSVYAA 

GGGGCATA6GAA6ACA6C6AAACGCGCTAAAAACGCAATTT6AGCCA^ 
SEQ ID 1506 



TTCATATACWATTTATCTMCOT 
SEQ ID 1508 
WGFSFRYAPTAKTVKNm^ 

SEQ ID 1509 

TTGCGGTMTTTTCGGRT 

SEQ ID 1510 

I^YFSMI/KNQVSSIRADtTCLHSQEGDGRITISIiKSIiy 

TCea^TACWCCTG^ 
SEQ ID 1512 

T(&TT!CGGCTOTG(XOTATACCMTCCGCMOT 
VFVTAGISIKVSYimiKDG^ 
SEQ ID 1515

6TGGCCGTAGGCTia\XXjAGTTCAAAT^^ . 

SEQ ID 1516 

VAVGCASSNLAVGT^KIMIIX^ 
SEQ ID 1517

TTGMCTCGCACAGCCPAC«mCTAOOT 
TTGCTTACGGGTTTGGGTGCITCCT6AG!I^^ 

TAC 

SEQ ID 1518

Y EGRAGIRAFAVLBB 

SEQ ID 1519 

ATGGAAGOCTTCAAAACCCTAATTTGGATTATTW^ 

GCGGCAGCGCGCAAGGOTATFCGGCTC^^ 

CACGACAAAACACGGTTTGGACTTCAGCAACATA^ 
SEQ ID 1520 

MEAPK^IWIIKIISAIiAVIVLVI^HGKGADAGATFGSGSGSAQ^ 
SEQ ID 1521

ATGTATCGCCAMTCGGAATCW^ 

GCGTCCTCATCGGACTCGCCGCCCCGACC^ 

CGGCGCGTACACCXXJCGMGTGTCCGCCGAAATGCTCGCrc 

AAAATGGAAAACGTOTmCGTO 

GGemTACCAAAAACATCOTGTCGCCT 

TCCTACGACTCXrmACCGCCAIKA 
SEQ ID 1522 

KMENVUJVGIJPIiLCVGESI^^ 

SYDSmilSAAQHA ,j 

SEQ ID 1523

ATGTATOCAMTATG«3AGATATGATC^CGCGCTI^ 

TGGCGAATTTGGOGAACACGGOGGCGGTTTTGAAG^ 

GGCGTGTACGCGGATTCCGTTCGGGCGCGGGGTGTGCGGT^ 

CGTTCGGAGAranCGTGOT 

AGATT7IX5GAGAGGCGGTTTGAGGCTTCGAGTCA 

SEQ ID 1524 

MYMMGDMlffiALHFSASBKAA^ 

RSEIVVPLFSIXffiCIGVLDAD^^ 
SEQ ID 1525 

ATCMAGCTGCCCOTGCCMCO^ 

ATGCGACGGCTCTGTTGGCAAAA 

GCAAAATAACGGGTTGACCGAACTTTGC^CAGGTGCOT 

GCGOTATTGTGGGA(Xj(&GGCCGGTGC£^ 

ATCCTTrCGACAGTPTTAATTTT 
SEQ ID 1526 

MELPLA^K3HKMLEPKWARLAQGL^^ 
AVTORRPVQRAXiIiITRW3DVFEEKVLF^ 
SEQ ID 1527 
TTGO&CTTCmOGGa^ 
A0GTGC6TACGGATGMGAAGC6(XX3GTTTC 
CCATCAC6GTATCC6CAGCCTGCATACGGC6ATGTA1T^ 

SEQ ID 1528 

LGLQTAFaVFFGGFVffl)^ 
RY 

SEQ ID 1529 

TTTTAACAAAGMTGCCO^^ 

SEQ ID 1530 

P*QHMPUffiQAILIQTKDQYVI^FHRPLQQCLQKH 
SEQ ID 1531 

TTGaTOOTTGOCTACa 

TCCOTCCAATACAGT^^ 
SEQ ID 1532 

LPRIAVPSVLSAASSttPD^ 






SEQ ID 1533 

TOGGGTGCAGGCT(XXOTEAAGGT^ 

AGGAATGCOTCTGAAGCCGCCTTCMl^^ 

TCGCOTATGCAGGCTGCGGATACCGTGAT^^ 

ACAAACCGCCGOTCT'TCATCOT 

AAAACCAAGCAAAATGCCGTCTGAAGCCCC^^ 

AGTATCGAACAGCAC^^ 

AACACTTCAGGCACAAGGGTTACCGCGCCGCCG^ 

SEQ ID 1534 

LGCRIiPLRFRQILTSLILRFSFAEAALSDDLYGTPEVHAFK^^ 

TtmRFFIRTHVQQTKHPPPVHPRLQOl^^ 

HTSGTRVTAPPT 

SEQ ID 1535 

ATGACACGCTTCAAATACTCCCTGCOT 
TTACCGCXX&CCGCACGXIGAG1TO 
ACAAATGOJCGACCAAAACATCAAAACGC^ 




AC(^CGGGC(^CTTCTGCCACCX5T^^ 
ATCGGGCAGCCTGAACGCCGAAGGCACGCTGC^^ 
TACGACATOK^CCGOUU^ 
GCCCAAAAGACAACCCCG(^CAAATTG^ 
CCGTAKCGOTCCGCCAGCCETA^ 
ATGTCATTGACCGGCAAATACCGCCTGTO 
CCAAOGCCTACGAATTTOm^ 

ATCGTGTTCGATCT(^CCGGCMCCTC 

AAGCCGACATCAAAGGCGAATGGCTTGAAGGGCGTCT^^ 

TCmCCGCCAACCAAGCCAAMCCaCCK^ 

ACACO&CCCAGCCGCGCTtt^ 
CACCGAACT(H!CGCP6AACGTGGACAACCTGTO 

SEQ ID 1536 

MTRFKYSIiLPAALLPWAQADVSVSDDPKPQESTELPTITVTADRTASS 

RIANYQINGIPVADAIADTCNAOT 

YD1APQTRVHA(M)YQQAKETADAP 

MSLTGKYRLFGROTLIAGINGYKYASN^ 

IVn)LTGNLSLYGSYSSIiFVPQljQKD^ 

RLNPDSVPERSFKIfTAYHLAPEAPSGRTIGAGVRRQGETHTD 

SEQ ID 1537 

ATGCAAAACGAAACCATCAACCTGAMCAGCACCTTGOCGC^ 

GCTGGCAWCGCACGGATACAGCGACAMGTGCTGTTTG^ 

GTCGGTGTCGCAOTCrcGCGTTCGGAAM 

SEQ ID 1538 

MQNETJHjKQHLAAIKEY^ 

SEQ ID 1539 

GTGAAACCGCGTTTTTATTGGGCAGCCTGCGC^^ 

TGACCGTGCCGA(XXKX3CGGGGCGATGCCGTTG^ 

CGCGCCGOTGCGCGTGGACTATTTGCAGCC^^ 

GGGCCGGGTGCXjGMGCGTATGAACAGTTGGCGAAA^^ 

TCOTMGGMGCGCGCGTGGCGGMTTGMTGCGCA^ 

GTCCGCXaTOmCGC^ 

AMGAGAAAMCCCC(^CTG(^TTTTa 

GQAGCftAATCATCGTCAWKXTTGCOGCGAA 

G 

SEQ ID 1540 

VKPRFYWAACAVLPAACSPEPAAEKWSAASQAASTTO 

GPGAEAYEQLAKNATTIDLTVDNGNIRTSGEKQMETLSRirc 

KEKNPGWIFIIDRTAAIGQEGPMVEVLDNALVCGTNAKKRI^ 

SEQ ID 1541 

TTGGAAMGAAAAQGTCGTCTGA^^ 
TMTAAACATTCTTATTCGTATGCAAAGGAA 

SEQ ID 1542 

U3U01SSERVFQTTFSVSGICFAVSGI^ 

SEQ ID 1543 

ATGTCCATCMCCGCTK^ACICGGGCATCACOT 

CGGGGCAGAACCGGTTACAAATGGGOjGCAG^ 

GTGCGOGCTGCATTTGGATTTTTCGCT^^ 
GCTmCOTCCGOXXX^CGACMCGOT
GCTGAACGMTOTACCTGAAAC^ 
C^GCOGCGATGCACTTTTXKX36CT 
SEQ ID 1544
MCINRFDSGITLSBMRj(X^ 
QNACHTjTAAACPIiLQRPFESIX^ 
QAMHPam^HAGRFNEAFBIlHrGFLPSiraC 

SEQ ID 1545 

GTGCATCMCC(XTTTC(^CTCGGGCATCACOT 

GGGCAGAACCGOTACAAATCGGCGGCAGGGAATACC^ 

GCGCGCmOTGATT^^ 

TGAACGAATGCTACCTGAAACGCTATTTC^ 

AGCCGCGATCKACTTFTGCGGCTO 

TTGATTTGAAACAACCATTAAAAAAGA1KX3TC 

SEQ ID 1546 

VHQPLRLGHHLITDARR^GCVSHRNPQFQPQLIOj 

KRGOTDGGGLPPMTPVRIGIUjRPAERI^RRAGT^ 

SH)ALLRliPFR(n^BSVQ^^ 

SEQ ID 1547

ACCAAGCCT1TATCG6CGTGGTCAACCAAGCOT 

CGGTAAGACAACAAC0GTCGGCAAACTCGC(XX3 

GCCGAACAGGTCG(XX3TAGATTTTTTC 

CCGGCCGTTTGGCAATCGATGAAGAGATGATGAA^ 

CGCTCAGGCATTTAATGMG(X£TGCC^^ 

ATCGGCGTCGGCGAGAAAATCMCGGCCTCGMCCTTTC 

AAGAAGCCGCCGCCAmTGGCGAAAAAGCT^^ 

AATGCCGGGCGAACTCGGTCAAATATCGm 

caramAaxxsrcax^ 

ACGGTTTGGGCAAACTGATGCGOT(XXX^ j 

SEQ ID 1548 

IiREVRIALIiEAnVALPVVKEFVN^ 
ABQVGVDFFPSD^KPVmTMVDYA^ 
IGVGEKIlKIiEPFHPDRLASRILGMGimj^ 
LLKAGRKiffilAMGAGTTVQEVNKIMQF^^ 

SEQ ID 1549 

TOTCGGAGATAAAAAGTCAGMCMGtt 
ACGGCATT 
SEQ ID 1550 

LSEIKSQNKPKHFLRETAFYQRLCRLKAAGGADSRPAGRSBGI 

SEQ ID 1551 

TTGTTTTTGATTTEAM 

SEQ ID 1552 

LFLIL^FIIUIiLFFPLQRFCMIRQFLPYGQRKKIUi 

SEQ ID 1553 

ATGCAGGACACGCGGCAGA!ITCCGTTTTGTCA 

CCGGCTATCCCGAGTGGAAMTCCTGTIT^ 

mGAACAAATCGGGOH^ 

AACAGOGTGTGGGTGTGTCCGGCTTGGGACGAACGOT 

TCX3ATAAAGGCX3CGAT6ATGGTAAACGC6TGGC6T^^ 

AGGCGGGGCGAAACOTTGAACCTGCCTAA^ 

GCGTTGGfGGCC&TGAAACTGAATO 

TGCTGGAGAATOTACAAGGCCGTC^^ 

CGAC6TGGTTTATCTTGC(X3CCAGCGATrTCAC^ 

ATCGAAGTGCGQCAOTTTQ^^ 

TTCCTGAATTGCCGCATTATCTGffi 

GCAGGGAATCAGCAAACACAAAAGCGTT^^ 

ACGGCMTCCGCAGAAATACCMGCCTGCAAAtf^^ 

GCXXSACAAACTCGCXTOX^ 

SEQ ID 1554 

MQDTRQIPFCQEHRARMYHFHQNAE^ 
NSVV^PAWDERQLTESGYPREmVE^^ 
ALVAVKLNRGELGAAQIJjFAPDETQALESVETTKRFW 
IEVRQFWAVSSIXjERIPYFHVGKNA^ITITTLVyAYGGPGIP 
ITJtffcFVRBPQSIGALV^
ADKIACVLLFIiKEFLG 

SEQ ID 1555

TTGTCGCMCT6GGCATCXJGACTCX3TOT 

CCTG&COGeCCG^^ 

C(XXXX3CTCG<^^ 

mCCC^CCCCGAC^^ 

CGGCGTGCATCGCGTCXAAATCCTCM 

GCGCACAGCGGCGACATCCCGCACAT(X3CC^ 

AACACGACGOBACC^^ 

GCTTGCCCA^TOATO^ 

PPIX3HSY«3KTFNbDHVQAAASYAVS^ 
AHSGDIPHIAAIJRPLEEQ(WIiIiHR^ 



ATGCCCGCCTGTTTCTGTCCXjCA 
ATCTCGCAA(£ACGAAAGAM 
CGCCGATATOCTCAACGGCGGTACAACCCT^ 
ACTATTCCAACCCTGTTCGCAOT 

S K^OTPHC^^ 
TIATLFAIiIVIiIMQLSYlFIL 

SEQ ID 1559
&TGCCGT(XXjAACCACCI<^ 
CAA(ETC<KXX!MGGOTCOT 

CCGCACCTGCCttXK^^ 

OTATGAACGCGCCCGACAGCTTTGTC<^^ 

TAAA 

S ^EPPAA^IKPTHTBKTSCPPVS^ 



ATGACTGATTTCCGCCAAGA^ 
j^Qa^jQGCGOGTCCACGCTQCAA^ 
CGCGATSATGCTGGCGGAJ^AAGQCGTGAAC 
ATCGACGACGTGATTTOGCCXjGCACATCCGTA 

CGTCCGCACCTACC(KX:GGCAGTACGGC(n!AGM 
SEQ ID 1562 



IDimSAGTSVBESIHilEABC^TPJmiAI^ 
SEQ ID 1563 

CMTCGATCWGTGOXXSCCCT 

TMCGAAMCGGCAGTCCGCAAACCCGGCAGGAA^^ 

CGGTTTGMCTTGAGGATTTTCTGCCCC^^ 

CCCGAMTCACGCCGCC^ 

GGGCAAACATAGCGGATATTCX^TTGTCGGAGATAAAM 

TTC(X^CCCCGCCGGACGTTCAJS^^ 

SEQ ID 1564 

LCAVKLIMSAFAH^^ 

RFELEDFLPPDASGGKHHAVARQHAE3M)RKHNM^ 
FAPRRTFHRHLNKKTAAVETVLPDPA 

SEQ ID 1565 

ATCTTTTttXffACCGOGTO 

CtfSAAAAl^GCWUU^^ 

GGTCGTt^TTCTGTCOTCGGC^ 

GTCAGCATCGATCCGGAACGTCACACGCCTGAAAT^ 

AATACCGCGTCGTTTCTGCCA^ 

CGGMGCGAGCCGGAAACGATTGCTGCCGATGTMGGACCCT^^ 
VSIDPEROTPEIIGKYAKQFNPDFI^ 
SEQ ID 1567

TTTCAJ^CAGGTTGATGC^ 

SEQ ID 1568 

FQMimSSNFRIAKTajLVRQFffrCRIKQ 

SEQ ID 1569

ATGTTA&GCGCGCGCTACCTCXACCTGCACGA^ 
CCCAAACCffTCCGCGCCGCX^^ 
CGAAACGCCCCCXJCCCGGCCT^ 
TTCCACGGCAAAGCCXjGCATCCTGCT^^ 

mcaggccoitoggatogctgggtca^ 

TTTGTGCGGCAGCCGTCCCTTOTCAT^ 
GGCGGCAGT 

SEQ ID 1570

MLSARYLHLHEAIiGI/SPKWIiKQAMVLPPKNTPATPA 

FHGKAGILLDNILKAAGIiDAAYVHK^^ 

GGS 

SEQ ID 1571 

(?PCC6CCCTGCTCGMTA!T^ 
(£GGGCAGGCGGAAAMCTATTACTGCGCC^^ 

SEQ ID 1572 

MNIRPASPSIKAAIAALDTVCNPSA^ 
AGRRKNHCAADGQTEDAVLMBKIC 

SEQ ID 1573 

TTGGTATACAGTGCCTGTGCX^GGCGTTO 

GGCATTCGGGCGCGGTGGCAATCAGGTGCAGTTO 

AAGCGCGGACTCAMTOGCG'ITXj^^ 

ccxx™ttctttggcagtcaggc^ 

catatckggtto(xitgagaagg6cggacggtttt^ 

ck^aacggcggcggttcagcgtgtcgaac^ 

GGATGCC^TTAAAGGCGTGTCGiACKGCGTTGCCAAAC^^ 
SEQ ID 1574 
LVYSACAAALPARTSSSRRCOTSGRCAYQYSSRADATPRRRHSGAVAIRCSSD 
PPVI^QSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRV^ 
GCRI^ACRTALPNLAPRRCRYAVR 

SEQ ID 1575 

ATGCAAGCC^TTTOACCGCCCCGTC 
CCGAACTGATCCTGCCGGAAATCCGCACCCT 

ATGGGCGAAGTGTTTTATGCGTGGTTC^ 

GCAGTOTTTCGCACTGGAAMCCGTCC^ 

CGAACTGCTCTACC^CGCAACA^ 

SEQ ID 1576 

MQADFNRPVLAVDTGTSYLSIALRADGEIRLPHQEV^ 
MGEVFYAWDTLNRRIUjSDYQVGRAADIAIjPEGYW 

SEQ ID 1577 

TTGAAATCGGCTTG(A^TGTGGTTCTCGTT^^ 
TAAATTATCGTGTTCGGACGGCATTTTTCAAATAC^^ 
SEQ ID 1578 

SEQ ID 1579 

ATGCCGTCTGCACCGCCGGTGCCTGTTO 

AGACGCTGCCGCTGCTTGCCXX^TGCCGG 

TGTCXXX&OTATCTCCGCTACGGCGCGG 

ATGATGGTTGCFGTGCGTAAAGGGTTTGATO 

ATGTOGACATTATCAAACTGTACGGCTCGATG^ 

CGAACACATCGTC(&CATTTCCA(£ 

TTCATT 

SEQ ID 1580 

MPSAPPVPVQTAIiLFQPm>IHI^ 

MMVAVRKGFDYEAASQPGCMjKIATKYPEIAASHPAGKGVHVDIIKLY^^ 
FI 

SEQ ID 1581 

CACCKXAAMTCMGCAATTATTCAGA 

GTCMTTTTTGGGCAGCGCCGCTGCCGTC^^ 

C^CCGCCGTCCCmAACTGCCGC^ 
SEQ ID 1582

HLQNQAIIQKIVPIRF^ 

SEQ ID 1583 

ATGRTGGCAAAAAiATATOCGGCAOGGACGQCGTQCG^ 

GmXX^CGCCCG^TCGCTTATC^^ 
(TOAAACTTTCOt^^ 
•K^AGTTTTGCAAATCCACCTTCTC 
ACAGGTCGTCAGaTCG(mC(^CCmC(^^ 



• GCTCATCGGCGGCGAAGCCAGCGG^ 
A0CGTTIGCG(X3GACT^ 
GCAAAGGGCGTGTOGTGTKX^CGOTOGQ^ 
CATT 

VKLSDEVBtEIEAHDEEM^^ 

. TVC^^PyPQMNVRIQKGQKWQEASK^ 

SEQ ID 1585 

TTGATTTTTGCTTCTCTGACC^^ 

OTOTGCTGCCTTTOT 

GCTCGA(XATAAAGACAAAam , T^ 

MCAGCCICAACACGGGCAM(^AAGAACGACAAM 

AGGATCACTCCGCCGTCGTTGCC^ 

CAACC^CTGOTGACGGCAAAGCCGAOTAra 

CACCTGAAMCACCCGAGCAGMTGTT^ 

^IFASLTC^^ 
NSLimaOiKlTOSRPDFVQKlEV^ 
HLKTPEQNVEIiASAEt^ 

S ATCftGO^ATG^^ 
ACAOTACG(OTKKmX^TTCMC^ 

CTGGGCTTCTCCTCCGTGAT^TGGA^ 

TCTCCGTCGMGGCGAAATCGGCGTATTGGGCAACCT^ 

TGCCGTGCOTKK^ 

MGGAMTCXIACCAAGCCCTGCCCAATACACACA 

AAAATGGCAAGCCGTTATGCCMGGGCGMTTGMCCAAATO 

I£FSSVMHDGSI£jEIX5KTPSSY^^ 

KBIHQALPNTHIVMHGSSSVPQ^^ 

KMASRYAKGELNQIVK 

SEQ ID 1589 

TTGTGTGGGACGGTTATGGTTT1X& 

AACTGTTTGaCTGTO^ 

GACGCIKfflXyCGCAAGCTGTCGGCG^ 

AMGCCCTGCCGCAG(^TGGCTGAACCGG^ 

AAACTATCTGCCGCTGCGTC^^ 
G(KX5AC(m(mCCnTia5CC^ 
(XX^TTTATACCAAGCTCGATTTOGA^ 
SEQ ID 1590 

ft&PQEWLtTCMU)^^ 
G1X3RHVSPHMMRHSYAGHLI£^ 

AACGWACCCACOrm^ 
TCGCCOXB^^ 
CGC(?rTTGA(X^GCCAGGCGCMTA(^^

GTOTGCGGCACGCCG^^ 

TCCAMTMGCGTCAAMTACGCC^ 

SEQ ID 1592

V^EAGDVVEIKWIHLRGRKAAVSEQUiffi 

NVYPRPVQmiCIQAVYIiAHAQTMVHQLEQRVV^ 

VLRHAACIFRRHI£QQCKQFT^ 

SEQ ID 1593 

ATGMCCCMGCCCCCTACTCCACCTGATT^^ 

A6ACCGGCX3GGCATTTCGCCAGCAATCTGGGTG^ 

CAAMTCCTGACAGGCAGGAAAAACCAGATGCACACCA 

GCGCGGGCGATATGGATGTGGATTTGCTGGTOGTC^^ 

ACTGTTC^GTACCGTCAAAGCGCAAACGGGCAAGCT 

AAACAGTetXTOTCC^^ 

TGCTGCACGTC^TCACCAAAAAGGGCAACGGCTACAAA 

ACCCAAGCCC^XXXmACCGAOTATACCCAM 

GTGGAGTTTGAACAACX&TTCCCCGA^ 

CC^CCTTTTTACAACGCGCCTACGACCM 

CTTGTACGATTT6AGCTTCTTGCGCTGTC 

CGCTATCCGCGCX&CACG(X^ 

TGGTCG<raCCGCATTGGCG(^ 

TAC0CTTGAA6AAAACGCCQAACA6G6CGGCGCAGGCGG0GCG6OT 
6GCGATCCGAAAAAACTTTTGGACGA!PTTG^ 

SEQ ID 1594 

MNPSPLLHLIDSPQDLRRLDKKQIflUiAGEI^^ 

SIGAALQdAAADKLLGGDWlSVAIIGDGAmA 

KQSLSLFENFGFR¥TGPVIX2^ 

VEFEQRFPDRYFDVGlAEQHAVTFAGGLACECaiKPWAIYST^ 

RYPRGTGTGAFVSDGMETVEIGKGIIRREGB^ 

GDPKKLLDDLGIiSAEAVERRVREWLPDRDAAN , 

SEQ ID 1595

GTGCTTGGTG!ItK^C6TTGCGCG!OT 

SEQ ID 1596 

VLGVPLRVWKFI PLVRITAAVPSADTRQKAV 

SEQ ID 1597 

TTGCCTTGTATOSGWGACGGCA^ 
GCGGGGTTTCAATAAAAAAACCGCAGAAACAAACGCCGTO 

SEQ ID 1598 

IjPCIGRRHRRRTOaCRNKPPNAQBHKHRPKAAARTL^ 

SEQ ID 1599 

ATGCCGCCTGAAGCGTTTCAGACCX3CATTT^^ 
AGCCXXXXXCCGAACOTTCCTGC^^ 

SEQ ID 1600 

MPPEAFQTAFCLVSAIXH^^ 

SEQ ID 1601 

GTGTGTAOCTTATGCOGGA<X^K3CATGTT6AAA(^T 
CACGGACACGCCGAOTACCACCATCATCAC(^TATra 
GCCTGTCCCTGCTCACTCASjGCGACACGT^ 
CGACXKXJGCAAMGTGTGCCTCGAAGCGTGC^ 

SEQ ID 1602 

VCTLaCTOTiKHKGKNHEPSSIFG^ 
RRGKSVFRSVQTVCQSL 

SEQ ID 1603 

ATGGTTGCAAAAATAAAAAAATTCTCAGATT^ 

MGGTTGCGGTAATCGTATOTC^^ 

CTATATCATCAGCTATCArcTGACI^ 

GGTGG1MTAGCGTAGAAGMCTAGATCGCC(^^ 

OTACGAAGMTTAGACTACCTTTTC^^ 

TCACA(XX3GTCTGCAAAACGCTGTTGT^ 

AGCGAAGAGC^ATTACXXIAAACTCGGTCTGCMCAAA^ 

TCCCAAGTCCCAAAATAACAAAGGAGAATGX^ 

SEQ ID 1604 

MWRKIKKKDSTLSVI^SIRFYVYCLTOIOT 

GGISVEBLDRRFGFSSIxPLSEINPDGIiIIiAIKIHNAPD 

SEBVItAKUSiQQKCLPEI^SC^GKAYIRPKTEQ 
SEQ ID 1605

SEQ ID 1606

LSC^PEVCNRmSGIAAASGEV^ 
SEQ ID 1607 

ATGATAGTTATTMOTATATGTATTTGTOT 

SEQ ID 1608 

MIVIIIYVFVbYVSIJEQHPICAVHHATIPHIHIL 

SEQ ID 1609 

GTGATGACGAGTGmACGGAaCGMGCMCC(^% 

SEQ ID 1610 

VtfTSVYGHEATDTTLPGKATKI^ 
SEQ ID 1611 

ATGCCGTCTGAAAOG^^ 

AT6TCGGGCGTGCC6GTAAA6TT6AC6ACGCGG1 

G6TTCATGCGGGCGGTTTC6GCTTCGATGACTTO 

AATANOCGAAGC^^ 

CGGATOGCGCGCAOTTTGCGGATGATGGATTT^ 

TGGGCAGGTCGCX^AGC^CTCGATMTCGAGTC^ 

GCAGATTTCGCCGTCGTCCATTTCGCCGC^ 

CGGGAGAATTOTCX5CCGC(^GTO 

ATTTTTCX&TTTCGGGGAAGGAAA? 

GCGOT^TCATGOTTCGCOT 

GCTTTTTCGCGCACGGAACAGGT(H U ^ 

CG^PTCATCTGGCAGCCGAA<K5TGCGGAT 

TGCCGTCTGAAMTCGGTGCGGATTATAGCA(XJATGTGGG^^ 

SEQ ID 1612 

MPSItfLQAAFFGVKVl/jFDl^M 

NKHEAQVBGNVFHQFQGUiELAPGIiAGET^ 

ADFAVVHFAAIGIDVLSQEVDPFHALLGKVCT 

APDDVPAFLRGDAAGSMD^ 

CRIiKIGADYSTJft/VWKAKYCFKI 

^ ^TGAAAA^ CGACAGCGAAAAAATCCTCTG!!GT^^ 

CO^Km'CGCCTCGCMGM 

OTTCTGC(nC(mxm!ACAC^ 
CGTCAATGCCTATCGCGGCGAAATGGACGACGGCGAAAT^ 
GAGTTTACCGACTCGATTATCGAGTGCTACCG(X^CCI^^ 
CTTTGGAATACAAATCCATCATCtXKIAM 
ACTGGTGAAAGACATTGCCTTCGACT^ 
GCOTGAACGAAGTCATCGAAGCCGAAACCGCCCG(^!^^ 
CCGTCAACAACCGCGT^ 
SEQ ID 1614 
MKKWIRTFGCQMNEYDSEKMLSVLAEfflGGI 
GLSQVDISFPEIEKFKfliPPARVBGGAAFVSIMEGCSCT 
EFTDSIIECYRDLPKLVSHI^PIQ&^ 
AU^RAETARIN(yiWGWQRaVBGISKKDK 

SEQ ID 1615 

GTGTCTATGAAACCGGTTGTATGGGCGGC^ 
CCTOGCCTTGCCGTACTATTTGTACT^^ 
SEQ ID 1616 
VCMKPWWAALUiCACT^ 

SEQ ID 1617 

TTCCOaOCAATT^^ 
GTTOXmrcCGGGATACGGTTGGCCTM 
SEQ ID 1618 
LPTNFIYMKSERRHHEIK£DTO 

SEQ ID 1619 

ATGTGTAAAGCGGTAGTCCCGGACGACTCCCT^^ 
CTGACCTCGTGATAG(X!ATAAC^ 

SEQ ID 1620 

OTAVVPDDSLPmRAPF]?*^^ 

SEQ ID 1621 

TTGAGGATAAATAimCCCT^ 
AGCTGC6TTGGGCXK3TTTGTC(^^
(&GACGATGACTGCOT 
TTACCTTCGGCMTCXTTC^^ 
OTOTGCGTGATTGTCGMCCC^ 

GCAAAGACArai<^TCTATTTCCOT 

GCftC&OGTGOCGOGAAACT^^ 
TCGTTTCCGCCGCGCATACGCCXXaA^ 

ATMTAIRLARGFTGRDKI IKFEGCYHGHSDSLLVKAGSGI^ 



AHVPWiYADMARSNIDAFi^FHGM^ 

SEQ ID 1623

TTGACTftlATATTC 
GAAAC 

SEQ ID 1624

LTIYFRRHFYSWjKSKKYTPVIPAQM^ 

SEQ ID 1626 

MSDCAVAWQMLAKGAVHDLAAQH^ 

AAAAGCGOGGCGATGCGGCTTTGATTSAATACACCW 

ACCCCGCTI^CCGCGTCGGCA 

CGACACCGAMGGCGMCGCA^ 

MCCCCCAAG^TJ^^ 

GCCGACm^CGACACCCGCCGATOGGTGra 

CCGCTATG(&CAG«^TCGAAACtt^ 

CATTTCCCCCGMCACTO^ 

TAGGCGAAA(XMCCAGCGTGCTGGCACACGGCGAAAGCC^^ 

SEQ ID 1628 

MKKLNTQSPDFQAGU^^ 

TPLDRVGIYVPGGKAAYPSSVIMNAMPAHVAGVKEIIMVVPT 
AGPWbPTSRTAHFSSniGTYDFQKRSSUQVSEQGAQKLGOT 



TOTXH^GTTTGGACACGCGCCGCACGGTGOT 

CGCCAGGCGGACAOTGGCATITGOTGATGCA^ 

AAftOCTGTSlOGCGGCAGGCATCGCGOOG^^ 

AACCCGTTTGCCGCTGCTGATC^GTGOT 

CGCGGftGCGCAAATCGTGCGCGT(^ 

RGAQIVRVHDVKATADALKWEALGVlffl 

^^^bSgQCMCCGCX3QTTTO^ 
GATTTTCCGACTGCCAACGCGTTT^^ 
C(X3GCGOTATCTTTGGTGT(XXX^ 
TTGGCTCAAAAATCGGTGATTATTTCAGATAA 

ccwmx&mAGm^ 
SEQ ID 1632 

IJ^KSVIISIl(BRPREKGAIJlGTaHCUaffiVIiPPAPCSPBP^ 

ATCA(XrWA(KmiCGWCCCTTira 
SEQ ID 1634 
SEQ ID 1635

GTGCACX&CAAAAACMCCTC'KKn^ 
AATCCGAACTTTACGCCATCCACCAM^ 

CTGGAAAACTACTTCCACTACCGCAACCTC^ 
ACATTTTGKxAGAGCATC^ 
SEQ ID 1636 

LENYFHYRNLDVSTl^^ 
SEQ ID 1637 

ACGGTCOCTCTCCGCmX^^ 
GTTCG(£GAACA!^ 
TOmTTCGACCC^TCCGA^ 
(mCCC(^CCACGCGCCTCTGC^ 

GftCCH^TGTCGC^^ 
- MCTCGGCGGCATTTiOGCCAATGGTT^ 

SEQ ID 1638 

LPRRLKPISIFPRKPVMPYQQITVNVNEA^ 

SQFDPIRISDRLWTPSWHEAPEGCAVN^^ 

DVWANILANPIilMLGEMLAMTRQG^ 

SEQ ID 1639 

MGOOGOCIAQMTCA&AGOOGQ^ 
G(ffiAGGAGTTCAGATG^ 

SEQ ID 1640 

MPADSmQTI^HAVCRPKSGAVRSA 

SEQ ID 1641 

ATGCTGAAAAAAGTTTTAATCGCCAACCGGGGCG^ 

ACGTCAAACTCGCCGACC^TCCGTGTGCAT^ 

CTA(X3GCTT(XTTGCCGAAAACGC£ 

GCGATGATAGCGGCAGGCGTGCCCTGCGTCCCCGGC^^ 

6(X3GCGGCG6CGGGCGCGGTATGC6CGTGGTCGAGAAAAAA 

GCA(XX^GCCCCGTTTATCGATGAAAAGGCACGCAAAA^ 

MTTTTTCTTTAITCGAGATGAACACGCGOT 

CAAACAGAAAGACATCAAAATCGAAGGCCACGOTTO 

GGCATCCGCGTGGACAGCCACATCTACX3lAAGGCTACC^^ 

TCO^CGCCGAGCTG<KX»TM 

©jAAGC^CGCAAAACCAAACAGGACAAA 

SEQ ID 1642 

HLKKVIJANRGEIALRVLRACREMGIATO 
AMIAAGVPCVPGSDGALPDIH)AEIliKIADKVGYTO 
APAPFIDEKARKnGKAOTDACmGYRGAGTFEFLYElX^ 
GIRVDSHiyQGYRIPPYYDSUGMCVHOTRE^ 

SEQ ID 1643 

ATGGATTTGCGCAAATTAAAAAAACTGATTGATTOGTT^^ 

CTCCCGTTTACGCCGCGC^^ 

MCGmTCGAAGCCGAAAAATCXXSGCACGGTCAAA^ 

SEQ ID 1644 

IJDLMCLKKLIDLVffiSGIAEIEVTEGEEKVR^ 
NEIEAERSGTVKEILVENGTPVEPGEPLPIIG 

SEQ ID 1645 

GTGCGCAAAATCAAACTTCCTGCACATTTCC^ 

GGAAAACCGTATCCGAACACCATTCCt^ 

GATTTGCGCAAAT 

SEQ ID 1646 

VRKIKLPWPQKPPFFDILLDIYRQLRENEHNFTWSffl 

SEQ ID 1647 

ATGTGCAGGAAGTTTGATOT 

SEQ ID 1648 

MCRKFDFAHKMPRVKSVYSNUjVFRAKMPSEKGLKCV 

SEQ ID 1649 

GTGCGTATAATGGCt^OTGOW 
TTTTCgPCGCGCTGTCCGATATGC^ 

GAAGGCGGTATCCGTGCOT^ 
ATATCGAACGTQCCG(mTimAOGACGACAGOT

CCGTCTGAAACCGTC(XXXK£GTTGAG^^ 

ACGGACAGGCmCACaaTATTTra 

GGGTATGGGTCATAT(XXXXX!CGTGTAOCGTCATGC^ 

SEQ ID 1650

VRimCPNESVKMDISDFDFT^EHLIAQHPPE^ 

EGGIRAVMVGREGEIJniRFEGGQTVYEI^ 

PSETVMVEAAKARGKKAWAVGTTSMRALESAARATGYLKDG^ 

SEQ ID 1651

TTOCCTTTMGCACCCTAATmGGGCG^ 

SEQ ID 1652 

SEQ ID 1653 

ATG(X£AAACGTACCGACCTAAAATCOT 

6CTATAMGTCATTCTGGTGAATTCCAACCCC(^ 

GCGGCCCGATGCGATTCTGCCCACGATGGGCGGTC^ 

G(^TCGACAAGGCGGAAGACCGCXX£CGCTTTAAA^ 

TCGOTTTCCGACGCTGATTCGTCC 

TGAGCTGCTGATTGAGCAGTCTGTGCTCGGC^ 

CATACGGGCGACTCGATTACGGTTGCGCCG^ 

ACGTGCAGTTTGCQGTX3AACX2CTGAAAA 

GGCCSAAGCTGGCGGKXm^ 

TTTGAAAAATTCCCCG^ 

CCGGCTTGTGCGGTTTCMTCCGCGCAGCGAAGACAA 

GGAAGAAATCCACGAGATCTGCGCCAT^^ 

CGCCGTCTCAAACGCAAAGGCTTCTCC^ 

CATCGGTCAGGGCATCGAGTTTGAOTACT^^ 

ACCAGCGACXXmCTATTTCGAGCC(^ 

ACGCATTGGTTGAAAACGGCXSTGMCATCATCGGCACAT^ 

CCGC&T(XjC£CACAACGAAGA^ 

CAGTTGCAAAAATATATGCGCGAAGCC(^^ 

TGGTTATCGGCGGCATCATGCAGCACGTCGAACAGGCGGG^ 

CAAAIOTATGGCGTACGCGCTGGGCGTGGTC^ 

TCCAAAGCCACCGGCGTGCCGCTCGCCAMG 

TGTTCCCATTCATCAAATTCOT 

CXXX^XSAACGCCTC^CCG^^ 

CGCGGCACGGCGC^TACCTXIACCGAACAC^^ 

ATACCGTTTCCAGCGAWXKCA^ 

GAAAAGCCGOOTATC^^ 

SEQ ID 1654 

MPKRTDLKSILIIGAGPIVIGQACEFDYSGAQA(3CAI^E^^ 
AIDKAEDRGRFKEAMEKIGI^CPKSFVCHTHNEAI^ 
ffTCDSITVAPAQTLTDKEYQIMRNAS^ 
FEKFPAADDRLTTQMKSVGEmMGRTIQESFQKALRGLETC 
RRJLmGFOTRLAQUiNVSEKEVRE^ 
TSDRL¥FEPLTItEDVLBIVRTENPW6VXVHYGG(rF 
QLQKYMREAVQVSEDSPVLU)PFTjKNAIEVDVIX^ 
SKATGVPIMVGARCMAGISLKEQGVEKEWPDnAVREAWPFOT 
RGTAQYLTEHGLIVQAIMKVPBGRPHIGDALKNGE 
SEQ ID 1655 
TGCXXXX^CACTTTCAAACGTTCCGC^ 
TTAGCAATCGCCACATC(XK3CAAGGGGTrGTAATC 
GAAAGT1H3GCGGCATAAGCAGGGAACTGCTCCTG^ 
AAAACTGTCGGCAGGCACGACGGCACGCAGCGCGT^^ 
TTGGACATATCCAAGCTGACATTTGACGCAA^ 

SEQ ID 1656 

caehfqtfrmsfkdlptcrpfn™ 

rwgrhdgtqrvfdklphliycaps *grni*g*sha*hrrighiqadi *rnghgrq 

SEQ ID 1657 

ATCTTTAGTGATAOTACP^ 

AAA*EAAmCGCTTTGT(^CAAOTAW^ 

MCTAAATTTTOTITCAAAMTATTm 

ATAACATTAGATCCATATACAGGGMTCM 

CIMTATTACATTTTCOTATGA^ 
SEQ ID 1658

MFSDIYLNPKNYFGimLPFKA^^ 
IMPYTGHDlJWHVQIDPNim^ 

SEQ ID 1659 

TTGCCACCATTCGGAGAATGCCOT^ 
CAGGTCSTCTGAAAAAGMTGCCCGA^ 

SEQ ID 1660 

LPPFGECPLPSPPPRGREWVSADFSVAGGLKNHINL^ 

SEQ ID 1661 

TTAAACGCTTAATTCAGTTMTTTT^^ 
CATCGTTTGTCTGCTGCW^AJT^ 
TCCTKEACAA'TCAGCTOG 
GCPiramCGCTCATT^^ 

SEQ ID 1662 

LLPSLQLSPTOIEDGS^^ 
C7Dl^LOTAHRIliNIIPHRU*S^ 

SEQ ID 1663 

GTCTGAACGGCTATAAATTOCGCCGACyW^ 

ATACGACCACGCGCGGAO^^ 

6AATTGGAAAAGCAGCCAGCACGO 

SEQ ID 1664

MNPPEKLUPAENPALRQBAKAMRQEMSEA 



SEQ ID 1665 

AOTGCGCGATCCC^TGGCAACX^ 

SEQ ID 1666 

MPADKYAPLETVLQEb^ 

SEQ ID 1667 

ATGfcAAATTACXX&ACra 

TGTTtnTOGCAGTCAGAAAATCMOT^ 

ACGGTOTACAGG^TTATCOGCACACGGCATCAAA 

SEQ ID 1668 

HHSALDHLVLTVADIDRTIAFH^V^^ 

SEQ ID 1669

ATGGTCGAGTGCGCTAATTTTCATTTCC^ 
ACAAA 

SEQ ID 1670 

MVECANFHFQLSFSDGSSILLPSEHHYACFAAFILPMNLSHK 

SEQ ID 1671 

CAATCGGTTACGAAGGTTCGGCTTCCGGCGAAGTCGTC 

CATCGGC^TACCGGCACCAACGCCGAAGATGAAGAAAGCCCK^ 

GACTATTTGGTACGCAAC6AAAC?C6TCGCCATCGC(X^ 

TOAAAAAGCGCAAGAACTCATCGCC^^ 

TTTCOTACCCCmCAMCAGCCTT 

AGCGCQGAAGACGTGTTGGCACTCMTCCCGA^ 

CGATTTTCGGCATCTGCTTGGGACA^ 

AGTCGTCATTACCAGOttAAACCACGGTTO 

AAACCCGOTTCWTPT^^ 

SEQ ID 1672 

MPPABVPSDGIVKNFimiPIMm'PALLVIiADGSVPHG^ 
DYLVMIETVAIADIDTRRLIMIjLREKGAQGGAIL 
SAEDVLAIJfPIX^SNGPGD^ 
KOTCFQGHPEASPGPQDVGYICTKPIGNMKAAKQA 

SEQ ID 1673 

TTGGACATATCXAAGCTGACATTTGA^ 
GATATCAGAATCTGCATATCTGCATGGAGGCM^ 
AGAAAGCAGG!TOAAGCGAACCAAAACA^^ 
AGGCTTTCACGGATGAIXX^^ 

(X^TCAATGAACAGOTGAAGTTTTAAAGGAGAAGATAAAAW 
OTATOTTTCATCATCTAAC^^ 
ACCAAGTTOGCAACAGGAAATTACGCGC^^ 
TCATCATCGCAGCATTGATCXXn'AAA 
SEQ ID 1674

UDISKLTFDATAMVGKTEH^ 
RLSRKPAAYAQIKSECAAMKNRHHAAKDEEAAKAYA^ 
OTGMRKIiR&VIiFMPAIWayia^ 
SEQ ID 1675
ATGAAA!KX}GTCCGCTCCTTCAT 
AGGGGCACGAAGCCCTGATGCCEGAATGGCXj^^ 

TGAcraGcaxxxmxscm^^ 

OTACCCGAAmCCCGACCGGCGTATGTTra 

AA&ATCX^reCCQCXXrPAC^ 

TCGCCGAATTGGGCAAAATATGCCBTC^ 

CAAAAAACTGCAOGGCGCACATCCGOTT^^ 

SEQ ID 1676 

MKSVRSFIRNDILAMSAYKITDVPPGFAK^^ 

HNAALYGMDYVGVPfcNGDFM 

KILPPYNMKQLSI/TCAK^ 

SEQ ID 1677

atgccc<otac<x»ttt<^^ 

aaacctaaaatcccgtcattcccacgamgtgggm 

SEQ ID 1678

MPVTHFPTSARRKYGmTVIP^GIRPFEPQSPPIlOiS 

SEQ ID 1679 

GTGGGMTCCGGTOTTTGAGTITCAGTCATT^^ 
AAAAAATGCCGTCCGAAGACOTTCGGACG(^ 

SEQ ID 1680 

VGIRFFEFQSFPim^IECLDSMiRG!^ 

SEQ ID 1681

ATGTACCGCTTTGATTTGAGCGATICCMTCCT^ 

TCATOTCGOTACGGGCAGCGATTTGACCGAA^ 

GGGAGGCGGGTT(XrrCGA(XMGTGOT 

AOTTCAMCCGACCGTGGTATTGCGTW 

TGACTCCCaGAAGCKXKXKXCGATO 

CTGCCC(^CGGATATGTTTACGACAAACC^ 

GGTAAAAAACCCGOCOGCAATAACCGGTGCTTCTC^^ 

GGCGCGAAGTCTTCTTC 

SEQ ID 1682 

MyRFDLSDSNPDKWSTO^ 

TVKPTWLRTAFVTIRSYTGN^^ 

GKKPARNNRCFSGKGVRTliIMDLDSM 

SEQ ID 1683 

•iTCCcriTrccGccTCX^mx 

SEQ ID 1684 

LRLRLRPPPPAGKGKHQDGGIKRGMAEHPPF 

SEQ ID 1685 

ATGTCCGAGCX5(XX3GGGTTTCAAa 

GGGGG(XX3GC<Xa(£CGCAM 

CAAATOATCTATACGAAOaGAGA^ 

TACCTG(XXXXXTACGGCAAGGTTrcCGGCTT^ 

TCATATGCAGAGACACAGGGCMTGC(XXX!AACTTC 

AAATTCGCCCATTTACAMTrGTCGGA™ 

AACAATAGTAATAGTAATCAAAACCTCGTCTATACGACAGA 

TGGATAAACMCGGATTGAAAATATCGCGCCACXX^ 

GAAGATTCAAGATAAMX&AACATTACAGTC^ 

GOXXXXXXKSOTSGTO^ 

CCGACCCCGCCCGGCCCGGCCO^^ 

(&TCAACTCXKX^GAACGGC^^ 

ATACTGOTCCTTGGACGGCCCGGTATTACXK^ 

CAAAAGW^TGACAATtoCAJ^^ 

AAACGGCACAGACCAACGAGGCTACGAACTGAAGCTCAOT 

AAAGGCTATGTCGGOGACCGCTACGGOT^ 

GCGCGTATGCCTTGGAfflMGCAAAATCGAC 

. GGCGAOCXXjGGGCGGCAATATGTACCGCTTTGATT 
SEQ ID 1686

YLPPYGKVSGFDTAEU^GKAVMflHTTRAGIiAG 

APAIOTGFVQNGKV^ 

ItbFWTARYYGNDDIFOT 

KGYVGDRYGVIXKFVIiRRITDDQD^ 

LEKjSGSLIKKIEVQOTGGIiSSPTLVM 
SEQ ID 1687 

TTGAAACOX^CCGCTCGGAC^^^ 

TATTTTATCCOTAAATCTO 

CGCCCX3GCTGCXIAMGGGGGATGTTCGCAM 
SEQ ID 1688 

LKPRRSDIRPSGRQDQALSGRG^ 

SEQ ID 1689 

GGGGAG(KXK^TTTTCCAA^ " 
SEQ ID 1690 

VQCIGLFHHSra&IHGT^^ 
SEQ ID 1691 

TTGTTCGGAAAGGTTGCCGCAACTTGG&IA^^ 

CAGCTGTTGTTTCAGGTCGTCTGAAAAATAAAAAGQ^ 

AGGGCAACAAG(XXjCAAGGCTTGTATTTAGGCGOT^ 

CAAACGGTTGCGGATTGCCAAAAACCGCOT 
SEQ ID 1692 

llgkvmtwrmppprpsptgegaix^^ 

qtvadcqkplybg 
SEQ ID 1693 

atgtatagggttgccgcatcattctccttgc^^ 

gqcggcattttccaagttgcggcaacctto 

ggggttgaggtttgctgcaaggaaaacaggttgt^ 

cccctaTccorirccTc^^ 

cagccctctcxx^cggggagagaggacggggaggcto 
SEQ ID 1694 

MYRVAASFSLRAYPWYSGFVftSATOPARHCM 

PQSAPSPWEGRGGGIl^MTFPMST'APIQAFKLVALSPALSHGERGilGG^ 

SEQ ID 1695 

ttggggcggttgagttgttgggaaaggttgccg^ 
gcaacttaaatttoagctgttgtttcaggtcgtc 
SEQ ID 1696

SEQ ID 1697 

TTGGGAAAGGTOWrGCAACTTG^ 

SEQ ID 1698 

LGKVAAlffiraPSFRPSra^ 

SEQ ID 1699 

TTGOGAACAG&GGCACACATCATGAATATAC^^ 

GTTCAGACGGttTCCmTCCX^^ 

G(^TAGACAC<mATCGGCACA(»(^AAGTa 

AO^AAAACTCATTCCCGT^^ 

GCTGATGTTCCAAAGCGAGT«K^CGAAGTC 

A«yiCGCC(KaG(mTCGC(Xm 

ACa3GCXmCTCTTCGA05AA^ 
GTCGTOGAA 
SEQ ID 1700 
LRTEAHIM^QNIRTIiLDT^ 
AraJ^AMARMGARVGVJ^LYGPSQ 
TOQDIALIDARKAVDMFIU^ 
WE 

SEQ ID 1701 

TTGTGCGMCTTTTACA(X3GTATGCGGCGGATTOT 

SEQ ID 1702 

U:ELUKjMRiOVnROQNIPCKQQSAIKPVKlA 
SEQ ID 1703

TTCGTGTTTTTGCTO3TGTO 
GCCGGGATA(^CGGTGCTGGTGTATTTTTG(X3GA^ 
TTCCGGCAGCGCGTCCGAAGTGGCGG^ 
GTTTTGGTCAA^^ 
SEQ ID 1704 
LVFULVSIMHVRI^^ 
■ VLVKNGKMVRYTTGISSYWGLRARIFQADFPG 

SEQ ID 1705

TOATTCTCTACA!^^ 

SEQ ID 1706

LILYIAESLTKTKVHRKNPPEKFVBASPHNC 
SEQ ID 1707 

SEQ ID 1708 

LGRHCLSLQVIENQTFK^ 

SEQ ID 1709 

ATGCCGTCCTCGACGTTATCOjCCGGC^ 

AGCTTGCCAMCTCTGCGGCTACCG^ 
CGAAAMCACATCGGCAMCCCAMGGCTGGC^^ 
(XTOXXXXTGCCGCAGCGTTACC^^ 
ACAflftTCGACCAAAT^^ 

ccgo:a(^ctcggc(»caaaaaaggcat^^ 

ATOAAmACCCGCACACTMTCGG^ 
(mjXATCWXSCGGAAACCG 

SEQ ID 1710 

MPSSTLSAGSTDKPNGIPtfTKTQI^ 

I EFTRTLIGRFBVDLFEEFFHGI^^ 

ACG<^CAGGTTTTTAmAACGAATAAACGGMCGG^ i 

SEQ ID 1712 

K*YLVMjFAKQHDHAGRLGIERKRCVQNGVFDDLDDFI 

SEQ ID 1713 

TTGOmGCCGTCTCT^^ 
TTTCTCACGGCGACGAGCGTACOTCA 
AATCGTCAGTCGGGAAAATCGGGCGGMGAWGCCGCGTTG^ 
GTGTITCTGGATCCGTITrTCA 

TCmACATTCCCmGTTCI^ 

TTGGGCCATACCAMGGCCATCCCTACCGTTTGGATGTC 

AGGAAATCCTGTACGGCTTGGCAAAGGOGGCAGQ 

SEQ ID 1714

IAXIPSLVSGCmriVMSRRAAEADIAGW^ 

VPLDPFFKANLYOTtf^ 
IAjHTKGHPYRIJ3VESIVKVR^^ 

SEQ ID 1715 

ATGACCCAATTCAGGCCTATTCTGAATCAGGCGAATAra^ 

GCGTTGAACTCG 
SEQ ID 1716 



AIMS 

TGGMGGTTTCACX3Ca3ACGAO^^ 

CGCCAAAAGACGGATMTCCGCXIATTGCTGArc^ 

W 

SEQ ID 1718 

MCSSSDSKSRRSDVSAHYGVL^ 



SEQ ID 1719 

TTGAAAAGTTTTCKKIAAATCCGC^ 

SEQ ID 1720 

LKSPIiQIRHKPIiNRPLYKNAAHKASPHVKF 
SEQ ID 1721

CXXSTTGCGCCGOTArcGAC^^ 
flaGGATGTTGO^ 

CTGCACGGGCATATTGTCGGCAAACATGAAATCGra 
TTAGGOCTTTAGAC 
SEQ ID 1722 

MADLQKTFQTSFRDAMA^ 
LHGHIVGKHEIGTHFV^ 

SEQ ID 1723 

ATGAAAGCC^TGATACTGGCGGCAGGACGCGGCGAGCGTATO 

TGAAGCAGGCGGGGTTTACX^AAATCtntAT^ 

AGCHXMTTTCBAAAC^^ 

GCATCGTCCCTGC(KGGGCACATTTC(^^ 

GCMCGGACmCATTCA(XWGTGGCTAm 

GAGCGGTCAGMGCATACX3GGCC?reTGG(^^ 

SEQ ID 1724 

MKAMIIAAGRGERMRPLTDTTPKPLIiDVAGKPL 
ASSLPGHISAHIiW»VGNPPHNPIX3DFSLLPD6SVW 

SEQ ID 1725 

TTGACCGCCGTTTTCGCCAAAATGGGC^^ 

AGG6OTAAACGGCTTCACGGGGCGCAACCGGACGTO 

GCCCGTCX3ACAMTTCAGCCTGGTCTT^^ 

SEQ ID 1726 

LTAWAOTLQGIDSDFATFIRTLVILAALVLFLT^^ 

SEQ ID 1727 

TTCGGCATTGGCATCXKX^GGCTTCGCCTO 

(OTGtaACTC^ 

GOGGCTGGTTACGGCGGCGTGTTGACGTTGGC^ 

SEQ ID 1728 

IXSIGIGRLRLIDRRFRQNGIAG^ 
GAGYGGVI/PIAUCR 

SEQ ID 1729 

TTGGCGAAAACXXX^GGTC^TGAGGCGMGCCTGCC^ 

SEQ ID 1730

SEQ ID 1731 

TTCHTO^GCGTCAAAAATT^^ 

SEQ ID 1732 

LLLRQKFHCGFGADNVI ICLILFSIHLFVA 

SEQ ID 1733 

ATGAGCTTCAAAACXX^TGCCGAAACCGCCC^ 

AAATCAATCCTGCCOMGCGTTC^ 

CGCATTGCX3CCATATCGGCAMGACTCTGT^ 

GACATCAACCTGCACTTCACCGGCGAOT 

TOCTGTGGCGGCt^TGGTCGATATGAACGA^ 

AGTGATC(X}(X3TATTCTGCCTTGC^ 

GGCGC6ATGQCGGCATTGCTAAAAGATGCGATTAAGCCCAATO 

TTACCGCMCCa3TCTGGCGAAACACC^ 

^TG(X3GCAGTCGTCGTGGCGACTGTCCG 

CACATTTCCAACCTGAAAAACGTATTCGGACTGCCC^ 

AA(nrrcccTGAca^^ 

GGGCATCAAAGACAAAATCCGTGCGATTGCXXAAAAA^ 

CCGATCTGaTGGCGAAMCCmTATra 

TTGCGTOTGCGGCAATATGATGAA^ 

SEQ ID 1734 

MSFKTIMTAQSSIMRPIGEIAAKLGIjNVDN^ 

DIKiaFTOTFmGAANI^IAAMLDH^ 

GAMAALLKDAIKPl^VQTIEGTPAFVHGGPFANIAH 

HISNLKNVFGLPVWALNRFV^ 

PlCMAK^YSLSDNAKLIiGCPEGPm 

SEQ ID 1735 

ATGAGTTTGAAATGCGGCATCGTCGGTTT(XXXIAA^ 

OCG(X^TraCGAAG , KXXXX^ 
ATTGCCGACATCGAAACCATCGGCACCGMTTGGCGC^^ 






TCGATTTGTGCmAAACTX^^ 
GATCTATGTGGGAAACGTTGC^ 
GAAATCGCCGAATTCGAAGACXSGTGAA^ 
TCACCGCCGGCGTGAAAGMGTCCGOJ^ 

SEQ ID 1736

MSLKCGIVGIiPNW3KSTLFNM,T^ 

IADIETIGOTiALADLASVEKAIVREE^^ 

EIAELEIXjEKAEFLAQ^^ 

SEQ ID 1737 

TTGCCGGAl&TGCCGTCTGAAGCXX^^ 
AmO^CTGCCTGCCC^ 

Am 

SEQ ID 1738 

LP^SBAGFGGFRRSHTA^QPHQNKIPPAFPICPPRFLQNKLPALPL^ 

SEQ ID 1739 

ATGACmAGCOTACOTACCGCC^ 

TATOTTOT^maGGOTTTO^ 

CGCCGTATTGGCGGCMCGCTGGCAGGCGGCTTCl^^ 

GOTXXX?CCTACAGTTCGGCTTC^^ 

KK^TCGCTGACCGCXJGTGCGGATCXXX^ 

TATTCGGAACAAACCGCCTATTTCC(XXK3CCCC^ 

TCAOCGTTGCCGCCGGTTTGATTT^ 

CCTGACCCTGGCGmTC^^ 

GCGGGCTCGTTTTTCGCGATGAGGCTGCCG^ 

AAAMCAGCCGGAACTGCTGGTTTT^^ 

CTATGTGGAAGGCTAOGCGTCCCGTGTGTTCCAAAACT^ 

AGCCAGATGCXX}GAAAA(&GCCGCT^^ 

TGGCTTCGTCTTACAACGTCCAGCGCGCCM 

GGCAGCCAMTACCECAACXSTCTATAra 

GAATTGGCGAAGC(OTTTT^ |! ■ 

SEQ ID 1740 

MSQALPYRPDIIMiRAAAVLS^ 

GKD^FDPAQBERPIiIiHIWSLSVEEQFYFVFPILIiU^^ 

YSEQTAYFPGPAAIJPCLAVAALIYFN}^ 

AGSFFAMRLPFMQYDRIXjLTRSNTSCHNNTGKQCLWGDTEK^ 

SQMPENSRSUYDAGFFQKFDR^^ 

ELAKRFSEKQRFLDTRHNH 

SEQ ID 1741 

ATGAGCGTCAKXM(^CCTGCMTCGCGCGGC 

CC(&CaGCCflXaCAT^ 

CAM(XrGCCGMCGCA(KOTGMTTra 

MTGCCGACTGGTTCGGCAGCATGAACrGCOT 

CAGGCATTOCTTCACCGAGT^^ 

CGCCGGTATCGACCTGACCCGCCGCCTGAACCAAAMC^ 

MCGCGAAAAAAACCTCGCCGTACX^GTlCTACCAff 

M(mAAGA(^(3GCAAGC(^(mGCCXm 

GTTTraMGACCAAAGCa^CTTAC^ 

TTGGCAGCGTCCAACAAAGAAGCGCGCGGCTTT^^ 

tgttc»tag(x:gaat^caaacgtw^ 

SEQ ID 1742 

MSVIQDIiQSRGLIAQTro^ 
NADWFGSMCLDFLW)IGKHFSVNAMLNKESVK 
NAKKTSPYQF7QFWIJCVADAETO 
JAASNIOSARGFVNAKA^ 

SEQ ID 1743 

TTGGAMTMTCCGATTGCCGC&GAMTGCOT 

OT(mrGCACGGAAACTTATC^^ 

SEQ ID 1744 

LEIIRLPQKCRLKLSWIPIKOTPCra 

SEQ ID 1745 

ATGGATTTATTATCGGTTTTCCACAAA^ 
ACATCCGCAGCCGCCTCCAACAAAGCATTGOT 
CGAACCCGA(X3<XX3GC^^ 
GCCCTGACGCGCGACAGAAACGGCGCTTGGAAC^^ 

GGGGCTGTTCCTTTCAGACGGCATCGGCACGCCCG^ 
TTCAACGCCGGCGGAGCGGATGCCGCCGGCCTCGG^
GCACGGTCAAOm^ 

CGCCIHjCCGCAAKXICGTTTCA 

MTATACGCGGGMGGCGCAC(mCCTGGAAG(^^ 

OGGCTCTCCGG^^ 

ACCTGTTCGGCTTCCACAGCTTCAGCGGC^ 

AGCCGCCACATCGATACCGAACTCTT^^ 

AAAACAAACCGATTCCCCTGAAAATCACCGGTAC^^ 

CACCCTGCTCGAACMTGGCAGTGGCTCAAAOT 

SEQ ID 1746

MDLLSVFHKYRLmVAVLTOIiLAA 



ALTRDRN& 

mGGADAAGLGLRAOTSFRNI^ . 

RLPQPRFISRLIXJSLSimQNW^^ 

FKSGLYGGHTEGGISIANTRPATYRLQQNA^ 

SBHIPrEriFSDSLyVTSNGYimDTQ 

SEQ ID 1747

ATGCTTCC&ACGGTTTCAGACG^ 
TGCAGGAACTTATCXSGGCAAAACGGTTTCTT^^ 

SEQ ID 1748 

MLPTVSHnFYTIPMFHHSRQHRN^^ 

SEQ ID 1749 

TTGGAGTCGATGAAT(XMnTO£TTO 
TCATTCCCACGGAAGTGGGMTC 

SEQ ID 1750 

LESMNRV&SVRHSHQSNIST^ 

SEQ ID 1751 

ATGCCGTCT<^CCTT^ 
CCCGCC»Ga^AAATCCGGACCCCC^ 

SEQ ID 1752 

MPSEPSMIFDAPAVYGRGAGAVKYPNRHSRQHRN^^ 

SEQ ID 1753 

GTGC^GGCGGMTTAfiOCTJUDGCCGOOG^ 
CCATCCACCXXIAGGGT6TCX36TCGGCTACGACTTC^^ 
AMCAAGGGCGAAMGATAAACGTGACGCMTATCT^ 
AAATTCAMCCCTATATCGGCATGCGCGTCGOT^ 

SEQ ID 1754 

VQADLAYAAERIWfPEPTGAKKGiaSTVSDYFRNIRT^ 

KFKPYIGMRVGYGBVRHQVBSVEQETTTVTTYLQSGK^ 

SEQ ID 1755 

GTGTTTCAACACATAGCACCGCGOCTCCTGCGW 

AACCGCAATACCGTTTACAA 

SEQ ID 1756

VFQHIAPRIjLRVFVRIiARS^ 

SEQ ID 1757

TOT^TTTCGCGCGT'ira^ 
ACACT(XXX3AAC£CAACMTGT™ 

SEQ ID 1758 

I£mVRRREiaiLFPRI&GGT<K^ 

SEQ ID 1759 

ATGTCCGAGTGGCGGGGTTTCAACCGAAAAGGAAATACAAT^^ 

ACCTCGGACACAAACACATCCTCXIAAAAACT^^ 
AACCCCGCCCTCCCCTATOm^ 
GCATTTATCGACCGCCTGCTGCGTCAAACC^^ 
TGC^GACXX3AGCGCACGCCTTCTGT^ 

AAC(m^MCGTO^OTCCTG^ 
SEQ ID 1760 

AFIDRLLRQTLNTRYLLVGDDFRFGAGREGCFELLAQQPra 
DGAFGTRRGVASFGFNPTVDGGCSQKLEVHL^ 
SEQ ID 1761

ATOTCCGttGAAACm^ 

(XXxAGTCCGCGACGG&TTGGtfX^ 
Gmm3GATCG(?KmCCCGra 
TCC^CGATGTCGGCAMGGCTCGGGCGCGAAACT^^ 
CATCGTCGAAGCCATCGGCGGCTCGGCMTGGA 
AMXTCGCCGTCAAAGAGCTTGAACAGGCftG^ 
ACCTGMATTGGCAGAACAC 
SEQ ID 1762 
msaet^igwvglg^ 

VSGSVGPA1NGTLIJIjFGGSEAVIiNPI£OT 
NIAVKELEQAGNIlPAVETVftASYRKAVEAGyGB^ 

SEQ ID 1763

GOTTTACGCTTGAAAAACAGCC^ 
GCATTTCCGGAAGOTGCGCCATAAAA 

cGGCAAGramauxxrc^^ 

AAaG(m^^CGGTTG(^GCMTCamS 
SEQ ID 1764

GFTLEKQFVQIlPCSAARCfflDIEKY^^ 

HRLOTVAAIGH 
SEQ ID 1765 

TTGGOTACAATCMCCTm 

SEQ ID 1766 

LGYNQPFNCSDGIGVSRCEILFEGQCRLKPKYCNNR^ 

SEQ ID 1767 

TO^TACACCACGTOI6CGCC«XA 
AGCGCGACGGTGTOTTAAACGGATAGGCMCGTCMT^^ 

AAAGTOTTT'^PC^COG 
TGGCAGTCCCAACCCGGTACATMGGCGCGTC^^ 
GCGGGCCGTCGTGCAGAATGAATTTCGGACGGC^^ 
(ATCGGAAACGGGOTTCGAGC^GGT^ 
GAAACCCCTATGCCGTC 
SEQ ID 1768 

LIHHVRAGRHGLARRQSPRRGRP^ 

KVFFLTGNLPGGIFAAQFAETGFRHIFAVQFflSHFDGQAW 

HRKRAFEQVIXjFTVIGHFLISIVTIFRFQT^ 

SEQ ID 1769 

ATGACCGATTACACTAAMCKX^TCMCCTGC 

TGCGCGAAATCGCCAAAGGCCGTCCGAAATTCATOT 

CAAAACCCAAG(XGGTTTTGACGCG^ 

TTGTGCCGCGMTATGCCGCCGAACAGATTGCCCGra 

(XXHCCGTATGCTOGGCGAAATCTACAAAKXXK^ 

AGTATCGCCTGCGATTGACGTTGCCTATCCGTO 

ACTCTGCCTGCGAGCCAGGCCGTGTCTGC^^ 

GCTTTTCAGACGGC&TCGCCATC^^ 

SEQ ID 1770 

mmSKTVNIjLESPFPMRGNIiAKIffiP 

LCREYAAEQIARQKKDFIRLGVLGDWDNPYL^ 

TLPASQAVSAGABVVYQHOTPKGKLVL^ 

SEQ ID 1771 

ATGAATCATCCGTTCCTC(MCGCGA 

GCAATAAATACGGC&TCGAGCITTAC^ 

GGAAGAAACCGGCAACCTCTTGGCAAGCAGCAA^ 

GCCX3GCAGCGACGGTAAAACCCTGCGCGACAAA® 

TGGTTCACGCCAACGCMTG<K^ 

AGGCATraAGGOTGGTTOT 

TCOTTCTAAAACAAC(mAGAATTGGM 

GCGCACCGTATAAACAGCTGCTGACGCJ^^ 

CATCCTGCGCCTGTGGGWGCATCCACCGA^ 

GCCAATTTGAGCGACTTTMTttXiAT^^ 

ATCCGCGCTATtmrcCACTTCGC 

CTCCCGCCGCAOXXmAACC^ 

GACAGCXnXXTCTTCX&TAOT 

AGCCOTGCGC(XXX3ACAAAACC^ 

GC?IGTCTAAAGCAGAAGT(&AAGTCGGTGATGA 

GAAACCGTCTGCAAACGCTCTGCGGAGM 
SEQ ID 1772

MNHITLERDIPMLNGEHVTTDAG^ 

AGSK5KTLRDKAIKAVDIXrEFFPPWGRARI£SM^ 

SWKQREELEWPADLYLBGSDQHRGW^ 

ANLSDFNPimVQQADMVEIDK^ 

DSVLFHTWHEFPAIKEKAEAELVKKWTM 

ETVCKRCAENVGGBGEPRHXA 

SEQ ID 1773 

GTGGGMTGAC(XJTTCGCTTOT 
ACEX£AACCGTCATCCCC^ 

SEQ ID 1774 

-VQfimiiRPADSPTEWSEPSDGIPDM^ 

SEQ ID 1775 

ATGCCGTCTGAACCTTCAGACGGCATTTO 
CCC(XX£AGGCGGAAAT(XX3GA^ 

SEQ ID 1776 

NPSEPSDGIFDAPAVYRRGAGAGKYPN]^ 

SEQ ID 1777 

ATGMTCCAGCCCGCAAAAMCCTTCTCTTC^^ 

CX^TTTAGCCTACGCCGCCGMCGCATTACCXACG 

(XCCAGGOTGTCtt^^ 

OiGAATAAGAGAGACCTGAAGACGGAAAATCAGGAAAACG^ 
GTGCGCGCGTCGCCTAC(KaACACGTCAGACAC^^ 
TCCGCAAAAAAACAOTATCAAAAAAGCAACA^ 
AACTGGGGAC6OT6GAAAACACCCGCTO 

SEQ ID 1778 

MNPARKRPSUiFSSLLFSSUjFSSMQ 

QNKRDIJCTENQEWGSmVSSLGLSAVYDFKLN^ 

NWGRLOTPRFKTHEASLGMRYRF 

SEQ ID 1779 

ATGMTCGTC^TTCCCGCGCAGGCGGGMTCCGGTO 

SEQ ID 1780 

MNRHSRAGGNPVRSVSVISDKFIiLLFISRFPLPWE 

SEQ ID 1781 

TTGATGGAAATG(XX3GGCCGAA(X!ACAOT 
TATTTPATAGAAATGAC6AAATTTTAGATTGCAAGAAOT 

SEQ ID 1782 

U^(^P¥GPPDFRIiRGinX^^ 

SEQ ID 1783 

ATGAAACGCCAAG(^TTAGCCGCMTGATTGCTTCOT 

CCGCCGCACMGCTACCX-ICCGAAAC^ 

AAAAATGGAMCCGTCGAAAAAACCATCSAmTGGACGA 

GTCGAATTCKmCAA3^^ 

GCTTCAAAGCX^CTGCAACCGGGCCTGTACATCTACC^ 

AGTGGATAAA(5AGTTCTACATCGTCCAAGGCGACTTC 

TTCMCGGCCACCTAGGCOT 

GCGAAATCTTCGACAMGTTTATGTTGMGGCGGC^^ 

CTACACTTTGGTCGACC^CTCCATC^^ 

GCC(mGC(XXXXmrrTCT(^ 

SEQ ID 1784 

MKRQAIAAMIASLPAIMCGGEPAAQAPAETPAASAEAASSAAQATA^ 
VEFSNNPSSWPHNVDFHAATGQGGGAAATFTAPGRTSTFSFKA^ 
FNGHVGAIAGDNALKAKAGBTVRMYVGKGGPNLVSSFi^ 
AGSGAASAPAASAPAASAPAASASEKSVY 

SEQ ID 1785 

ATGCTTGCACATCTTCAATTAAOT 
GCGACXXIXXXjGGGCTTACGGAGCAAGTCOT 
ATCCTGCCCACCTGTGTCG6TTTGCGGTA(X^ 
GTTAAGAAAACKXXK^TTTGCCTMGTCTTCC^ 

SEQ ID 1786 

MLAHLQLTFRffi&OTOTSTT^^ 
VKKTRICLSLPPTGLNKLFQQLANLTPSVPTSa 

SEQ ID 1787 

ATGTCGGGTTTGACXSAAACTCGAAGCCC^^ 

TG(X3CG&^TCA(£GCMTATOT 

GCOTACCGGTrGTCGGACGTrGCCGCCAC^ 

GCCGTTTCTTGGCAGCTGGACAGGGCGGCGCAAT^^ 

CGTCCGCOXXSTATCAGG^^ 






PBGGEYV*RIiQGAARRAGTGVGRGIiSCQ 
^^ACOMOTIO^ 

tccga 

SEQ ID 1790 

HSTVFIESIESAPSmmTEPSPPRKMESRTBCiaBlFIR 

ATCTAGGACGCOGGGOTrGGG(aJiCCGWrTATCCGATRAGTTTCOT 
MGAMGGGWGT^^ 

CAGTGGTfflAAGTGGGCGGTGCTGTCGTCGTTTCAGTATCG^ 
ATCAGG(KS3GCTGGCAGARATACTTTTTTTTGGTSC^ 

TQCGTCGCTGCGGroiTOGGK^ 

SEQ ID 1792

Vl^AISIHPTCIiPFSSNPSOTPFRPPGPTQPHBHSHGS<^ 

gWSmVLSSFQYREPA^PSFro^ 
CVGAVLAVLQJIVHRKDSKRr 

MgKGTCCACAACAAATT^^ 
GGCGGTGC^GCAGGAAGOKSCGGAGQXMGTTTCCGGGTATT^ 

TO^TATGTCAGCCAAACCACGCTCTCGGTCGA^ 

CAACOSGCAAACOXICGTCAAAGAGCTGGCGGAACABl^^ 

OTT ATAT«rrG<»CAAC<X^ 

TACGCGGATGGGGGCATGAAACCGTQ(MCGAAGGGGGGGGCGCGGAA(W^ 
SEQ ID 1794 



AYHVDNASYLQRTWFEGKSKVtnm^ 

SEQ ID 1795

CCCCCCC^GCC^^ 

ACGTGCGTTGCAGGTAGGACGCGTTGTCX^CCATATACGCA 

(^CTGTTCOXX^GCTCr^ 

GTTTCATCGACCGAGAGCGTGGTTTGGCTGACA^ 



•MCCGGJ^CGO^ 
raCGCAGGTTGT^^ 
AQCCGCGOGGATTGGCAMKjATGATGGTTT^^ 
ACCGCACCGACGCAGATAAAGCTGTCGGCM^ 

ALFRQIiFIX^IjPVGRRIAEOTLVRIADIGETR 
TAPTQIKLSAILKAG 

AAAAAA(£COTAOTC^^ 
CATCAOCCGCATCGGTTTCAAOTtH^ 

SEQ ID 1798

MSTFFRQTAQAMTAKHIG^PL^ 
KVGAQSHLKAMCLMjLKAANRLSAPAAA 

SEQ ID 1799 

aACAftOSCMTATGACACT^^ 



CATTGGTTGTTOSCGATACGGCXaCGACCTGCAIATG^ 
CCTAITGGACGGTTTGTCCGAACTX^^ 
SEQ ID 1800

MTPPKLIIFDWDGTIiADTTQ^^ 

SQTATGGYWLATACAGEYPSKPSPEMVFGLCGELGIjDPREA^ 
SEQ ID 1801 

ATGAACCX3ACTCGA0CAACTTGGAATO 

MGGCAGCCGCAC«y^GCACATCG(&^ 

CAGGCGCGAACGGTKXyKntGCTQAC^^ 

COGC0t3TTTGCmmGG& 
SEQ ID 1802 

MNI&DQLGIRIHLICNVFDKW 

RLFAELDALAEVMEKTISENKK 
SEQ ID 1803 

ATCAAATACAAAGACCTGCGCXSACT^ 

GCXXXXMGGGCTGGCGTT6TT6TTTGAAMCCCGOT 

GGACAGCGTGTCCAAGCTGCGCGAAATCGGGCAGAC^^ 

ATGGCGCCX^CGTGGTGAAAAACGCGCX^jTGTCAG^ 

CGTG(K3GCTTGACCGTCACGCGCGGGCCGCAC^ 

GTTGGATTATC^GGAATTCCGCAMCTCMT^^ 

(^TACttGITOXXX^ 

AAAOCGCGTTGGAAGGCCCATACGGCGACCACAC^^ 

CACAGO^UUiaXmSACG^^ 

TCCTACCGCATGGCGGTGGTCAGCATGAAAAAAC^ 

ATCACGATOIAAACGTSCGCGACTGGAAAGAAGTC^ 

CCCCGTCAGCGGACTCGGCGGCAAAATGGGTTTGGATG(^ 

GGGATTTGGGGGAAATTGGGGTTG 
SEQ ID 1804 

MKYKELRDFIAMLEQQGKLKRV^ 

MAPNVVKNAP(#EIV^ 

EYQFAGLLRGSRTELWCrc^ 

SYRMAWSMKKQYAGHAKRVMTGCWSmQFMYTKPIi™ 

GIWGKLGL 

SEQ ID 1805 

TTGGGTTTCX3GATGTCEAAGGMGGGCT 
AGAAMCCGGGCAGTTCGGACAAACCSTCCAATACGGC^ 
TATGCAGGTCGTGCGCXX3TATCGCCGACAA 
GGTGGCGAGCCAATAGCCCXEG^^ 
GGGGCATCGGGAAAGAGTGTCATATTGCGGTTGTTG^ 
CGGGCAGGCTGTATCG3ATCAGGCTG(^^ 
GTCCCAGTCGAAGATGATGAGTTTGGGCGGGGTCATGGCA^ 
SEQ ID 1806 
LGFGOffiKGFFAKASCRLimL^ 
GGEPIMGGGLADGVVQTRPPFARGIXSRPEALRFELVQ^ 
VPVEDDEFGRGHGRLVAVKKQIIX^KRRRVAFSTKFVG 

SEQ ID 1807 

TTGTGCTTTATTAAAATAGAACCATTAT^ 
CA(MACGGCGGATTTGGAAACCGTCCATATC^^ 

SEQ ID 1808 

LCFIKIEPLSFI^JGQFMSWRIIOTAATVLAALSSPVF PASPPAA 
SEQ ID 1809 

ATGGATGACCGCCK^TAOCCGCGTC^^ 
SEQ ID 1810 

MDDRLmVSPCRLKDKNAAEPEQPC^ 
SEQ ID 1811 

TTGGW^ACGCGTGATtm 

TGCAGGGATGCATGCAGGCGGTTATGCCAC^^ 

GTCGGCGGCACGAACACCTTTAACATTCCCGGTO 
SEQ ID 1812 

LGKKVMEGVETEISGAITPKWQIHAG^ 

VGGTNTPNIPGSERSLTAKLRYSF 
SEQ ID 1813 

ATGCXX2TCACAACCGGAGTATGGCATTGCGAAGGAAAA 

AACCCATACCGACCGACACATTCCAAC(TO 

GTCTGMGGCTATCCOGTCCTCTTTGTO^^ 
SEQ ID 1814 

MRSQPEYGIAKHIRPNRQADRFAGSVLPIiAASLTHKPNIMKPIPOT 
SEQ ID 1815

ATGCAGTCGCTGATGAACAACCCCGTTACCCGAAGCAAC^^ 

CGOTC^GACMCGCCACAGtt^^ 

T6AAACCGCCGTATTCGGACACTCGTTCGGCGCATO 

ATACTCGACTTTATG(X!GTCTGAAAAC 

(XXMCAMTGGCG(£AGAGTTGGACAGGCGCGGGA 

GGAAGCGTGGCAACGG 

SEQ ID 1816

^StMNNPVTRSNAPCLIVGIGYTTGS^ 
IU)FHPSEmDGIDACLNIGALERSS^ 

SEQ ID 1817 

ATGGCAAGCAAACACGTCAGTCAOTGGAAAAC^^ 

CG(TCXftCACGCTCGACGAT(^^ 

GGAATACCGC6AACOTATCCC6AAGTGGOT 

CTtftTTGCGCGCCCGCT^^ 

CCTACACCAACCAGCAGAAACTCGACCTC^^ 

GGGCGCGTGCATCGGTTATCAGCCGCTTKX^ 

GACAGGGCATTTTTGAGTGCGAMGTCCXX^GT^^ 

SEQ ID 1818 

MASKHVSHLENTVQAKLLHHNSRNLSLTEAGEEyy 

LIARPLAEIEFALLASPBFLRHNG\TO 

DRAFLSAKVRSPIDFLNHCIASRKGCRNAV 

SEQ ID 1819 

ATGACCGAGCCGACCTACATCOCCCTGCGC^^ 

TCAGCGATTTGATC&ACGAATT^^ 

GTTCCGCGC(^TGCTGGTTATCCGCAACGAT^^ 

CTGGAAAACGGC6ACMCAGCGGCTTGATTTGTTO 

(^TGGTTTCCXXftTACGTTTCAOT 

GCATCCGACACAGTTTATGMttGCGACto^ 

TTCATTCCGCXXX3AAACGATGCTGGAGCS 

TTTTraCACGCCCaCGGTTTATC^ 

GGMTATCAGGAGCGGCTGGACTTOAGCTGAACAT^ 

CCGGGACGCGGTTCGGGCGCGGGTTCGCTM^ 

CCGACOTTCACGTGGACTTCTGTC^ 

(&TCATCCGCGACGTGGGGCGTGTGCTGGM 

(XmGATTCAGGAATTGCTOAAGCGGAAGAAGC^^ 

GCAAGATTTOGATTACAGC3CCCGTGTATCAGGCGGACGAA^ 

CMCCTGACCAmTCGAAAT«m^GMCM 

ACCACCGCCGTCTTCCAGTTCGAGTO 

ACATTCCCGACTTCGTCGCACGCATGMGGGAC^ 

GGCGCAGATTATCGGCGGCTACTCGCTCGGCGGCGCG^ 

(^GGCATTTCGCGCGAAAAATCCGACGtf^OT 

TCAMGCCCACTATCCCGCXX3AATTTATGGCGGC^ 

CGACATTMCGAATCCGACTACCGCTTCACGCCGTATO 

AGCGGCGGCAAATTTACCGGCCTGTTGC^ 

GCGCCATGCTCTOKXSAAa 

CGACGCACCCATGTGGAGCGAATCGGAAAMC^^ 

AAATTAGGCCGTCTGAAACCGCAAGACAGCGTGOGCX^ 

AGGCTGAAATCATGGTCGGCGGACAAACGCTGGAAAACTGC^^ 

GCGCATCATGGCAAACCAAffPCATGACCCl^^ 

CACCAACTGCC(X^amX^C6CA^ 

AACTGGAMCATTGCTCXX^GCCGGTCGGTCXX^ 

SEQ ID 1820 

MTEPTYIPLRLHTEFSIHXaMVRIKK^ 

LMmNSGLICLSGAHYGEVGV^ 

nPPEfEMLERPJfflLPEALEMTVEIAKRCinfllTI^ 

PGRGSGAGSLVAYSLHTDIjDPLKYALLFER^ 

PQIQELIiEAEEADELITLAK^ 

TTAWQFESTGMKKMLKTAHTTKF^ 

QGISREKSDHFNYMEKFAGYGF^ 

SGGKFTGLLDFCERVGKEHMlOTTLEM 

KLGRLKPQDSVRLAGFVTAVRTMMGKRGKI^ 

HQLPDTPRIPLQLSYANEKASGRI^VPPKWTTO 

SEQ ID 1821 

GTCGACAATAAAAGGCGTATTGTAGCAGGGTTGOT 

SEQ ID 1822 

VDNKRRIVAGLSIiDGGAGKAVS^ 
TCGGACGGACGGCAAGCAGT

SEQ ID 1824 

HFFELLRRQKNASttV^^ 

SEQ ID 1825 

SEQ ID 1826 

LTLMHILKRGLHyi^AlGIKTKSKTCIiNVKSI 

GGCATCAAMmAATC^^ 

SEQ ID 1828

MPPSVQRl^AQI^AAKPVPRRIRIDADAHPQTO 

AC^GGCAGTGCGGACACATCGCCGGTmraxm 

SEQ ID 1830 

l^ISVNADSSGNRICa^ 

SEQ ID 1831 

ATCGMGCCACCGTCTATCT^ 
TGCGCAACG6CGGAACCGAMTCCGCAAAACG(^^ 

SEQ ID 1832 

mWYLEDNEYIAIiCDimAGLAES(X3Q^ 

C6ATTTACTGGGCX3GACAAG6CACGCTAOGA 
SEQ ID 1834 

SmTUHKASALLGEPLliDKPVRPESW^

SEQ ID 1835 

AATATATACTGOTTTATTGATACAACTIT^ 

SEQ ID 1836 

NnCFY*YNFSTQ©WDTLFSLKVFRQVVQ 

S A^CAcScTTCCOXK^^ 
ATTCTGATTTC^ 

CGAAAAGAGTTTTMCGACTATCAAAGCCGCTTTTA^ 

AATGTCGTTACCCAGTCC^ACMCCAACGCCCT^^ 

GGGCGGTATCCGAATCGGATTGTTCAGGA 



S |re^A^GTCCMCC(?^^ 
CTCGCCGCAGTCGCTGCCGCAGCATTCCC^ 
GCGCGCGCCGTCAMCTCGATGACTTCTCCGC^ 
TCX3GCMGTCCGGCCAATTTCAAGAGGT^^ 



SEQ ID 1841 

ATGCAAAOJAAAAGCCOTAra 
TGGCGATA 

SEQ ID 1842 

MQRKSRYAVIPAQARIQTLIYWKCLRLTAISNFRI PAFEGHAI 

SEQ ID 1843 

TAATCTACTA 

SEQ ID 1844 

MGCTVWNEGRDVSM^ 

SEQ ID 1845 



TTAAATTCTTCJ^GGTT^^ 
ATT03TCGCGTACCGa5CAGGA3TO3OT 

TCC AGGTTTTC(mxnra<X^^ 
Gtt3CTTCGGAGATGCCG(XXX3^^
GCAGTGCAtXamra^ 

SEQ ID 1846

VEVAARA($Q5EAVCQKRW^^ 
FVCI^RQRAVVCAADAPimGGDGFKTFQTAFADI^ 
AVQVYHGVLCGGE 
SEQ ID 1847

(XCTTOCOGM^

GCAQGACGAAAACCT(K1AAAACCTGT 

GCGCTGGOGGCAAACAAAGAAATGGTCACXXGCA^ 

GCGCGGTACGCX^CGJATGGGCGACG^ 

CCTGTTT6AAGAATTTAAAAA0CTCGG<^^ 

GCGGGCAGCGACTTCCACGTOT 

SEQ ID 1848 

MIDIiiCHSTVSIX^PAEVVRIiAHQNGCTLL^ 

AIiAANKEMVSRTOAKnilKNGH^ 

AGSDFHRUJDFSGGIU^CPDLPENCKFVRAHFSRH 

SEQ ID 1849 

ATGCCGTCTGAACCT^ 

TGATGGACGACCTOGGCGACGCGCT^ 

GGCGGMC&CGCCGTCGAGAACATCGGCAACTACT^^ 

ATCGCGCTGACCAACGCTr^^ 

MTACATC(KKrrATGCCGACGTGCATGTCGM« 

GCGCGTGMCACGGCATTCCGCCGAT^ 

CCGTTTTACCGCAATCAGGCGCAAACCOCCGTro 
CCGATGAAAATCCACAMCCCGAAGGCGCGATTTTCC^^ 
GCGAACMTTCTO^^ 

(X3TCCGAAAACCGTACGACAACOT
SEQ ID 1850
MPSEPSWIPYKPICLTHTEFD^ 
IALTNGSQKAFFYIiFNIiF<^ 
AREHGIPPI IDNAYGMPPP*EIDSGWLNWHENI ILC^ 
PHKIHKPEGMFLWLOTENLPVSSQT^^ 

SEQ ID 1851 

ATGTTTTTCGCXfflaGGM 
TAAACCGCCCACCAAWaT^^ 
GCAGGTCG(^P6T!rc^^ 
SEQ ID 1852 
HFFMGSASVHKVSDFQHKFAVKPFAG^ 

SEQ ID 1853 

TGCGCCGCCGTTGCCGAAGGTTCGCCCGTT^ 

CAAOmXKK^TCCCGC^ 

CGACAACGCCC7ITGGCACGAATTGGACAAACAC 

SEQ ID 1854 

CAAVAEGSPVIGimDDAETDNPRRAAmQSAAELAQGL^ 

SEQ ID 1855 

ATGMTCTGGAAACCCTGATCGCCCTGC^^ 

CGGACXMTGGTTCGATTTTGCCAACTCGCA 

OTGACCACCGOTTTGCCGGCAGCCTCGCX^ 

TOGTGACCGGCTCGQjTATGGTTTTG^ 

AACAGATGCGCX^TATTCCGC^ 

C^GCX3TGGAAAAAATCCTGCTCX3CCATTCOCA^ 

ATQGACGiGGAAAATCAGCATCGGCACGCTC^ 

TG6TAA£XX5GCGCGGGCGGCTOCATCGGTTC^^ 

CACfl^^ 

TOSTCCraTCTCCACCGAC^ 

CATGGTACGTTTCQGCAATGTTTTA<^ 

ACCATACCCGAAIXSGC^^ 

CCTTAAGCGGACTCAMCXCAAAACGC^ 

CAAAAOCGGCCATOGOGCATCATCACCGOC^ 

GAccTGcranoAccmx^^ 

SEQ ID 1856

MNimiALPRNimCFLIH^^ 
SVTGSRMVliRGLLSDHQKKTHDPVIH^ 
MDGKISICTLKKISVSDLI^
HAAAYKIWPIWEFNTVBGII^ 
TIPEAAQLVIQAGAHGRGGDVFV™^ 
DLLVNAPTGnPTGGICDLLWVCETHRKNAV 

SEQ ID 1857 

AT6CCGTCK3AAG6GTTATCGTGCGGOT^^ 

MTTCGACGGCGTTTTTCAG^ 

TGACGATGCGGTCGCGCGTCCAGTCGTCOT 

GCGOUiACTTOGa^ 

AAACTTTCGTGCAGCCAGCGGAMCCGGGCGC^ 

(X£TACCG(X£TCGCCGCCGGTGGTGATG# 

TTCMTCACCCACAAATCATGTTC^^ . 
GTTTCCGCGCTGATGTTTTG^^ 
CCGCGCCTATGCmTTGCTTTGAGTGC^ 
GCCCGTCCAGTAGTTGACTTTGTTGGACAGCAG^ 
. TCGGACGGCATTTOX^CCCTAT(m3TC 

SEQ ID 1858 

HPSEGLS(X3FGQ1XjFDVFFAIM 
AQTFCQLGGVPLAARRPFGQVFELDAADDSLHF^^ 
FNHPQIMFFCQSHDAVHFGItfTO^ 
ARPWDFVGQQDFGNttG^^ 

SEQ ID 1859 

atgccgtccgaacccgacggaccam(^ccatgctgm 
acttcgacgggcagcgmtgcegcgmtttg^^ 

CATAGGCGCGGGCGAGGATCTGATTGTTACCTC 
AGCTCEGAAACCGTatfU^GTGC^ 

gggtgattgaagactgcgckxiaagcgcacggcgcx^ 

03gc(k^atggttacgaccaacgacaaaac^ 

cacgaaagtttcggcaccaactggcgc^tgato 

aaagtttocqcaaatoaaaagcas^^ 

(xxx!atcgtcagcgaactgaa(x3cgcgc^ 

SEQ ID 1860 

MPSEPDGPNTMLNTALSPWPSFTREEADAVSKVL^ 
SAETVKAVLTPITCKAVIVVHIA^ 
HBSF(nWWRMM^AVI(aiQliKHLPEWTARRQQ 
AVEDSGTALTFLVHPIliTDDEIAFCKKHIKAVLTEAAR 

SEQ ID 1861 

TTGAAAGATTGGCAAATGAGTAAAGCCGTCAAAC^^ 

TAGGTTCGCCCGTCTTOTCATTCGGG 

CGAACGCCTGACCGATTTCGGCAAAAMTTA 

TATCTGCCGCTTTACAAOUATTTCAAAA^ 

TTTGGTACACCGACAATTTCAGCTT^^ 

GGGGAATOXyUACTCCKX^ 

GTCMrcGCTTCCCXXyiOT 

CCGCCGCGCTCGGCTTO^^ 

ATOAMGACGGCGTGATTGTGMCACT^^ 

AGCXXK&TAGGCACGGGCXSCGTGCAGCCGCCAG^ 

CGGCAMGCCOT^ACGGGCAAAMCCC^ 

SEQ ID 1862 

LKDWQMSKAVKBLFDIIASASGLn^^ 
yLPLYNKFQNRRHEMKPGIT^ 
VNGFPVIGTTLLIiENSLSPEEFDIT^ 
SRIGTGA(^RQQTTVGS(OTAG^VI^ 

SEQ ID 1863 

ATGAACATTACCATAGCCGCCCCCTACTO^ 
TCAJGCACTACGACAAATCTPTCAGAC^ 
GACCAGCCATCACCGCTTCGTCAAACATO 
CACAAAGCGCGTTTGGGTTACAAACTGATTG 

TGCCGCCATCGCCCCCCXXXXXXS^^ 

GftCGGGGAAAACGTCGAOTGCAC^ 

TCGCCAAAGGCTGCGACATTGCCGTCAACGCCATC^ 

CGAAGTTGCCGAAGTCCTTACOTGC^^ 

GAAATOGTCCGCCGCTTCAGGCACGACAT^^ 

SEQ ID 1864 

MHIYIAAPYCSLPSEPHFNRFWflAELI^ 

HKARLtmiVDVQDWP^^ 
KmELHIMGGGPDLDR^
EIVRRPRHDISYRKIVNIiIERIAKE 

SEQ ID 1865

ATCGGCGGCATXTCTGGCAGGCPGCAT^^ 

CXXnTCGCAGCCGGTTGATTTTTACCGGG 

(nCGTCCCGGCCTGC(XXXOT(^ 

GCGCGTACATCATCGGCAACT(£AAAC^^ 

AAAAGAATGCGCCGTACTCGGCACACITOX}^ 

GGCATAGGAGAAGAAATGC^CAAOTCAAAGCCCAA 

TAGTC^CACGCCGCATfGOTAGGTGATC^ 

CATTACCGGCCAAACCGGCTACTG(^ 

CATGTCGAAAIXTOTGCTCCG^ 

SEQ ID 1866 

MGGISGRLHPAPPEKFAQWSLFEOTRFPIMKI^ 

WPAOUiAAKIVSPKTmCHSHGL^ 

GIGEQfiMjKAQAKRLGIEDKVTFLGGV^ 

HVETLCSDDEnRTTHAAYEM 

SEQ ID 1867 

TTGCTATATMTAACGCCGATAAAAAAOGGCC^^ 

SEQ ID 1868 

LLYNNADKKRPVQTAARPEKRNPICRTPPAAATA 

SEQ ID 1869 

ATGTGGAAAATGTTGAAAC^ 

TTAATGCC6T6ATTGCGGGGAG6GT6TGGCAGGCGTTG^ 

TACCGAAATCGCCGTGCCGGTTGTGTTGC^ 

ATTGCCGCGACATCCGTCGTATCCATATTCGGCGOT 

TTGCCGCCATCAGCGAAAACCTCTATl^^ 

GCGTGTGCTGATTTCCAACCGCGAAGCCTTCG^ 

ATTTATTCGCTCGGCACTTATC^^ 

ACATCAAAGOCGGAACT

SEQ ID 1870
MWKMLKHIAKIHIUaU2OT 
IAATSW5IF(»CIMLLVIjEFW^ 
nSV6mWMFAMSU3DVPRLVEQYSN™ 

SEQ ID 1871 

TTGGTCTTCGGC^ 

TGCMCAGCCCGACCTGOTGAAAAAGCCGTGC^ 
TGATTGGTGGGCGGTTTATACAGGCGTGATTGOTATO^ 
SEQ ID 1872 

SEQ ID 1873 

ATGGAC^CAAAAGAAATOTCGGCTACGCGGCAGGCTCGAT^^ 

T6ATGCAGAOG6CGGCtX36ACT6ACG(MGTCG6TA 

GCCGCTGCTGTTTTCCGCCGCGATAG^ 

GGCTGCTGCACTTTCCOXX^ 

GCCGTTTTCGCCCGCCGTCCTGCACCGGGGGC^^ 

CTGGAACAGCTCGGCGTTTATTCGATGGGTATTTCGT^^ 

CCGGTTTACCGTCGTATCGTGTATCXrFGCCGC^ 
CTGGCGGCAAACCTGCTGCTGCTGGGGCT^^ 
GCCTGTGGCAGCCGCTCAAACGCCTGCCGOT 
GGCGGCATATCTGGCAGGCTGCATCCTGCGCCACC^^ 

SEQ ID 1874 

MDTKEILGYAAGSIGSAVLAVIILPI^^ 
SFLPIRFLIiVLRMEGItoLAFSSAQLVP 
LBQimSH(nSFGGAALLLQSIPSTVWmiFRM 
LAANLLLI/MVPSGGTR^ 

SEQ ID 1875 

ATGTTTTCX3GACACAAATATATO 

TCGGACAGGGCTTCCACGTCAMGCGG(XX^ 

CGGACGCACACCGCOT*^^ 

(XXMCAA(&(XX^TGCGGACT^ 

GCAAAACCGCCCTTTCAGACGGCAGCAGC^^ 

ACCTACATCGCCACCTOGAACGCGATGAAGW 

ATCATGGTCGAAGCAGGCTCGAACTCAC^^ 
ccTCGQC( ^^



ARAAACGGCAACGCCTTT 
SEQ ID 1876

GIKTBCGLI£MlAREUmOTS^^ 
TYIATLEKDEDKLRPYRfflSHIRIlfflSBTAIXMDIflniffil^^ 



KNGHAF 

ACCGCCGAACTCAAAATGCC(XX!CGTCA 
A(^TCGACGAAACCGTCCGOTGAC(^ 
CGCOTCCGTTTCGCO^^ 
TTCG6ACACAAATATATCCA 

S HP^PRIM^ 
RRPFRRPAQTLRQPGTFRLVAGARR^^ 

CMGAAACOTOXCC^^CAACG^^ 

GCCa&AACATTTCGCCTCGTGGCTG^ 

^ VPKKMffiSP^PAYHFDVQTA^ 
n^IffllSSSELVEIVLEELFGCirrEAAVRFAAI^^ 

CTCGMTTTTTCCCCG^ 

ACACGOTOTK^OmT^^ 

CGTraKCAAGCCOSTOGGC^^ 

^TCGTGGTCCCMCTOCCGTCAACGA^ 

CCC6CGCCGCCGGCGT&TCGGTT 

TOFFPBQGEFFHTOTOSERAFCHIUna*TRFA«m)m^ 
DESWSQLAVHELVMFRESIIVEHDRPEHPVYHSAEEGIiEIARASGVSV 

GTOTTGTITTCTGTWrCGCGGGAA 

SEQ ID 1884

IJiIPQIPACAGmQKIitNBKQPKPDK^^ 

S fra!cC^ATraXXmAMCAGAAAA^ 
AGIXGTCATTCCCGCGCAGGCGGGAATCTGGMTTTCMTO 

SEQ ID 1886

IOTIIPAKTEmaxaMFBHSIUWS^^ 

^©i©c!Scagcaocttagg<^ 

(EGTTCAGTTTCGGTCATITCCGATA^ 

SEQ ID 1888 

VLQHLRESFPLSRGGMPIBFLIiiraSHP^ 

GTTCGCCCGTTATCGGCATTTGGATTGACGATGCCGA^ 
mCACGGCGGCArcGCC^G«XTrGTCC«K 
GMTOGACAAACAC^TCGCOTAACOT^^ 
GanWSACCGOTTraAA^GCCC^ 

TATTOAGTGTCG(XTGCATCTC<SXBCGCCTGCTCG^ 

CAGCCTCCACCCraXrTCAGACGCTOAGCKGGA^^ 
MCC^GCAAmmC^^ 
YLSVOTSPRUARBSU^^
RQGNWRLAASRETCPDIM 

SEQ ID 1891 

ATGAAftCCCGGTITmtGGACATAT 
GTACTGTCTGCGGCTTCGCXXj(X^TGTO 

SEQ ID 1892

HKPGFFtaiFI/FVLSIATOAU^ 

SEQ ID 1893 

TT6CAG6GATTTA3CCGGAAT5AA06ACAAOT 

CM«JCGG05AAGCC<X^^ 

SEQ ID 1894 

LQGPIRNNDNLSAVlSAQlffira 

SEQ ID 1895 

A TCTG TTGCCGCGC(^^ 

SEQ ID 1896

BCCKAGLra^DFPSSFSAKAGIRLFHIAGIYPE 

SEQ ID 1897 

ATGAAMCACTGACCGTACACACGCCTTCGCACAGCT 

CCAACGAAACCGTCGCCOX^^ 
(^CCTCATCTTTGACGGGCTGATGCAAAAC^ 

gcgcocttcatoc^^ 
cggtgotgcagattiksgacaogctgca^^ 

GCATATGCCCXjAACTGATGGCACTCGAACGCGCGC^^ 
CTCMCCTCGGACACACCTOGGACACG^ 

SEQ ID 1898 

HKTLTVHTPSHSYPIFIGNGUiP^ 

APPIQIPTTLLSQVDSSVGGKTAimiPWSKNMIGAFYQ 

UJIX3HTF6HAVBAEMGY6VWLHGEAYAA6C^ 

SEQ ID 1899

GTGATGAAAAACTTTAACGGCAAACTCATCXTCATCGGAC^ 
^XX^GCAGCGGCGGGCGTTCCCATCCCCM^ 

TGCAAACCCTGCTCAAACXKnTATCCCGA 
SEQ ID 1900 

RPIiLQVADPLAKLRELYAARDPmQTAD^^ 
SEQ ID 1901 

TTTCTGATTTTAAGGAAAAA 

SEQ ID 1902 

HFDPPAFRG(ffiVRACHVTAGFY^^ 

SEQ ID 1903 

TOraATCCGCTAT^ 

SEQ ID 1904 

LLIRySIIMIFNinrailVYLSKimiPLra 

SEQ ID 1905 

ATGAGGCAAGMTTTATCGGAAAAAMCCGAAGTTTAAAGACOT 

AGGATTTATCCCnTCCGCCGTCATTCCC^^ 

SEQ ID 1906 

MRQEFIGKKPKFHHiDSRI^^ 

SEQ ID 1907

GWX^GACmTTCGGAOTW^^ 
TAGGCGTAACTCGGCGGTTKXMTTT'^^ 

SEQ ID 1908 

VRKSDFRLRGNDD^ 

SEQ ID 1909 

SEQ ID 1910 

LSRHSRBIGNPVRSVSVISDKFLLLFISRFPIjLHE 

SEQ ID 1911 

A3t5ACGGCTOAAAOTAC^ 

SEQ ID 1912 

OTAQKLRBEKQPRPDKLDSRLRG^ 
SEQ ID 1914

HFYSICPFFMLFAATmSFPBERESHLSGFGCFSSRHF 

MMK 



GATTGTTi 

S5S5SK 



TCATTACCCTTGAGTCGCTGAAAATCGCCCAA' 
SEQ ID 1920 



SEQ ID 1922 

HVVAARKLQAEDFBIWRTVNDV^ 
CTCftACCAACTCATOAGCQCAOT
TTAOCCAAGAAATACAAAGGGTCTTGCAGTTO 
AACCGTCGCCTCGCAAACC^T^^ 
TTCGG7PTGGCGGTACGGGGATTA 

SEQ ID 1926

MRLFKSLKNPKKimKLPKKSSGIJWRAM 
QGFVESSISEASSISLEEAimJYQ^ 
I^IJQRTYQVTAEKAEEIINSPQltf^ 
FGIAVR6I1 

SEQ ID 1927 

GTGAGTAAOCXjTTTCGGTATCCGTMT^ 
GCTATGAT 

SEQ ID 1928 

VSNRFGIRNGFLVUmPC^ 

SEQ ID 1929 

ATGGTAAATTATTATTCAGCTATGATTAAAMGAT^^ 

AACTGOCGTCTTTGGATTCTOGCAGCA^ 

AGC(nKX^GTCXK3GTGCGAGTACGATTACACAGCAGG 
CFTTAAGCAAAGACAAAATCCT^ 

ATGAm(XX5TGmCAGCGCGATaGGCATT^ 
AGAAATATGGTGMGATGCCTATACGCAGGGTTOM6GOT 
CAGCTAC(XXX3GTGCGGAAMCTATATCGATTTGAOT 
GATGTTACTAAAMGAAAMTGTCGTCATA(^GCTGCCCG^ 

GCTGGTCGGCGGTTATGATTTTCAM 

GCGTCCACAGTGGTTAAtt&TGCGCCGATT^^ 

TGACGGCTTOCAAGMTATGGTTTCCATCCGTATTTTC 

GGCPTTAGGTACGG(XX^GA£XiACG^^ 

TTCCXmmAATGCAACCTTTG^ 

GGGCAGCTGOGTTGGGAAGAACGGATATTGCCGCT^ 

aiMCCTMGAGTAT03GGC(?K^ 

GTGGTCAGCAGCAATGGCGAATACTATATCAAGGAACGTATG^ 

CGGCAGTAGAAAACXSAACAGCAGGGAAGGTCK^ 

SEQ ID 1930 

MVNyYSAMIKKILTTCFGIiFFGFCT 
SVQSGASTITQQVAKNFYLSSEKTFTRKPNEVIMYKI^ 

DVTOKNVVIQIiP^ 

ASTVVm)APISIiKaCGPNGSVWTPKNSDGRySGnTLRQ^ 

WSSKSEYYWCERMVTDPGlifijD^^ 

SEQ ID 1931 

ATGAACCTTTTOAAMCGCG^TO 
ATACCCTGACCM(XATGTCC<^^ 
TGCCCAAGTaCCGAAGCGGTACGCGCACATTGG^ 
CTCGACATCCGTATGCTGGACTTITC 
TGCTCAAACCTTATTOGACAG^ 
CGOTOCTCTCCAGAGGAAAAC 
SEQ ID 1932 
MfcFQNAKFFTTVN^ 
LDIRMLDFFHTTGRPVHILLSKA^ 

SEQ ID 1933 

TTGAAAAAGGTTCATGGGCATATOTTGTTT^^ 

SEQ ID 1934 

LKKVHGHILVFRBfiLSDSKNMRLVLCET^ 
SEQ ID 1935 

ATOCCCGAAoacamr 

ATTACCAAAATATCGGCAAAAACCTGCCCX3GTTTC 

G(X£AAGGG(XjAfiGGOGAAAG^^ 

AGCAGCGCGACGGTTGCCTTGCAGGAGCAG^^ 

CCTACAAACTCTATCAACTGACGCAAAG^ 

CGCCGACGMTTTTTCGCCCGACGGTTCAACGGC^ 

CCCAACOSTCCGGAATCmOTmA 

CTGCGCXXX5AAAACAGTTTCTATTGCA^^ 

(XlftGOTACCGGCAAAM^ 
CCGTCTTTAAGTATGGGGCTGTCTGAAMC^

CTGCACG(&(XEKjCT^^ 

CGCCCOTATCGAACAAATCAGTGCGGTTTG^ 

MTGCCAGCCCCATCAGCAGCGCGTCTCACCTTGCC^^ 

AAACCGGOTGCTGTGGCTGCCCGA^ 

(XACACXXXXXSCCGTM^ 

(^ACCTGCCGCTCTTGCTCGTACMG 

AAGO&rCGACCTCXXX^^ 

CCCOTCATCGAAATCACtfHOX^ 

CAGCGGTACGGCCAACAATTATTGGCCGGCCTGC^ 

SEQ ID 1936 

MPEPPSDNDTVHffRmM^ 

SSATVALQBQLVDRDIiPPLVEKSGI^ 

PNRPECPFYliARIMaETVDVVVANro 

PSLSMGLSENDHIITNSEPTYM 

NASPISSASHLANSLWRRAAGAVI/T^ 

DYLPUjLVQGELPKAVLIiQKHHQAIE^^ 

QRYGQQUAGLPPFKRIG 

SEQ ID 1937 

ATGCGCGATATGGCC^GTGGMGCAGTGGGCGGAATGTCC^ 
CGGCGCGTTTGCTAT(mraCGGCTTGGGAa 
GAAATCGCATA(XX3TGTCGCCGAOT 
GGTGTGGAAACGGCATTGGCGGAAGCGGCGTCCCTGTT^^ 

GGAATGGGTGCGTTTTCTGCTGAACGGCAGT^^ 
ACGACGTGGTACAGGOGGTAT 
SEQ ID 1938 

MRmWQWAECPDFADGLPDVRRELPFIiPAMRRRMjSKAm 

GVETAIAmSLLEEGCGSVLVLAAIB 

TTWYRBY 

SEQ ID 1939 

GTGTTraCATAAAACCMCCGCATATTTTTO 
GACCTGC(X^CACAC(X3CTCGAMTTGCCTTC 
GGACACAG(&TATCAACTTCTTCGAGCTGC^^ 
CTACCTCCGCCATCGGAAAC^Gm^ 
ATOTGCTGTCAm«XX»^^ 
GCCTGAAAAAACAAGGTAT 
SEQ ID 1940 
VFHIKPTAYFCCRTNGGGKQGYAHEPFSTO 
LPPPSETAYRAGFDYGCPPSFKRTRHPYAGLFPHDRQTGSHPAVKSRQ 

SEQ ID 1941 

ATGCTCTTTTTrATATTCAMTAlK^^ 
GCATATTTTTGCTGTCGGACAAACGGCGGCGGAAAACM 

SEQ ID 1942 

MLPFimGIFDPIKmSVNVLFYTKFTl^ 

SEQ ID 1943

wscsqccmaix^

{^GACGGTAACAGCGGCATTGCGATGTATCCGCGTT^ 
GAAACCGGTGGTAATGAATTO5AGCGATCAGGA^ 
ATCTATCGCGGCGGTTTGAGCGATAAAAAAGTGTO 
AGCACCAGGCATATATOTTGMCAGATGAATGCCTACAAff 
TATCCAGGGTTTGCGT 
SEQ ID 1944 



IYRGGLSDKKVPACMSC^GPSGAGMPGGGSEIQAYPRliGGQHQAYIVEQ 

SEQ ID 1945 

ATGGAGCTGAAAAMGCGAACCACGGACGGGAAAGAAGTGTGGGAGAT^ 
CMCGTCCCGATTCTACTGTCTTG^ 
SEQ ID 1946 

MEUCKANHGRERSVGDIRDLLIVC^^ 
SEQ ID 1947 

ATGCGCTTTGCGGTCGCTTTGCTCA 

TTTTTGAPTTTTTGGGTTTGTATGATG^ 

AATGAAGTCOTCCGGGAAMGGTTAAAGAAAAATO 

CAGGGAAAAACCGTCAGCCGTGAGGACGGGTCGGTTC^^ 

TGATAGACACTMCCTCCTGW^ 

ATOAGGGGCAACGTCAATATTTOCGAGGGGCAAAG 

GATTTTTACMTACG(K3TATGCCGCGCGATTTTG(^ 






TCACGATTTATCAGGCGAGTTTOCCGACGGC^^ 

<?ITGGAAATCG<m^ 

(XXIGCCGTTACTCAGGAACOTAAAAAATACA^ 

ACAAAGATTATTITTGGCTGACCGGCACGCGCAGCGGCTT^ 

GAAAGATGGGGMGGGCGCAAACGTCTGGTTG(X!!G^ 

TATTT(£GATTGGACGMTTTATTA 

GCCGGTACGGCTTGCCCGMTGGCAGCAGGATCAAGCGC^^ 

glTfTCCGAGGTGCGTTCCTCAGGTT^ 

CGGGCGTGGGTA!nGTTTTC&GA 

ACTTGAATCATGAC 

SEQ ID 1948 

MRPAVAUiSLLGIASVIGTVljQQNQPQroyLVKFGP 

QGKJVSREDGSVIiIRAKKGlMHKMGn 

DFYNTG1PRDFASDIEVTDKATC 

RAVTQEGKKYTNIGPSIVYRIRDMGQAVEYKNYffljPILQDro 

YIXHflEFITSNIPKGQQDKM^^ 

RAWVLPSDGKIKFAMSSARSERDIiQKEFPKHVESIiQRLGKD 

SEQ ID 1949 

ATGACTGMCAOTATAAAACCCTTC^^ 
TGCAGACC(X3TTCCGGTATGCATATGGAC^^ 
CGTATTGCTTGCCTATGCCX^ 
TTCTTC(XraGTTCGOT 
CCGGPCTGCTGCTACGTTCm 
CTATTACGAGGGTAAATTTGCCG^ 
CTGATTCCCGCGCTCCAO^ 
AAGAAAAAGGCGGAAMCTGTGGCTCCCGCCGTCGGCA 
G(£GGACGCTTGGGGACG^ 
CTGGCGTGGTGGGCX^TTATCGGTTTGTTCff^ 
SEQ ID 1950

MTEHyKTLPEHELIJQKSLIRIJLNIjWDWVFAVLW 
FFALPAYISGAVhASVKHVPTNTI^^ 
LIPALQSVfWMKIHVPANFIGYGAFCIS^^ 
LAWWAIIGLFVTAFAFIGVNMPLSGLHSYGTL 

SEQ ID 1951 

ATGTTGGTATTAGGAATCGAATCTTCTTGCGACGAA^ 
GGGTCGTGCCGGMTTGGCGAGCEGG^ 
G(XXX3GTTTGGGCGGTGCGCTGCTGGC66GTTC 
GCGGAGGAAAAACCCGACITOXT^^ 
gaGGCGAGGCATTCGACAAAACGGCGAAACT^^ 
TCATTCCGACGATITGCAGATGAGTTTTTC^ 
TGCCGTGCGTTTCMGATGCGCT 
GTGAAACTTTOSGCAACATGACGGT^ 
GATTGCCTTT6CCG6T6CGATGCACCTGGGCAAG 
SEQ ID 1952 

MLVUUESSCDETGVALYDTERGM 
AEEKPDFPFVALLVSGGHTQIMAVRGIGDYEI^ 
CRAFQMWEVLEAKVKKALLQTC 
SEQ ID 1953 
ATGAAOTCACCCGGTTGCTCAACCAAGK^ 
TGCTGCTGAACGGTAAAAACCGCAAAACOT 
ACAMGCGATTTCCAACCTGCCGGAAAM^ 

CCTATCTGGCGGCAAGGTT^^ 
AGGGATT 

SEQ ID 1954 

MNFTRLLNQVLSTVQKKGNTFSGSPI^^ 
EQESGTDPETAAWLAAEYRLPASIGDIAAAVGTO 

SEQ ID 1955 
TTGCCGCCCGAAACCCX^ 

SEQ ID 1956 

LPPETRQSAMSKWKRQI IGGIVSFKNLHCAAP 

SEQ ID 1957 

TTGGATTTCCTCTTGTTCAGGCAIX^ 
AACCGGCATGGGATGTT 
SEQ ID 1958 
IJ)FLLFRHm!i™PHPTDmTAWSGIVIFSLVPGAGIEPAtn}V 
SEQ ID 1959
TTGCCGCAATACTTCOT^ 
CGCTTTCCTXSTCTGCACAC^ 
GACGGTCAAAGCCGTTTTTGCGGA^ 
CAGCAGGCTTT(£ACAAGG<^ 

GGCMCCATCATCCTGCCC(^CCACGT^^ 

GGCGTGAAAACCCTGTTTTTCTGC^^ 

ACCGCMTACX^GAATATTGGATACGCCGTTTTC^^ 

SEQ ID 1960

LPQYFPFSIXaiFRLQFRLFTPIiR^^ 
QQALDKGKIiIiFITPHIGSYDLG^ 
GVmiFFCCERLPOKKiFVmRW 

SEQ ID 1961
TTGCGGCAATATOOrrTTTCTGCTi^ 
GAGCAATCCAAACCCGCTT^ 

SEQ ID 1962 

LRQWSA!IMRAAIiRVGHSMP1^ 

SEQ ID 1963

ATGTCGACAGCGGGAAATATAGCAAAATTTCCCTAT^ 
(XXAITCAAGCGGGTTTGGATTG^^ 

SEQ ID 1964 

MSTAGNIAKPPYSCBSVmSYUIGAIiESLSV^ 

SEQ ID 1965 

ATGGTTTGTTTTTCAA^ 

TGGCGCAAGACCCCAAAGCGOTGTCG^ 

CATCAAACGCATCGGCTACMCTCCTCCGAGC^^ 

ATCGACCTGMCCAAGGCGCGGGCGACCAAGGC^ 

AATTGCGCAAAGACGGCCGCCTGCXTTGG(^^ 

6CACGATCCTGCCATCAGCCAGGAA6AACT6AGTAAAG(XIGT^ 

COrnCGTCATCGGCGGCCCGCAAGa 

J&GTGGACCGTTCCGCCmTAC^^ 

GATTTCttTOSATAOTT^ 

AOTACGGAAAATCTGCCGCCTACGGGCATTTCGGO^ 

SEQ ID 1966 

MVCPSraSMSEYLFTSESVSEGHPDKVAIX^IM 
IDLNQGAGDQGLMFGYACDETPTIjM^^ 
RFVIGGPQGIX^TGRKIIVDTYGGAAraGGGAFSG 
IYGKSAAYGHFGREEPEFTtfEETDKAASLKAAAGIi 

SEQ ID 1967

GTGCGTTTCAATCATTI^TAATGGTMCGATTATTATA 

ATGTAACAGATOACCTA'IGAAT^ 

TGTCC^GAGCTT^^ 

TACCGCTGGGCGACCGAGTTTAAAAGCAACGGTA^ 
AT^CGCCAGTOGCGCGACAAAGGCATCCGCMTATCA 

AACTTGAAMTTACCGCCTCCmG^ 

CTGTCGGTGTCCGGATGTTCGCGCTr^^ 

GCAGACOTTGCKXTTTGC^^ 

CCX^GCC6TTTCCGMCAGGCGGCGTCTGC(X3 

CGAGMTGATGGCGCAAATtH^^ 

CGGGCTGTOXEOTAAAAACCGGCAC^^ 

CTGCCCGACTTGGACAACTTCGT^^^ 

SEQ ID 1968

VRFNHFDirailYVISPANKPVRRPGVPTyP 
YRWATEFKSNGTVNIX3TliIX3NLYWAGSGDPVFNQE^ 
KLKITASQAACPSVKKUffiASPSGNTLKI^^ 
PAVSEQAASAVRRELAVSGIBVAD^^ 
LPDLDNFVAKNI ISGGlXMiDAKLMCKERRA 

SEQ ID 1969 

(?K&AACCGGCAAATMT^^ 
GACAACGCCG(XXX!TC6GCAAACC6G(X3G^ 
SEQ ID 1970 

VEPANNRRHSFFQKPTLCRILTKAHPPDHAEIQraPRHPra 

SEQ ID 1971 

ATCTGGAAAGAAATTTTACTGMTTACGCTr^^ 

GTGTCGTACT6ACAGATT?PTCGGAAAATTATA^ 

©3AAGCCAAA(OGAGAAAAA«OTTGAA^ 
TTGCGTCAT^GATTACGGOGGTGCTTTTGATT^^

GTTOCGOGAACGCAATATTCCGCTG^ 

GGTGGGTGT6GTGGCG6AAGTGCCGAATATCCA0CGOT 

GAAAAGGGCAAACAGAAATTCCGGCAGGAACTGGAGGAAA(X^ 

TGGGCCGGCA6GCGTTG<KXnTCAAC^^ 

GATCCAGCGCATTGGTTTGCAGGCGGAAGC^^ 

SEQ ID 1972 

MWmiMYGIFIiimTOGAIALIVLAIVQSKKQSK 

LRHOTAVLLIAKPEDEVIi^^ 
EK(^KFRQELEmQLFKQFVSENRP^ 

SEQ ID 1973

GTGGCATTGAATGCGGCGGft^^ 

CCGAAAGOTACACGGCTTTCC«Xim 

A(XX^TGCCGACGTGGCTTGGAAAAACAAA 

CCCGTAACCGGACAGCCGGMGTATTTTTGG 

TGATTGAAAGAAATGCCAAA 

SEQ ID 1974

VALNAAmPEGWQAEIVEIGHLPLYNFDyDDPEVEDVPIiPESYT 

PVTGQPEVFLGNSPTI»FDEN(3aJDSAI^ 

SEQ ID 1975 

ATGTCGGCAGGCGTGAATGGAATGCAAGGCTTATCCCG^ 
CTGATTATOGGGGTCATCGTATGGCKnT^ 

SEQ ID 1976

MSAGVHGMQGLSRICWRPCIflGRPYVQNKIKK^ 

SEQ ID 1977 

TKKnXX3(X3GGGTTO^TGTATCCC^ 

TGGGGCTGTTCCGTCXXjATGATAGG(XK?OTGATTM 

GACGTTTTTATCGGT(X3TCCC(XX^^ 

GGGCCGATCGGGC(XWTTGCAAGCGTGTTGGGCGCA^ 

TGCGTGAAAAAAGGCGGATTGTCCGGGAGTTGGGCGCGTO 

GAATGTCCTGCTG?ITPTGGCGGCGAT^ 

CAGATGGAOCGGATTTATGOCCACGCCGGCGGCGCGGTGAT^ 

ACOTGTGCAT^^ 

TGTGGAACAGATTACCTCGCAGGCGGGCSTTQC^^ 
GtarnXKXSAAATTGTC 

SEQ ID 1978 

UAGFDVSQIiKEFYSaTKSVTQIVIMF^ 
GPIGRVASVLGADVGELYKPLI/I^IIGVWa 
QMERnAHAGGAVKMASIILAAGTFLGILKGAGHLM 
GLAKLSHGKHIRYSFFWAKGLSLAILVSSIAAGIVPIiP 

SEQ ID 1979

TTGGGGCGCGATTGC^CACACGGACAGGCAGGCAM 
TQACCAATATCGCCAGCGACAAACCC(^GCCCAG 
SEQ ID 1980 

LGRDCNTRTGEQSLRQITGTIRASGGFAVYGRGTI PAAIELTNIASDKPHAQKHE 

SEQ ID 1981

GTGTCGGCAAGmmAGACGGC®^ 

GGCGGGAOKSTTTGGATGTT^ 

TGACTATATCGTCMGCCfflTOOT 

gcaacctatcaggtagaaattattgccgagg<^^ 
gacaaggtttaomtoggg(x3gg^ 

SEQ ID 1982 

VSASLKDG(^VDWVKHGAQVAA^ 
ATYQVEIIAEGRGGIEQPGVFGIAGFAGEAGCDFVALGFG^ 

SEQ ID 1983

ATGCCGAAATTCAAACGCATCCACGCCATTO^^ 
CHXK^KXXXK^CAGCGGGCGTATGGTTC^ 
SEQ ID 1984 

MPKFKRIHAIFSDNAAPRQTARI SLNI^LSLIMPDYRHRLPAAAQRAYGSDRPHYPIRR 

SEQ ID 1985 

CTGCAATGCGCCGCCTCGGGAATGCTT^ 
ATTTCGACCGTGTTTTTGGGCAGGT^ 
GCCGGAGCTATTGGTTTAM 
TTCCCTGAATCGGGCITreAACC^^ 
ACTTCATCCCGGTATTTOGGCAC^^ 
CCAGAAACCCCCGCA(XnCTCCGGCGTGCGAG^
CmOTmTACAGGmTAACCGAC^ra 
AACAGGGAAAATKX3GCCGTCMCG0GGCAACCGTO 
AAGAGATTTTGCCGACAAATACGATGGGGC^^

CGGfflTAAAOGGAlOGTCTTTGK^ 

CXmAACXIGCmCAGCGAACCCAa^J^ 

ICAAAfiTCAGAAACAGGKTCAT^ 

CCAGATATCKaCTACGGGC^ 

TCTTCGTAAAGGAAGATTTGAGAAGCAATCA^ 

GTATTTCAGAAAGAATGATGTTGGTAATGAGGAA3^^ 

GGCGACGGCCCGCAATCCGTCMTTTCAG^ 

SEQ ID 1986 

FPESGFEPGYQEGHtFKSRHGL^^ 

PSnQWTDKHQGRGEI^ 

RJMOUCVFVDHEQAIKQCAETE**KQliPFSIFa 



GDGPQSVHFRPVSDSIflQ 

SEQ ID 1987 

GGGAAGTCGAAAG(^TGCGGTGGATTTTCTGATTO 

SEQ ID 1988 

LPRGGVALSTOEFSVLQM^^ 

SEQ ID 1989 

ATGCCGGACCGTTTTTOAAAATTTT^ 

TCCACXSAAACGGAAGOGTTGCAGGACGACCTGCT^^ 

C^CAGCAG^GACCCT^ 

CGGMTGCCGTCATGCAGGAAAACGAATACCG^ 

(XJCACAAAGCXMGCGCCCCGTC^^ 

C^TCAATCTOTTTTGA^ 

MCAATATGCaXrrcCCACTC 

CCXXTPTGACGAAAACGACA1TCGGGCTGCAAAGCCG 

AGTGTCTGCCGACGAMC^AAATCTATACGCT^ 

GCCGTffTGGGTGGAGGATMCGGGAACGGCATTC^^ 

<XX&CACGCTGGCCAAAAMTACGGC^^ 

SEQ ID 1990

MH)RFFKIIiKHSIjQVRISIiALIWMFVPLAMLAGM^ 

RIAVMQENEYREDIABD^ 

JMPLPPDAGRQSAVI^IRRNKH^^ 

AVWVEENGNGI PESECPRVLDPFYRILGTEQQGAGl^ 
SEQ ID 1991 

TATOGTCAACATCACGCX^jCAGG^ 

am£GCCmTACT«XCG(X&^ 

ATOAACATATGCTGTTTCMG(^ 

TTCCGCAAGAAGACCACATCGGCGTGTOCCAAC^ 

CACCAACGCCGAmCGCCACC(&C^ 

GATTTGCCTTTMGCCTGCGCGTGTT^ 

GCGCATTGGGCAGGATAGMTTGTTCAMGGCGAACGCCCGC^^ 

GATTATCGAGTCXACCGAAGCCATGACCAC^ 

CGCGAATTAAGGCTGCGCAACCTTGQCGGCATCATCATC^^ 

(XXGCGTTACOTGCACGATTTTACCAGC^ 

CCTGAAAACGCCGCAAACCGTATGCTACGAAATCCAGCGCGA 

GACGAAGAATCGCAATCCTTGGCAATGCTGATAGATT^^ 

SEQ ID 1992 

LHPimGRNGCKNIPITFSGKmSGLPIPKDIAR^ 

IEHMLFEGQSVLVQVIKDPINTKGAR^ 
DIMLRVLRD^ 

RELBLRMI^IIIIDFIDMAQESHRE^ 
DEESQSLAMLIDFIGKPISIAVETAYTQEQYDIVLM 

SEQ ID 1993 

ATGCAAAAATAOTTTATCACATTTTCAGGAAAAA 

GTGGCGGTGTTGGAGGAAAACAATATCTGCGAGCTGC^ 

CGTTTATCGACATCGGCTTGGAAC(X^ 

SEQ ID 1994 

MQKYFraFRKNHVVOTPHPQRHRAPARNDIG^ 

SEQ ID 1995 

ATGATAATCMTATAGGGGCAATOGCTCCGTTTGGCA^ 

SEQ ID 1996 

MIINI(MAPFGKCGTOARSI>aKGVCQSFFFK7 
SEQ ID 1997

TTGCAAAACTTTCGGAAACC^ 
TCCGTGTCGATGCMGCCCCGMGCCTTK^^ 
TCAGCAGGAAGG(X3GG(TO^CGGACTGCT6AACOT 
SEQ ID 1998 

SEQ ID 1999 

AT6AGCGAAGAACT6CAACCCGTATTCA6CATC^ 

G0GTTT0CftCft3G(^ 

SEQ ID 2000 

MSEH£PWSimYVKDl£LEVP^ 
FPPVLLAPINPEAIYQQQQEAEAAGA 

SEQ ID 2001

MTCXJGTTTCGCCMCGCTTQiGCOT 

CGGCAAGCGCGGCAGGGTTC^^ 

GCX&TGCCAGCGTGAGACGGGO^GTOT 

(&GCCGAOTKX£CCGOT 

CCATATAGTCGTGCGGCTCGATTTGGATGGTOT " 
CPimGTTGCCGAC^^ 

TTTGACATCATGATTGGTC^TGGCTTCGATG© 

(^TCTCXXSGCTCGO^ 

CGTCTAGGGGGGAAAGCGAGGCGATGGGGT^ 

SEQ ID 2002 



ALPA*K$FFRQKAQYGVCHAEVA^ 
VWADRAVDFAEEmLKDFEI^^ 
PDIHIGRGFIX5PDGGRVP**KVPIH)FVDFGKRFGT^^ 

SEQ ID 2003 

ATGAGGATMOSmTMTCATAATTTTACAGTCC^ 

OTACCTGCCGTTCTCATGGOT 

TTCCGATTTCGOmTTCCTC^ 

GAMTOCCATTKXX^^ 

TGCTGMmTCCCGATTm 

AMTOCC 

SEQ ID 2004 

mWIIIILQSAQRRPMPWNIPIFLTWIJWLLIPV^ 
EITISAIiREWMAQHGKRSSVAVATVGKFKTAAQMLM 

SEQ ID 2005 

TOGCGGAAACATCGGTGATTTTGAQCTC 
GCATCGGCA(XX!AGGMTCMGCCC(XKXX^^ 
AAA(XGCTTCG(X3CGCGTAAGGGAAGAGGATGTT^^ 
AAA(ATCGTGCGCTCGTTO^^ 
TCT-iraAAAAGAimCG^ 
SEQ ID 2006 



KHRAIjVIQFGGI&YGNVHVVVAVF^IjLAAGG^ 
SEQ ID 2007 

ATGATGTCGCCCGAAACCCAGAAACAGCTCAAA^ 

ACGAGACGCACATTATCCCGATTAA(XMG(XXXX3 

OGACGGTTCCGGCAGCGTCCTTTTTCTrc 

(^C(^T(^TTCATCC(^ATCCGC^ 

CGGCGTT05ACGTTACGCC^ 

TCATCMCTTTCAGACGGCACGCTGCCGGCG^^ 

TTGCGCGGCGACGGCACATTGACCCAAGCCCTGCGC^ 

ACXGCCTGCTGCAAGGCX&TGTCCXXA^ 

ACAGCACTTCATCAAGTTCAAACAATGGCT^^ 

GTCAAAATCGCCGTCGO^^ 

ACAAAGGGaxmCTCaTCAGCTCAT^ 

* ACCCWmCTGCAAACCGCC^ 
AATTTGCCGCAGGCCGTCTGAACGTCT^ 
TCACCAATTACGCGGACGGGTAGGGCGCGGCGCG^ 
(OTTCGAMTCmXXSCCAAG^
AACMGCOXXX^CCGC^ 
SEQ ID 2008 
MMSPETQKQLKITDVSAKKL^^ 
MMIHPKIMMGGGIAESL^ 
LI^DGTLTQALRQMjPFALTDAQHCVV^ 
VmVGTHALFSIXNAFHN^ 
TLQLQTAAETLAR^ 
GFEIARQDLNIRGPGEFLGARQSGVB^^ 

SEQ ID 2009 

TTGACACATTTTCATGATTTTAAATCCCGTO 
GGAATOTAQAAATGAAAAACCACGGTGTTATCGGAAAC6AT 

SEQ ID 2010 

I/EHFH0PKSKYSQV^^ 

SEQ ID 2011 

TTGGGTTCGGACGGCTTTTG(X^ 
GCTTCX^TTCCGTCATTTCTCCGGTTO 

SEQ ID 2012 

LGSIEFWIJIQRKQKSKIPSFI^^ 

SEQ ID 2013 

ATGWXAAAAAMTCAAAGCCG^ 

CGGGAACCGCAGTTGCCCftTTAOTTCC^ 

CA!TCG<XaTGAMGA06CGCCACQ^ 

GCCCCCGGCCTCGTTTCCCAAGCC^^ 

TGCCGCTGTTGCGG(MTGCCGTCTGM 

CAACTTCAAAGCCTACGGCACAGttX^^ 

ATACGC^TATGCACQCCACCGTTTACCTCCACOT 

CCGAAA(X!CGCAGCGTGCGCGG(X£AAAC^^ 

(KXAGTCCAAAATATGAACATTA^ j 

SEQ ID 2014 

MSKKmGIVGATGYTGVEimiAAHPD^^ 

APGLVSQAWGLSELNREAVAQARLVANPGCYPl'CVSIiPLWIilA 

IRGMHATVYLHLSIESDPE^^ 

SEQ ID 2015 

Aa^AAAACCCTCGTOTO^rc 

ATTCCGGACGGCAGGGCTGGGTTTTAAAAACAGAGGCGGATGC^ 
CGGACAGCGTTCmCGraTACCCGC^^ 

SEQ ID 2016 

MKPLVLLIJiSFSTTTA^ 

SEQ ID 2017 

A!R3GGC6TGTCQ3ACTCCGAACTGCGTT^ 
ACGGAACAACTGACGCCGTC 

SEQ ID 2018 

MWsasLRsmaPETsscOTii^ 

SEQ ID 2019 

T(£CGTCTGAAGCCCGTGTTO 

CACTCOXX^GCa:<k^^ 

G(&TCACGACATATGCTTGGG^ 

(aGCXXTTACCOAA^^ 

GCAAACGCATAATCCGCATGATCCTGCGGCGT^^ 

(XIGGC^CGCGGTTTACATTCTCCAr 

SEQ ID 2020 

OUjKFVFRQIRiroFVF^ 



SEQ ID 2021 

AT6AAGCC(X3ATTAPAGCAAAAGGTCGTCTGAA^ 
TTAAC 

SEQ ID 2022 

MKPDYSKRSSENRFQTTFYIiITIKMLTTKKRPLFSTO 

SEQ ID 2023 

ATGGGCATCCATCTCGACTTCGGCATTA^ 

TCAACGAGATATACCAACGAGCAGACCCAACCG^ 

(EGTTTTATTAAATCCGTTAOT 

GCACATACTATTGTTAGTGGATATCTTGCTTO^^ 

aXX^ATTTCCCTAT^ 

TTGGTGGCACAGACCTATCCCCX^TCGGCT^^ 

CTtKXmCAGTTTCTCAGACTG^ 

GCAAAGAACGTACCGATACCGCTTTT^ 
ATATTTGATTGGGMTGGGATGAAGATAAGCGTAGATGAGTO
AAACATGAAC^GTTAGGAAGTTGAOT 

SEQ ID 2024 

KIHLDFGISPKTFRQflTLYQITKIiFKGAVRHIiEAA 
AHTIVSGYLAFGSDESYKIQJHOTRDVYAIQ 
LRHSFSI^QDRTRINDTMQIAMIADPVNYEAFSBD 
KHEEVRKLIYQLIELDFLBIL 
SEQ ID 2025
GTGAGGGATTCTATGAAAAACTGGAAGO«7^ 
GTTGGMTTTGMTTTTATTCAGGGTACTTT^^ 

SEQ ID 2026 

VRDSMKOTTCQFIPFVILVIACYQVLYF^^ 

SEQ ID 2027

ATGTTTTTGTTAGCCGMTTTTTAGGM^ 
CTGGAMTGTAGeTMT(X^<mCM 
T6CAAATGGTGTATACAATCTGACTOT 
GATAAATCAGGAAACGGCTACACTGACCKX1ACAGATTACTGC 

SEQ ID 2028 

MFbIAEFLGIPTKKRSIKMIEI£^ 
DKSGNGYTDGTDYC 

SEQ ID 2029 

ATGMTAGACCCAAGCMCCTTTCTTCCGCCC^^ 

CGATATCTGCGTTACTGATOTCCTK?^^ 

GACMTTACAGCGAAATTCGTGGAAGATG^GAm 

«3GGTTTTCJtoGAAA^^ 

CX3CMCAGATAGACGGTCAGAAAAG<XXX!ATTAG^ 

GCTTTTAGAGCAGAAAGCCAAAOTGATGOTArc 

AGCCAGCTTGAACGCACCATTGCCGATATTOT 

GACGGCAGGTTGATCCTTCCAAAC^^ 

CCTACGTTATCAGGCATATCCCTATCAGAMTTCGGGOT 

TTGGCGAAGAGCAACGAACCTGTTTATCTC^^ 

ACGAGAAA03GCGGCTGTACGMTGGGTATTG^ I 

SEQ ID 2030 

MNRPKQPFFRPEVAIARQTSLTGKVI^ 
AVIMTLAEQELGRLKLIHENETRSLKATra 
SQLERTIADISQEVLDFEMRSEQIIRAGRSGnAIPNVEVGR 
LAKSNEPmVKIKPDKPTITAyGEEKPLQIGMTI^ 

SEQ ID 2031 

ATGAGAGAATTGACCATGMCGAMTGAOTCTCnTTO 

GGGTGGGCATATCGGCGCTITITO 

SEQ ID 2032 

MRELTMNEMTAVFGGTEGTWGHRG^ 

SEQ ID 2033 

GTGCGAAATCCATTTCAAACT^XXXSATCAACAGTO 
TATGCCCACCmTGCAGiACCGCXXm 

MTTCTCTCATTTCAAAACTCOT 
SEQ ID 2034 
VKNPFQTPDQQUHBQ^^ 
NSLISKLLI^SIEKIiKGQIpyPCraQSIC^ 

SEQ ID 2035 

ATGGATTTCGCACATTACCTGAAACACTGGAA^ 

SEQ ID 2036 

MDFAHYLKHWKAAVLIYLAISILTD 

SEQ ID 2037 

TTGACAAATTGTTTGCAATTATACATCACTCT^^ 
COTCGGAMTCATACCGCGCGCCAAMGC^^ 
TCAGGATTTCAAACAATTCGTCCAGCGTCOT 
CTGATACGGA?ICGGTTTCTGCTCGTG^ 
ACGGCAATACCCGAATCCEACAfX^^ 
CTTCOTTGCCTGCO^ 
ACCGCAAAGCOXACCCGCCTTCCC^ 
SEQ ID 2038 

LTNCLQLHTLRTI£RLRIWRQ 

LIRIR?LLVRQGNVQPYRHRIiPGKRALVCRI^ 

TAKPHPPSPHiFSDGIFGNLLKSPA 
SEQ ID 2039

ATGCKmraTAOG<^T^^ - - 

TTGAAATCTTCGCCCAATCCGTCOT 

6AAACTTCGT(XXjGAAC^TOTTra 

ACAACCATGACTGAAACCGTCCm 

SEQ ID 2040 

MLmAPIEI^AVGAYAGIQARKNARPVRLGFLP^ 
T*IWPETVLITGSNRGIGKAAALGFGGREL 

SEQ ID 2041 

ATGAAATATCACCGCTTGGCATTGTTTGCCGC^ 

TGACCGTTACCBCO^^ 

CATCAGGGCGCGGCAGGTTAAGATTGTTT^ 

ACGGAAGACCGCCTK3TCGAAG 

SEQ ID 2042 

MKYHRIALPAAISCLLLSAWIAPYLTAPHEQEOP^ 
TEDRLVEVlAQPAWSI^ILLUiPAIFlJlIi™ 

SEQ ID 2043

ATGGATTATCTGCAAAACCTGTCI^^ 
AT!AC(^TTWX!GCGCACim^ 
AGACGAATTGGGCAGTFTGCGOT 
AMGTCAAGACGGAGGAAATATCGCGCMGTTOC^ 

6GATITCTGGGCTGGGGCGGACATTGTTTCAGOT ' 

CGGACGATCGG<XITGCGGTAAATCGACACTTTT^^ 

CGCMTTTGAGTGCGATGGTCAGGCAGGACGATG^ 

SEQ ID 2044 

MDYLQNLSLGLTKKLm^ 

KVKTEEISRKFTCIALELWPimiFEAG 

RHLSAMVRQDDVL 

SEQ ID 2045

ATGATTCGCXSAAGAAATCTOXKX^TGCCTATGG 
AACCflU^TCCTAT^ 

AACGQTGGAATCAGCA6ATAGGAAAATGTCTTTAGGA 1 

SEQ ID 2046 

MIRmSAMMCTEmGDMGSALSGGQKQ 

SEQ ID 2047

ATGACCTTTTOACAACGTTGGCAAGGTTO 

(^QCTGCTGGCACTCGTCCCCOT 

GGGCGCGGATATGGTGTTCGACTATATCGAC^^ 

GCGTTCAACCGCATCTGGCGGGTTAAC^^ 

GGTGGGTTCMGACTCCGTACTCTCC^^ 

GGCGCATTTGCCGCCGTGCCGTTTTTCOT 

GATTC6ACTC6CXK3GGACGGTTTGACGACGT6TO 

CGATGAATTGGGCGAGCTTTTGGAAMGCTGG(X^ 

TTCGTGTACCGCCCGTTGCCTGTGGA^ 

AACAGCAGCAGTCT 

SEQ ID 2048

MTFLQRWQGLADNKIC^AWFVIRR^ 
AFNRiraVNTQRPWMMQFLVYWAIi^ 
GAFAAVPFFLLWLNLIA^VLGGAV^ 
FVYRPLPVBimHVNQAVDAVOTPCIiQT 

SEQ ID 2049 

ATGAACCCAAATCCCCTCAAMTCCTCGTCCra 

AACTCGCCGTCAA<7TC6GCATACGCCGCGTGTT^^ 
CCGCGTTACCAAAAC 

SEQ ID 2050 

MNPNPLKILVLYySQNGSTRNPARRITRGI^^ 
PKYQN 

SEQ ID 2051 

TTGACATATCX3AATCCTAGTTTTGGTAACGCGG^ 
MCACGCG(KX3TAT(KXX3ACTTGACGGC^ 
CTTCGCAGACGGCGGACACTTTGGGGA(XXn^^ 
GAGGACGAGGATTTTGAGGGGATTTGGGTT^ 

SEQ ID 2052 

I/EYRILVLVTREVCEPRSWTVKTRIIQANAFRR^ 
EDEDFBGIWVHGRIPLK 
SEQ ID 2053

TTGGTOGAQ^CGGCTTTGATATCGOCOT 

CCGACCGCGMGCCTGCCGCG^ 

(X^TTGGGA!K3TGGT(X2CGCGGACTMTTO 

GGCCS^CGGGCAACCGGGGGCAGGTCAAfT^ 

(33GGTTTGATGGATACCGATATTATC^ 

GC^TGAAAAAGCGGCGTATATCACGCGO^G^ 

SEQ ID 2054 

IiADDGPDIAVHC3lSMU)EmVAEEIRALGRNAR^ 

SEQ ID 2055

AAAAOXMCGCAAGCACCGCAAAGCCGCAC^ 

SEQ ID 2056

KTQWCHRKAAPAPPQPFFRRHFR*SAKIARLSFH 

SEQ ID 2057 

ATGCCGTCTGAAAATATTTCAGACGGCATT^ 

SEQ ID 2058 

MPSENISIXjILFVMLH)II£ra 

SEQ ID 2059 

ATCTCAAACCAAAAAGCCTTGGTCAT^^ 

AACGCCATGCCGTCGAGCTGGAACGCGC^ 

OKOTraTCGAAACCGCCGCCATC^^ 

CGGCACATCATCGCC&GCGTGTGC^ 

ACACGCCGCTGATCTACCTGACCMGGCGCAAATO 

CGAATGCCC(^GCT(^TCTTGCGC^^ i . 

SEQ ID 2060 

MSNQKALVIFSGGQDSTTCMQAI^ 
RHIIAGVCETDFSGYPIX^GVFVKS^ 

SEQ ID 2061 

ATGCCCAAGCTCCATATGTTTTACCTC^ 

C(H I GGTTCGGCGATAC6GACAAAATCCACATCGAC(^ 

GCGCCTGTATTTTGCCAATGTCGGCGGCTATCGCGCG 

TTGACCGACTATGIPPCGGCAGCATAAAGACAAOT 

AAATCGGCTTTCAAGGCTATTTGCTCATT 

SEQ ID 2062 

MPKLHMFYLGGNAGRSNIEVHDIQFAVCDD^^ 

ltdyvrqhkdnlkd™ 

SEQ ID 2063 

TTGOH'GGAAACATATCCGCTGOTCAAA 

AGCACXSTTGTCCAOTCTTTCMGTO 

GCGCCTCXXX^GCTGCCCCGCGCGATAG 

oattcgacmcctgcc^ 
tcgatattcgaccgg(xx3gcattg(xxxx^ 

ttgcgccttggtcaggtacatcagcggcgtgtgg^ 

TCGGTTTCGCACACGCCCGCGATGATGTGC^^ 
(X3GTTTCGATGGCGGCGGTTTCGATGGCGGCGGTTO 
CTCTGCGCGTTO^GCTCGACGG^ 
GA6AAGAT6ACC&AGGCTTTTTGGTT 

SEQ ID 2064 

LRGNISAASNGQIALKADFGGFAVRIRGQAE^ 

PFDNLPTVDVDFVRIAEPRGFECm 

LRIiGQVHQRKVDLKIVVHRQIKGNVHGFDEHAAATO 

SEQ ID 2065 

GTGtAGGTTTPGGCATTTGCCGTCATGO^^ 
GGGTGGCT 
SEQ ID 2066 
VQVLAFAVKPVEHMGVBGKYLGNFHPDSFSVm'K 

SEQ ID 2067 

ATGGTGATGGATTTCACCGACCTC^ 
. AGGGCTGGAACATGAAAACCCTGCXXXHtMXX^ 
MCGCCGACATCGTGCGCGGAGTATGAAGGGGAG 

SEQ ID 2068 

MVMDFTDLKAIVKQHITDPFD^ 

SEQ ID 2069 

ATGCOCaACCXWGTCi^^ 
GCGT0CTTATT 
SEQ ID 2070

MHIHVIPAQAGIQTLHQEYLKIMIP^ 

^ATCiAATTOXX^TCCAOOCTOT 
AATTTATGGAACAGAMCGCGTGCAGWXCTtnTTG^ 
(^CGGTACTGTTTCTCGGOTCG^ 
GGCGGTATGAAAAAAATCAGCGTOSCCCCCGAAAAT^ 
GCAAATGCAATCTG6CGTGCG(K^rcG^^ 
CGGA6AGCCGA(X2ATACAGCCGCATCTCGATACGC1^^ 
GCCACCAGCCCCAAAGCCTGTTATGCCGCCAAATA^ 
AAATCCGCGCGaTCATOC^ 
GTTGAGCGTXK^GACGCACAAATGGGCGGGAATAGAG 

SEQ ID 2072

GQJKKISVAPENPQYBIVEIFESLQGEGWOT^ 
ATSPKACYAAKY^CIOTADEVRIVATC 

SEQ ID 2073 

GTGTGCCGCTTCXIGCGTGAAACTGCCGA^ 

SEQ ID 2074 

VCRFRVKLPTFPPPNQDEAUSNKMPRU3TRQAS 

(X3Ga^(^GGAGmACCGCCGAAGC^ 

AAAGCCAAAGGCGTGTTCAAAGCCXK^(XXKAAGA 

GCGA(^TGCC(?rCTOAM 

OTGCCCAAG«XX3A0^ 
CGCCGTCGCTTGCCXmm 

ATCX^GCCATCGAGCCGCATTTCAACGCAGAAAC^^ 
CGCOXmAATACX3AACCa3^^ 
CGA(»TGAmmTTGCCM 
ATCGGCATGGGOtjTGCAGCATTACGGCTGGTTTAC^^ 

AAAGCGMGOCGA^^ 

CAGCGAA£^CC(mCGATOCTGAAAGAAGACC^ 

ATCACCGAATTGGATGCCGAGAAGCAGTTTGTCTC^^ 



LPKADWAFSIDDBOTEVDDALSLTDLG^ 



IGMGVQHYGWFTSPIiRRAADYINQKQLLSLIDOTA^ 
ITELDAEKQFVSIiNyiKAAAPGGKTAGNAV 

CGCOGGCCCTT 

SEQ ID 2078 

VOTAASYSLRIjRPDWAAEMSAVSRGSGTIiSPYVKHRQIIGTPAL 

ACCGCCCCGGCMWX5A3Xmm 
SEQ ID 2080 

SEQ ID 2081

T'KHCAGTGCGCGCAAATGGCAGGATTGG 
CGAG6CGGTCT6AATATG6TCGGMT6A6A6GAT^^ 

SEQ ID 2082 

LSVRANGMGEHEHPSBTMTCF 
SEQ ID 2083

GOCGGCAACAAAGGCTTGCGCCGCTTGGAMCC^ 

SEQ ID 2084 

VIIAQKimymPFCRVSGGmPIiRLIiRRPPCPAARQGRQ^ 

SEQ ID 2085

«aTCCTCTTCaXXXmaTCCAAA^ 
GAGCTTTCCGCCTGCGGCATCta^
CCCTCGOX?rTCAA^ 
. CmCGMCTCGMGCCGO^CCTCXm^^ 
GAAATCTGGCTCAAACAAATTTTGCGCCGC^ 
CCTTTGAGGCAGGTTGCGACATCGTCCTC^ 
AAACTCX5CTCGGTCATGAGGCGGTACAAG 
GAGGCCTfP 

SEQ ID 2086

LEQTAffiVmPRGPVMADIA^ 
ELSACGIDLSPTFV^^ 

EftF 

SEQ ID 2087

ATGCAGCTGATTGACTATTCACATK^TTTTT^^ 

GATACTTTTGGGCATTTTCACTTCACTGCTG 
GTGTTCGTAACCTTTATCGACGACTATTTCCACA^ 
CCTCGCXCATGTGCGTGCTGATGOCCGTT^ 
CATGAGCCH^TGAACTATTACGCGCTGTTT^^ 
6AC6AAACCGCC6CTTCAGACGCTA0CAAA6GTOT 
COTCA(X^TTTTGGGGGCA 
TCCCAMGCCGTGTGGCAGGGTGCGAAATCC^^ 
GITOX£GCAACATCCA!P(X£GG<T^ 
CCATGGCGGTCAAAGTCGMCCCGCGCTGATTAT^^ 
aSCGCaa^mCCACATC^^ 
GGCACGACCGGTATTGTATTGQCGGTGCI^ 
SEQ ID 2088
mql:u)yshsffsvvppfiaiaiot 
vfvtfiddyfhslavgaiarpvtdkfk^ 

DKAASDATKGRVYALIIPVLALIASTVSAMIYTGAQASETFSILGAFEOT 
YAGNIHI^raLFLIASV^ 

GTTGIVliAVLIFLLKDKKRADV | 

SEQ ID 2089 

ATGATGCTGGCGGGCGGCGGCGMGMTTTTGCCCGTCCGA^ 

ACCGCGAQsGGCTGGTCATCGGAGAAAAGGCGCG^ 

GA0GGCA5X^TATCGGCGMGGCGCAGCCX^ATS^TCA 

SEQ ID 2090 

MMLAGGGEEFCPSBVYVFDSLYAASRRNGEPEETPRPYDANBIXjLVI 

SEQ ID 2091 

ATGOTAAAGATTGGAAAGMCATA(XXXOTTGOT 
CACTCGATGCCAAAACGCGOSAftC^ 
CGCAGGTGCATTGGCMCCGCCATTGCCCTGMTGCGGffK^ 
SEQ ID 2092 

SEQ ID 2093 

G1OTAAGCGTCCG(^TGGGTGGGTATCTGTO 
TAAAATGCCCGTTAAIKX^ATrGACGGG 
SEQ ID 2094 

VL5VRMGGYI>SASGIX3FVGKMGGNIFCRKKY^ 

SEQ ID 2095 

GTGTTCAAAAAATACCAATACTTO 

TCGAACA(ACCAAAGACGAC«3CA^ 

CGGCAGCGGCMTGCCGAAAOTATTCCGACCCGC 

(X&TTGAACTTCXMTTCGGGCTO 

O^CTCATCGGTTCG^^ 

TGCCATCGG(X3CXX£CTT^^ 

CCGGGCMTTCCGGCGGCCCGCTGTTCMOTAAAAGGA^ 

CCATGMTGTCGCCGAACAGCTGAAAAACAC(X3^^ 

CGCATTGATTGCCAAAATCCTrcCCGGCAGCC^^ 

GTCGGCGCCATTACGCCGGGAAAAGAAGTCAGCCTCGGOT 

ATGMGCCCCCTACAtXXjAACAGCMTCCGGT 

GGCAGAACGCGCAGGCTTAAGGCGCGGNN1W 

GTCATGCGCCGTGGCAACAOGCTGTT^ 

SEQ ID 2096 

VFKKYQYFAIiAALCAALLAGCHCAGSPFGADKKEASFVEM 

(^HFGSGFIISKNGYILTNTHWAGMGSIKVL^ 

PGNSGGPLFNLKGQV^^ 
VMRRGtfTLFIAIiNLQ

SEQ ID 2097 

GTGCGAATATMC(X3CAAMOT^ 

SEQ ID 2098 

VRI SRQILTAAAIiQNETLENSDGIU^ISTYAV 

JGTGCMGCGAAGC^^ 
CTACACCTGCMGGCTTTAAAGCCGCM^ 

SEQ ID 2100

VPE^RBM^PGVGRKTANVVLmFGHPm 

TATTTTTCCMCAOGATMQGCIW^^ 
TATCGGCAACCGGAAACAGCTTCGCCGTCGCC^^ 

SEQ ID 2102

HRIDRHNGVSE8CVQYH\OFAPHABCRFQSIAVFRHIAVVFF 

SEQ ID 2103 

ATGCCGACCGCACAAACCGACTGCACAAAAAAAAT^ 
TCGAGCimMTCGCC(7I^ 
CXSGCGTGATGGMTACACGAAAACCAT^^ 
SEQ ID 2104 

CACG&TTCAACCOa^^^ 
GCATTTTTCGGGRSGGCAGCAAACGTCCCATCCMTAGGAAACC^ 

SEQ ID 2106 



GEGGFGRAGQEGGGSBGGKSEGGVGVGGYHCKQSQTGKNERRIffiYFHM'AEGRRTVQKYGTAVCP 

ATCCci^CA&AAM^ 
(MTffiTACXXITCCTACACCTACffiXXXXX^ 

CffTTraXSTAmBGCTOOTGCGTTO^^ 
WUiCGGCGCGTTACGOt?riCATTCTGTG(^^ 
SEQ ID 2108

RFGINIMFAHLPVVGDALPLtAGHIiRLKP^ 

SEQ ID 2109
ATGATGTGCCAGCCGAATPGGGT<XK3GACAGGCGCGCCGAC^^ 

CATTGCCCG(OT1«X3TCTT«^ 

GTCGGCTTTMraGGATGTGTTGGGCKGGTATOjGCGCAACGGTGC^^ 

crem^TCftCTGCCTarrGG^ 
caxrrcACTircc^TOira^ 

QgT(3GCTTITQGOTCQGCAACGGCITO3GCAAGO^ 
SEQ ID 2110



VGFHQnmVIAQRCAF^^ 
GGFGFQKjFGKPAVGTOFVVGNDGNAV 

S ^araGAAG(«rCOCAGTAACG^ 
aBAAOGTTTCCCO^^ 

TCC0G0GGAGCAGGCWTGATGGATGCAGG!TTTGCTCGG<X5ACGAAAGCOT 

GMTTGTTGOT3ACCSGCAOGTCGCGCAACTOX3GCGOCAA^^ 

ACA^GCtfOTOTOTCGOJC^ 

TTGGAAGMTOSAACACGCCAAW^^ 
GCGATGOTATTTTTOWIGCTATGCAGCTTCTGGOT 

CMaCGOCGaXTKCGCCC^^ 

GCACGTOmmCCGCAAACCG<aCACAC^^ 

TTCGGACGGGCAGAACGACCCCGACCTGCCCGCCATTAACC 
SEQ ID 2112

KLLLTGTSRKFGANTYVRMMPHTADANIGITO 
IiEEtEHAmGAmAELVGYGMISIXSINIGEGJAWI 

SEQ ID 2113 

TT6ATGTCCGTATTATTGCAATTATTCAGTTC 
GTGOGGATGTGCAGTGCv:mi\iGGCGGACAA 

SEQ ID 2114 

UOLLQLFSCVSMJISFYGSKR^ 

SEQ ID 2115

TTGATGCCCCGATGCGGCaOTCCGAATIT^ 
ACTGTAAAATATTMTTTCTGCTGCT 

SEQ ID 2116

I^RCGISSFSGYGGGFSHWERF^^ 

SEQ ID 2117 

TraATTTCAGATCGGCO^TTGCCTC^ 

SEQ ID 2118

LNFRSAHCLHPIKFAKCSNISMICIAVKKCQFLPS 

SEQ ID 2119

ATGTCMTTTCTATTTTCTTGAAACGGGC^ 

TGATGGGTTTTATTAOTtXATGAA(X3ACATC^^ 

GTCC^TCCCGATGGGGGCTTTTGTCGG^^ 

GCGGTATmrc^GCGTOtOTA 

TTCAGGCGTTTAACGCTTrGGGTACGACCATT^^ 

TTOGGACTGGCGGGKTt^^ 

TGGCAATACAAACATCKXSTGTTC^ 

CTOCCG(XX^TTACCTGTCGTTTTATO 

CGTGCTGCTOCO?ITOX^TGGCGAC^^ 

AMGGGTTGGGCAAATTCACCAACGCGGCTTCOT 

TT6TC(OTKXGTTATCT^ j 

SEQ ID 2120

MSISIFIiKRAIFKfflERQIM^ 

AWLGALFILASGVTUjQVAGNPYVTIjL^^ •■ 

WQYKHLWGTAGIFCYVGAEVSIGSIMVNV^ 

KGLGKFTNAASGVLCTAIVGGAV^^ 

SEQ ID 2121 

ATGTTCAGACGGCATTTCTCAAGAAAAAGATTATTO 

mTCCGCOmaACCTiH^ 

TCGGGAACATAATQ^GTTGAAAAAACCGATQGCAAGCA 

CGCCAMTAACGGTTGGGCGCGAATTTCGCC^ 

AGATAACCCMCACATTGACCATCAGCGAAC(X3AT^^ 

6GTTCTG0G0GCITO!CTCGGCGG^ 

AATCTGTTCCGCCTTGCTGACGGT^^ 

CAGGATAAAACAGCAGGCACCOSACm^ 

AAAGCAAAATTGAATCAGCATOSOT'GMOT 

ACGACC^TGG<X&IOTATGGTTGTTTTG 

SEQ ID 2122 

MFREHPSRKSLFVRPISVHRKKHDIQVTDN 
RQITVGREPEHYIffiTEBTSEHRSAPIKRQVMRGRSMIOPF^ 



KAKLNQHRLNVREVENFF^MRCKDVVHAG^ 

SEQ ID 2123 

ATGCCGTCTGAMACGTOJGGCAGATTCGGCATG^ 

SEQ ID 2124 

MPS^VGQIKHGVLSSWISNRAKNHVCFQILTRYAAR 

SEQ ID 2125 

AKXKGCa^mGraCTTGCCGTCGT^^ 
OTACG6CTT6GCGCGC0GTTTGAAATACAAC6ATAT^ 
GATGATGAGCGGCGCGATTCCG^ 
CTCACCGCCTCXXXX^^ 

GOTOTCCCAGCGTOCaX^ 

GcamAiraxxHrnwra?^^ 

ACAAACTCmaxmGCGmAAAJ^ 

TGCGCTCGGTGTGATTCOTCCTT^ 

TATTOAGCATCCTCCTTTCGGGA^ 
CGTGGAG(X3TGTGCG(XX>TATTTATCAGC^
(XmCTTTGAGCAAMAA 

SEQ ID 2126

MPRGSAIiAVWALIAMjOTISLEWLPHMSIIAAITO 
LTACATVWAFHGMAAAFQAD^ 
ALMRVKAVVSMIiFmAAV^ 
YLSIIiliSGETFKPVYDKIiGIJISCNIiSR^ 

(XOTI^JACGGGCAGAAC^^ 

TTAATCATCGGA 
SEQ ID 2128 

LQHAGUtfEDIGRB^^ 

LIIG 
SEQ ID 2129 

GCACGGGCGATATTGTATTTTOCCGCGCXS^^ 

CGGCAACGGGCTGGATATGAGCCTGTTTTGCGC^^ 

GTGGTTTCAATGCTGCAACTGGAAAGCGAAAAACC^^ 

ATGTCGAACTCTCGGCXXftATTGAA 

TATGTCGCGC(K!GCAACTGATGCGCCGC^^ 

GTTTTGGAG(jTCGCGCTGTCGGTGG(^^ 

SEQ ID 2130 

MCSAFWAD^BTLQREGLVHIVTAGSGY^ 

VVSMLQliESEKPl^TVSVVNALPSWIJiV^ 

Vl^VALSVGFQSErHFGKAPKRQYHVSPGQYRKBG^ 

SEQ ID 2131 

TTGTATCGACTTAATCCTGAAACACAAAAGGCAG^ 

SEQ ID 2132 

LYRLNPETQKAGLRHNKAVSFIKVGISSIiLT 

SEQ ID 2133 

GCCAAAGGCAGGOGAAGTACaX^GCTTC^ 
AGGGGGTGTCCCGCAGCGCTGCCXXXjC^^ 

SEQ ID 2134 

AKGRRSTAGLRYLVPRQAK^^ 

SEQ ID 2135 

TTGG(^CmATAMTATCAGCCGTCMGCX^ 

TOOTTTTTCTGGAATAGMGAATCA^ 

TTGCGAAATTGTCGATAATTGCAATTTT^^ 

SEQ ID 2136 

IiALTOQPSSEWGQSMAVIAQSEFVE^^ 

SEQ ID 2137 

TTGCCCGTTTTTGAGGATTCGCAMT^^ 
(XCCmTCTAA(AGGGG(^ACAGAAA^ 
SEQ ID 2138 

IJVFEDSQMGDKQGMTKAVA^ • 

SEQ ID 2139 

atggacggcgcacmccgaaaacaaatttttt^^ 

ttcatgccgacacactgacccggctggaaaaagt^^ 

catcaccgcctacgtcatc(^taccgcccattc^ 

acggcggcacgtcgggtton^c^ 
ckxx&^tccacgcggctaccgam^ 
gtgogcggcksaaaaagtcotatcggcggotgca 
SEQ ID 2140 

MO^QPKWFBRLIARIAREPDSAEDVIi^^ 

QFHLKSVLRPAVFVPEEKSW^ 

VRGBKVXiIGGLQWARADKRRLHTUiATRVK 

SEQ ID 2141 

A!TCTTCAAT(XOTCAA^ 

TGAAACGCGCCAAAAAATACCCTTT^ 

GTCKm , TGAAAMC(^TA^ 

TTCGCGCTCAACGMGGCGAMTCCTGCOT^^ 

GGaTTTTGCCCACTTAACCATAa 

TOCCAACCCCTACCGAGAGGAOGGACAT 
SEQ ID 2142

MPNAFKEtfKRRGVffll^ 
FALN&SILPCQVSEHLYCaM 
SEQ ID 2143 

TOMCGCATTGAACATGAACCCG^^ 
TCGMATTTTGGGCATGACCACGCGCGGCGACCG^ 
TTOGCGGTGCMTCGATTAAGGACXa^^ 
GAAGAMTGCCCGAAGGCGCGGTTGTCGGCACATCCACj^^ 
CCAMCTCGACAACGGCGAATA^ 
ACAAGGCGCATTGGGTATTGAMTTGCCACGC^ 
TTGGGCGGAAGCTGCCAAGTGCC^^ 
CCCETGCC^TATGCCGATGCOT 
SEQ ID 2144
LNALKMPKKLVIASRESIiLW 
EEKPEGAVVGTSSIiR^ 
UXSSCQVPLAAYCTEENGLUI^^ 

SEQ ID 2145 

TTOTTTTAATTAATO 
(BGATTCCCGC^^ 
SEQ ID 2146 

LFLINLFHKFYSFYFVIPAQAGIQFARFQLFIJ^ 

SEQ ID 2147 

TTGQG?nXXa(aTTCCCGO^ 
GCTGTACTAGATTATCCCTAAATTC^^ 
SEQ ID 2148

LGFQIPACAGfefCAERFIiWLItf 

SEQ ID 2149

TTGCTCCCTGCCGTCAAAAAGAAMTC 
AATTTGCAGAX^^ 

CGMTTTCGACTTAACTCCGGCACACC^^ |! 
SEQ ID 2150 

LLPAVKKKIMPDGIWADSPGSR^^ I! 
SEQ ID 2151 

{mSATtmX^TACGCECC^ 
GCATCGC<OCTTCGTTCAATO^ 
CGGCAAACCAGCCCAAGTCGCC^ 




TCMTTI^GCTTCGCTCACXXEGC^ 
TTOTGCGACGACGGCATCG^^ 
ACTTATCTGCGOTCTTCGGGCAGGGTGGC^^ 
COCTGCTGTTGTGCGGGTGOGGCTTCTI^^ 
CGGCMTOTCAGGGCTTTGATfTTC^ 
SEQ ID 2152 

ADVDIRAGMQVAQQVIX5QiSGFLI£D^ 

RADGOTAVGFHQDVl^^ 

37LRVFGQGGPGFGNGFGKPAVGI3}FVVGNIX^ 

SEQ ID 2153 

TTGATTATMTGGCX3GCTATGAAAAAATACKOT 
CCGTCAAAA(XjGAAGCACCTGCAAAAGGTTTTC 
CAAGGTTCAGGCGGCGCAATATCTGMCAACTT^ 

TGAAGATGCAI3CCCTTGAAA 
SEQ ID 2154 

Lim(MmiPI£IAAVI£GC($IYVrara 
IKTBQSKLYIENALRC^QQRWKNMDAKPD^ 

SEQ ID 2155 

TTOTTTTTMCAAAAAACAGATGCO^^ 

SEQ ID 2156 

LFFNKKQMPSSLVKVQAAFSYGCAFYSIFH 

SEQ ID 2157 

TTGmGAAATTGTTATAGmATlOT 
AAATAGTACAACACTCGACGTTGAAGGTT 

SEQ ID 2158

IfSIVIVYC?I^NHCRIWRFRR^ 

SEQ ID 2159 

ATGGCATACTCTCCGGACTTAAGAAACAAAGCTTTAA 

ATCACOTAGG^TCCTTO 

TCAAGAAACACaSTEACCTGCG^ 
SEQ ID 2160
SEQ ID 2161 

jfflGCOQCCftAMCGSATMGCAJ^^ 

ACAGATGCGGATG^^ 

AACAGGATTCGACCGCCGCCTGTTCCGCC^ 

AACCGGCCGATTGCTCCGATGGTTTGTCAA^ 

OGC^TTTCGCCGTATGGGTCCCTT^^ 

G(X^ATCTGCGAACCGTTTKHK}^ 

SEQ ID 2162

MPPNRIGKNRU»^ 
HRPIAPMVCQNmGWFEAR^ 

SEQ ID 2163 

TTGCWXXn!GTACRATATCATTTCX^ 

TCCGCAAAAAAGGTTATCAAGTCTGCGTGTn^^ 

GTTCGCACCGGGCCCGGCTTTCAffP 

SEQ ID 2164 

• IjLFVQYHFLraSNHQD^^ 

SEQ ID 2165 

TTGAAAAACGGTGC6GCGT?PTCGTGGGGCAGCCGCT 

TGATTGAAC^GCCGCCAAACAA 

ACTGACCGCGAAATTCGTCTTGGACGCMGCGGCTA^^ 

AAC^TTACCCACCCGAAATTCGACXXjCMCAA^ 

(OC^CCCGACAAACTTGCCCKKXM 

CATCCM(X3CTATTCC6CCAACGTCAAA3^ 

GCCGAACTTGCTQCCGATCTGCTGACAAAAC^ 

ATCCGAACAACGCCTGACTGCCTTCTCAGM 

SEQ ID 2166 

LKNGAAFSWGSBXTEFDFTDKFSW 

HITHPKPDRHKIIimHP^ 

AELAADIiLTKQLKGEAADWQTEPAEPI^ 

SEQ ID 2167

TTGWCGQCAftGA^ 
AAAAAAGACATTTTAGGATCGGGCTGGATGC^ 

SEQ ID 2168 

LFGKKNKlSASEAGKlfrFHKPCRKVYNPTVL^ 

SEQ ID 2169

G(X3CCGTTTGTTCCX5TCCGGC 

MGACGGCGTTTTTGTAGGTGTTTTCGCCOT 

OTCAAAACCGTAGTTGTAO^ 

mTKXnTOOXXOTATrTGTAC^ 

TCGAGGGTGAAACGCCCGCCTATG^ 

GACCGATACCGOXXXXSAAACGC 

GAAAAGCTGCCCGTAfiCOGiU^GCTSCTTO 

OCG&ACAGGOOCOTAITQCQTTCGAAGQTT^ 

GCAGCCCGAGTTTGGGGCGGACGTAGCCC^ 

GGACACGCCGATTTGCGCCCI^CTGCGTT^ 

GAMGGCCGGCATTCAGGCTGCCTCCXGC^^ 

TGTCXX&GACT^^ 

CAGGTCGCTCTGTCCGCTCTAATCGGGACGC^GGTM 

TAATAQGGGACGGAAAGGGmCGCaiX^^ 

CGCCX5AACACGAAGGCGGOTGTTTGGCAACGOT 

CGTCAGTTTGTAACGCXJCTTCGCX^ 

prCTAGGTCAGGGTTTCGCCCCGMTCAGCGTATC 

C(mXXX3TTCGA^TAACGCTW 

GGCGGCTTCGGTTCTCTOGGGGCTGOOQCT?^^ 

cGCcrarammccGccGmcOT 

SEQ ID 2170 

GPFVASGGKHVGGD\fc^ 

FFVAPVFVAGVQHFSGGGAVACADGVMFGAPVVLVVV^ 

EKLPVAEAAFRRIEIGQVVLRPGRNIVVEQARLIX3 

GHADLRPACVFmGfKM^ 

QVGLSAVIGTQVTDLSVKRRAAFADNAK 

RQFVTPFAQHFGGAA0MiQPPSAliFGPHITWVRFAGI^ 

GGFGSIiGAAAFVTAKQAGRTEGQIGSI^^ 
SEQ ID 2171

ATGAGTAATCCATTTTCCTCm 

ACGACCTGCTTGCCGCCGCGCAAACC^^ 

TATGCTCCyiCTTGACCCCGAC^^ 

CAAACOXmCCTGCGS^^ 

CAGACCGCATGTTGGATATGGGGTTTATCGACGACi^ 

GCAAGACTTCATGAACSOGOCOGAAAC^ 

CTGATTOTGATCTGCATATGMCCAAOTA 

OTCCCAACAAAGCAGGCTCGAAAC^^ 

TTACGAAATGCCGjCCCAGCCCGMGACTA^ 

ATCAAAGAACTGACCGGCAACAAOCTGCTCATCGA 

CCGGACGTGCGCCCTGCTCCAAC^^ 

SEQ ID 2172 

MSNPFSSLGLGTELVSALTAQGYEMPTPIQAAAIPKALAGHD 

QTAIMAGCEITOTVGRLLD^ 
LIVDLHM^VIVFCOTQSVDRVT^ 
IKEI/TONKLIJERIEGFEPI^ 
SEQ ID 2173 

TTGGGGCGGGCACC6TTTTATGGCATMTCCOT 

SEQ ID 2174 

LGRAPFYQI IRTQIPVPAVHRRAVYFPVLSVSRFERHK7 

SEQ ID 2175 

CTGCCCGCCCttATCtt 

CCAraCACTTGOXX^^ 

CGGAAAAACCCTO^CC^ 

CAGGTCATCC&CATGAATAT^^ 

CGG(m , ACCCXnTACCCTCAAAACCOT 

GCXJCACGCAAATGTACAAAGGCGAGGCGCGTTACGAACl^ 

CAGMTGCAGAGCCGCCATTTTGMCCACATC^ 

(XSAACAGGaXXXmGAAA^ 

GGG 

SEQ ID 2176 

VPAPIAGASATMSCIiKHVQWRTFLKRGPIMHIG^ 
QVIDIMGCPAKKVCNVQAGSALMQDEPL 
UQTAADGIHIGRGAQGRFWFFRDIiKHYAEHSW 
G 

SEQ ID 2177 

ATGCCaXGTTCGGGTTCGGACGO^T^^ 
AAAGGCTTTCAGACGGCATTCX^TCP 

SEQ ID 2178 

HPPFGFGRHLSAWPGGRAESW^^ 

SEQ ID 2179 

ATGAACOTXKGACTGCGGACAITra 
AGCCGCTGCTGGTGTGCGTGATG6AGCMTGCGGC(K3^^ 
SEQ ID 2180 

HNPATADIAQCIEQNLNQYFKDLN^^ 

SEQ ID 2181 

C(XX?IX3CCGATGAGCCTCT^ 
GQCATCGAC(XIT6ATGCTCGGTC^ 

G(XTT<rCGCCAAACTCMTOT 

SEQ ID 2182 

VNMSATVRIIjGIDPGSRVTGFGIIDWGRDHFYVASGCIKTPAD^ 
KAAKEQVQHMVVi3MLGLSGTPQADAAIXnA 
SEQ ID 2183 
TT(&GTGAAMTAAACGAGCCAAAATC^ 

SEQ ID 2184 

LSENKRAKIAPQVGLFVRIWM 

SEQ ID 2185 

ATGCCGTCTGAAAAAGCTTTCAGAC^ 

0»CGGCATTGCG1^^ 

(mACATCCGCCCAAT6CGAA666(mGGC^^ 

CCGGAAAGGACTGCCGCMTGGAAAGAGGGATAAGGTATTT^^ 

GCAGGCAAAAACACATKX^ACMGCCTTGCCGCMGGTO 
G03GCG(XXrKK^TCACX^

AAC(%GCGTTACCCTGAGTTiACA(^^ 

iJ\CCCGGCTGGCGCGTC(m3T^ 

AAAGA 

SEQ ID 2186 

hpsekafrrhlrtaspqglhlhhfhqkvgkcgiigfgihifptIiL 

OTRPMRRGEAKTPCRCPRFIX3IIX5ffiGDFRQ^ 

AAACFTVMNVLIKEASAKFAIjGSG^ 

VLLUTOFRSGQEPAALAGI^^ 

FWEELFWQEIIi^niLSGII^^ 

SEQ ID 2187 

TTGGTTTCACAGGOCGGCAAAAAA{KX^^ 

SEQ ID 2188 

LVSQGGKKAQTYVRAWELVGPVGFEPmGL 

SEQ ID 2189 

ttgcc(k:ct(Maagaaaatgtgtaccga^ 

ftCCTTCTGGTCAAACXXXX^ 

GATGCTCGAATACGGCTOATATOGGTAOGOGTCTO 

TACCCGCACmACCGC^ 

ACCGCTATCACAACGTCTTCCTTATCGGGC^ 

TOGGTTTTTGTGGATTTTTTCGGCAT^^ 

GTPACATTGCAATTCTATCCCGCTTGGA^ 

GGCTGCACMGCGTTTCAAAACXXXSTC^^ 

SEQ ID 2190 

LPPEIKKMCTQKFIFFVLYVLQPI1PFAI1I1HKM 

YPHPTAFEMAVYAUIQDVPIJSMySHQianCIIjDBQ 

VTLQPYPAWXSFPSQttQADAQRM^ 

SEQ ID 2191 

TTGAAACaTOGTTCTA^ 
AACAAAGGGGAGTGGATGCCGAAAACCCGOTAATATATTA 

SEQ ID 2192 

LKPGLYKQDLPDGPNGSVVCKMAFLKLTKN^ 

SEQ ID 2193 

TTGTTMTCCACTATMTATATT^ 

(^ACTGAACCGTTAAAACCGTCCGGC^ 

G 

SEQ ID 2194 

LLIHYNILTGFSASTPLCSRNKPLKPI SFFVNFKKALFLQTTEPLKPSGKS^ 

SEQ ID 2195 

GTGAAAAMGGCGTACATTCCGCTACACAGAATTAC^GATACAGCGGAGCAAT 

AGTTTGC03GCGCAGGGTT(aOT^ 

SEQ ID 2196 

TOGVHSATpl^SGAMPSBRIFRS^^ 

SEQ ID 2197 

ATGCAAGGTACTGCCATGAATACGAT^^ 

TCGCGCAGCAGAAACGCCGCGCACAGACCGGC^^ 

CGGCGAGATGGAC6ATGGCGAAGMGTCGTGATTGCCM 

OKTCCCTGATATACGGCATCMCGCCGCGC^ 

ATTTGAT6GATTTGCTCGCGOTTA 

aACGOGGGCAfiGCTGAAGGAMTCOmm 

GAAATCAAGCGGATCAGAAACTATCTCTCCXKXXIAA 

SEQ ID 2198 

MQGTAMNTILAFDIETVPDVQGimYD^ 

RSLHGIllAARYWDTGDGDFGDSRDPKWNNyiSRYHQ 

EIKRIRHYLSAQTEDKPHWAE7VQAWK 

SEQ ID 2199 

TIGGACAAATTCTGCCCAATGCGGTT^^ 

A(&TACATCAAATACCTATTC^^ 

CtX^OGTTOGCCATATCGTCC^ 

GCGGCTGTCGCXX3AAATOGCXXnXX3(XXXj 

TGCGG(XTATGTTTTTCAACX^TTCGAAAAAOT 

CCGCCACMCCT(&T(X&(Kn^ 

ACGGATACCCTGCACArCGGGTACGGTTT^ 
SEQ ID 2200

UJKTCPMRFVTCLGGE^ 

MVAmVmPmGVDAVYQGAVMQYGQVETMVP^ 
TOllfflGYGFNIESQKRIHGSTIiHSKTD^li^ 
SEQ ID 2201 

SEQ ID 2202 

LSKRGNRMiLSBGGHiVPPFRPQT 
SEQ ID 2203 

ACGGCGGCAGCGTCGTOTTATGGATGCGCCGC^^ 

TTTGGGGTTOntSGTATTGAACGACTTGGGCM 

TTGCAGAAGOCMGCCGTGAAGGGGTTTTGCCCGAATAT^ 

AACAACCmAOTTGGCAGCAAACC^^ 

GGGCGTTTTAGAOTCCAAGACGCGCTO^ 

ACTACCGCGACGGCAAAGACAAATACCGTCCGGAAATC^^ 
CGGCGATGAACAACTGGAAACGGGCTTTACGTTC 
SEQ ID 2204 

MQRQTELKKWLQTVre 

WKASRECVLPEYDRETffliREiraiPPEW 

WEKARAAC&mAEFDEFYRI^^ 

SEQ ID 2205 

ATCAAAAAAGTC&GCOTATTGATTGTrG^^ 

CTGCCGAAATTGCCX^GGGTTTGGGTGCAAAA^^ , 

T(£AGACGAACGCTGCACGOT 

GGCGCGATGCGTCXXXSACAGCGTATGCCMTCT^ 

ACCATTACACGTACGAC^CTGGGAACMTATT^^ 

TATTATCTGTACAMTCCGGCGGAAAATTT 
SEQ ID 2206

MKKVSVLBJAKNEANHIRBCIESCR^ 

GAlffiPDSVCRLMPKKGGSVQGKV^ 

YYMKSGGKF 
SEQ ID 2207 

ATGAGCAAAACCGTCQOTATCTGAAAGATTACCAMCGOT 

AGCCGCAGAGGGCGGGCGAGCCX£TGGTGTT^^ 

AcanrcGAAcanrcAc^^ 

CGCAAAATCACGTTCTACATCGACXXnTO 
AGTTTTCAGACGGCCGCCATTGGGTGAAATGGGMGA 
CAGAAACGTCAAAATCGAGTTTTACACCACCGMGCGGACAA^ 
TTGGATATTTTCATGGTCGTCGCCCT 
ATTTCGAAGGCATOAATCCGTGGTO 
CCAAGAGTTTTCCGGCGACCGOGCGKSOCGCG^ 
AGCTATGAG6AGATGAACAATTTCTACACCATGACCGOT 
TCCMCGCCACGACX^CAGGCAGTGACr^^ 
GGAAGCCGAAGGCOGTCTGAAAAACMTGTTTTCGAGTTM 
AACCGCAACGGCGAAGCGGTGGCATTCGATTATCAG 
CCTCGCTGCTGCGOGGGTTCAGCGCGCGAGT^ 
GCTCTACCGTCGCGCCGTCGCCCXX^COT 
GCCTTCAAAGCCCTGCTTTTGGGOGTGCOT 
(XHTOTGCCGAAATGGaCGAA^ 
CCTGCGCGCC(»C(Xa^^ 
AACTG0CTGCTGGCGCAGTTTGCX2GACAAG 
TGCAGCATCCGAMTTCAGTCTCGA^ 
• CAAAGTCATCGAAATCGACCGCTTCMCC^ 
ATTCGGGC&CACXjAAGGATTGTCXSAAAG 

SEQ ID 2208 

MSKIVRYLKDYQTPAYRILETE^ 

RHTFHDRPDVMSKFTTTIVADKKR^^ 

IiDim^VGDFNWGAM^JKGLNIFNTKFVL^ 

snnsEMNNFyTmmGAEvvRM^ 

NRNGEAVAFDYQGKRATEAVLUiTEAE^ 
AFKALLI/^SEAEU^ENIDPIiRYHQAREA 
NCLMQFADIffSDDAL™^ 
IHAQSGLSKDVGKIVGKILG 
CCTGTTGCAGGGTGTCGCTGCGGCGOT
SEQ ID 2210 

COTAACGOWTOGGAJ^^ 
KSCmLFHKAaAAAAS^VWVAAS^^ 

OT0CC®3(afiCTAAAkfcT0CTQCQGQMCGGTQlGQAMT 
SEQ ID 2214 

ATCGGATTGTTProGTTTGCCAAAAAiGCCGAAAGC 
SEQ ID 2216 

SEQ ID 2218 

MHPLYSLFHQIIWISVGWMAAHAFCPKFIIG 

S ?£^mTAAAr^ 

A03TOGCAGIOTATAATOTCTTCCT<aTQCATOriW 
KIiFKSGmySNrKSIKHQatNOTCGFDPDAITfFKSGVS^ 



AAT 

CATCTTTGGAAAAGA 
HUBS 

GATACTGCATCGGAAATATCCTCAAATCAGG^^ 
TGATTTCCMTAOTTGGGTGCGCAMAGGAMTTATIMMCAT^ 

S 5^SAxi^BQm^^ 



BGFTEKIASFYRQSDVVVIiPSLVPEAFGIiSLCEMiyCRTA 

a 

SEQ ID 2226 

BSaOjIiKMPliSIKMHIPSISCQKSSSPRKH^RAEKJTO 
SEQ ID 2227

MGGAMTATTTATACAATAAGGMGAATGAAAC^^ 

CGTATTGAGATG 

SEQ ID 2228 

MSFGHilATSQPCEIiLTKSRGET^ 



SEQ ID 2229 

TOXaAJ^GAAAATAG^QCCQCAGGMTTGA 

Oy^TCAAAACAGTTGTATTT^ 

GCAGCTGATGAAAACGTTMTTCAAA^ 

SEQ ID 2230 

LEKKI EPQDIiMJVYRAPDSVOTLNFRIVEKBQK^ 

SEQ ID 2231 

ATGAAAAAACAAATCIACCGCA6CC6TAATGATGCTGTCT 

TGCAGTTGGCGGAGCTTTCTCAGMGGAGATGAAGG^ 

GACAGQCAGACCAQOTCTGm 

AATAATATGGGGATAGCttOTK^ 

GTCAT0GC0CTTG(^TCA^ 

SEQ ID 2232 

HKKQimVMMLSMIAPAM^^ 
NNNRIAPFGNRTGHPIGKFPHYHRR^^ 

SEQ ID 2233 

ATCAAAAATATTCTCTTAGTATTTGTTA^ 

CCTACCTGAACAGCGAAAACAtX^^ 

GCTCGCCGGCACGCTCGGTTTQCGCTACGGACT^^ 

CGGATGWXGACATATCCG^ 

GAAAATCGTGGCTCATCGGCGCCACCACCTAC^ 

AGGCAATTACTGGATGCTGAArcCCAACATC^ 

GCAAGAAACACAOTACCTACGCCCAT^^ 

GCGTACAGCAIACATTT | - 

SEQ ID 2234 

MQTCIJjVFVSFVP1£!VRTO1*POT 

RMSDISAGISHTFLKDGKIWALIAFIiESTVY^^ 

ARHTSTYAHF^FGFTKT^^ 

SEQ ID 2235 

ATOTMTGAAATTraTGTMTCTG^ 
AAACCTCGTATTTAATTTTCAGACGGC^^ 

SEQ ID 2236 

MLMKFHVICLNNIFNDIFNTIX^ 

SEQ ID 2237 

CTGCGCGGCTTTTTGTTCTTCAOT 
TCGATTKXSMGGTTTCGGTCAGC^ 
TGAGGACTE<XG(K?PCTTCCAGOT 
CGGCACXIATTTCTiraK^CAGACGAOT^ 

TGTCGGCGGGGACGCTO^ 
SEQ ID 2238 
IiRGFLPPTmRQKVIXmTGIjPVGKPDn 
RQHFFLPDDWHfl^^ 

SEQ ID 2239 

TrGAACAAACAGGCGGOTXjTPTTACCATM 

CACCECITCCG(nOTAACCAGTTTCA 

GACCAGTGGCGACATTAAAAAAATTOCAGTGCAA^ 

ACCAAAGGCATCmCCCC^^ 

TCTGGMCGACCClGTTCTrATCnCCGAOT 

TAAAAAACTGC&CAGCGACGCACAAGCCGCATTTAATGOT 

GCCCCGCMGGCGTGAGCAGCGAAGCCGAGCOGTCCG^ 

TGGTTGAGCGCATCG(X!AAAGAAA£XX3^^ 

GACCAACGCGATGAAGCAA 

SEQ ID 2240 

LNKQAACFTIMNVMLYHRKTVMKHIiKLTLIAA 

TKGIQPUKAEEEGGHHHDH^^ 

APQGVSSECTSAKQ^IRQIKRKGIKAVm 

SEQ ID 2241 

ATGCACTACXXXXTOjCATOGTCT^ 
GTGCXXXEGTCGGCTACATGTTTGCCG^ 
AGMGATGCCAACTTTGCCGCOTTTACTO
GATATTCCXXOCTGCAACTCATCGC^^ 
(K^AAGGCGGGCTTTGGCACGTCATTTTOT 
CCGCTTATGGGCAAGAAATATGGGGACGCTC^^ 
ATOTTOTTGCA&Xm^ 

SEQ ID 2242 

MHYALASVFOjSLSAAPVGW^^ 
DIPALQLIMVSGMLITUVIYRPLVI^^ 
nXX^SVLYIiFSVILGKEGGILPKWFKNHRHH'rr 

SEQ ID 2243 

TTGCCMCTTCCCTTTCAAAACTGC^ 
GGGCGffTATOGGTA 

SEQ ID 2244 

LPTSIjSKLQIDffPSLIiLSGLTFRIiF^ 

SEQ ID 2245 

TTGGGCGGTATCGGTATGAGCATCATCGTTGAAAACOT 

CCAACGGCGCAGGAmTCTACCT^^ 

OGACAICGAtt&T^^ 

AACOTI^^ 

CTTTCCmTACCCTGCTGC^^ 

TCXJTGCGCCTCA 

SEQ ID 2246 

LGGI<^IIVEffl/IVSYI«R^^^ 

ERV(MRQFARRQIAHIiSN(£FQRVL^^ 

WCAS 

SEQ ID 2247 

ATGM(XT(OTItMCAGCTCGAGCAAGAAG 
GTCTGCAAGCCTACGMGGCGTGGTTAT^ 
TACTGTTGAGAAAATCGAA6TCAAACXX3CGTGGC6ACCT 

SEQ ID 2248 

MNLIQQIiEQEBIARLNOTPEFAPGDTVWSVRW 

SEQ ID 2249 

ATGCCGTCTGAAAACCAAGTTTCAGACGGC^^ 
GCAAATTCGACTTTATATTTGCCGCCGAC^^ 
CAAGGACTTTTTTGCCGTCAAAGGCGGTITOT^ 
•TTTGCTTTCGGCGQCAJmTAT^ 

GAGGGC^GGTGTCTGGATTTCATAOKXJTCTCOT 
TCGGGAAACCGCTT(^ACGGCATCAAACCCGA 

SEQ ID 2250 

KPSENQVSDGIFQTGLIjRGFIiFFTYTVRQKVDGVHT^ 

FAFGGSHIBGtmCQRQHFFLPDDW^ 

HffiVSGFHTALRDVGN^ 

SEQ ID 2251 

ATGCAATATMGCCGCTTOAGACCXX^ 

SEQ ID 2252 

SEQ ID 2253 

ATGCAAGGAATGATAATGGTCAAACATCTGCCACTC^^ 

AarrATTGAAAATTTATMCTGGTCC^TACGTCGATC 

CAAGGTGCTGACCGGAAAATCCG(?TO 

CATCTCAACCCTGAAATGAT6AGGCTGATGGACGGGOT 

TCGGTACGGACAAGCTSCCGGACAACXI^^ 

CTTCAACTATTT^^ 

GATTTGGCGCGCGGC(^TACCTGCGTAA(^ 

TACCTACGCGCCTTC^GCAAGCOjGCG^ 
CCGGCGGCATTGAAGTTTATGGTOCGC^ 

SEQ ID 2254 

MQGMIMTKHLPIAVLTAUiL^^ 
HLNPBmMDGVDPDHmVPFW^ 

dmgdtotigpggdlniakrraeeaggk^ 

PAALKFMVRQWQDVKAGK 

SEQ ID 2255 

atgcttato&ggcaotacc^^ 
ttgccgacaacagattgggctatatoacgaccgc^^ 
CTGCTGCAAAGCAGCOTCGA!^^
ACATOAGTCTGCC(^CAGGATTCGTTCT^ 
CTTGATAGCGGMTGGCGGTTGGAACAATCGCTGC^^ 
GAGCAACQGGAAATCCAATC& 

SEQ ID 2256

MLIQAWIFPEMFDSITRyGVTC 



SEQ ID 2257 

ATGACAGACACTCAAAACCft^AG^^ 

GGCATTTGGCCMGGACGGCAAAACCGTCAGCGT^ 

CGGTTACAC^TCGAMTACCCCGTGMGCA 

CTGCCGACTGGGGTTTGGACTAC 

SEQ ID 2258 

MIOTQNRVAMOTKGVPGIKGWIiKIMNTEYSK 
SNIiffiTGANDVLKIDGEHGQI LI PFVSQYIETVDTGSKT ITAK?GIiDY 

SEQ ID 2259 

ATGGTAGTTATCCGTTTGGCACGCGGCGGCTO 

CCGTAGCCAACGAAAAACAAGAGCGCGTCCGCCT^ 

C 

SEQ ID 2260 

MVVIRIARGGSKHRPFYNVITO 

SEQ ID 2261 

ATGATGGMCTCAraTCCAAACCG<X^GCACAGCM 
AATCGGACACCGAAAAAAACCTGTA(XIGCGCATATCT^ 
SEQ ID 2262 

MMELIIQTAAQQGYLSMSMDiycmPMIFOA^ 

SEQ ID 2263 i 

TTGGATGATGAOT(XATCAOT 

TTGCTGTOjGCATOOTTGCCGTCCA^ j 

SEQ ID 2264

LDDEFHHFU^PAAAHMFRQCDAKF^ 

SEQ ID 2265 

TTGCAGCGGCAAATCOCXXIOCTGCTCGCXXjAAG 

<aCGCT(K^Ca3ACAAACCAC^^ 

GACCGTCATCCCCGAATACCGCX^^ 

AACACAGGCAAAGCC(XX£AAACCCTC 

C06CA6AACTGCTC6GCAGCAAAA(X6AAACC^ 

COXCTmACCa^a^^ 

CTGTGCAACOmmCCACTGTGM 

SEQ ID 2266 

IiQRQIPPCSPKK^PYWfrrHNAVQFQFGSHPL 
NTGKMQTLHiAAAKMOSAAAYLEHKNPTAAEFIiRHTSEAAA^ 
LCKPDYHCKPHGRQGMPTATSSPSSATAA 
SEQ ID 2267 

TACTTOITGCGGCAGGTTCGGC^ 

CCT(XXXX3TCCAGCTWXX<^ 

GCCTGCTTTATTTTGCTGTCGTOT 

OGAOTCCAATCCGGCAGGTTCGCTGTCC^ 

GCTOTTATCCTCTTTATTG^ 

GCTTTGGGCC^GOTTTTTTTGCCCTGAG^ 

TCTTGGTTTCOCTOTTGCCG^^ 

GTT0GGTACGGTTTTA3TPGCAG1^ 

AAAmCATACTGGmATCGGCACGGmmm 

ATGTTATTTOCGCXXnCATTATGCCGA 

SEQ ID 2268 

LQRLPRLVPPRQVICYHALPSHFIJKFMSNHTSV/SSKIG 

ACFILLSFYSVVG(mHYV^ 

ALGQAFFALSIGVSAMITyASYIiGKDQDMFRSGOT 

KKHTWIJGTAIPIVGIPSALSFGAWGEFKWGmro 

SEQ ID 2269 

TTGCCTTCAAAO^GCC^^ 

ATAAACTK^TGCCG(XXjGCC6GATAGGCAC^ 

TGCTCGAAGGCGCGTAGGTMCGAAATTGOOGTTT^^ 
ATCCAOXmTCaXACGCOTC^^

TTTTGCCCAAATWHITCAACAO^^ 

CGGCAGCTTGTCCGTACCCi^GCCTTTO 

ATCATTTCAGGGT^GATGfTTTA^ 

ATITKX^TCAGCACCTrGCTTT^ 

OTATAAATITK^TACGTIT^ 

ACCAJPTATCATTCCTTGCATAT^^ 

C 

SEQ ID 2270 

LPSNSGCKPSSAKMO^^ 
IHPNPHAIiFRHHHADFFIAARFFGTAPG0VQISAE1OT 

RQfcVRIQSIiFHAFtE 
VINFQYVLVFRRCIffFGERF^ 

SEQ ID 2271 

TTGCACAAACAGTTACCCCAAAACAAACACACAGCGGGTG^ 

CCAACCACATTAITCCT6ATAGGC6CGAGCX3AACAA(^ 
CACCATAGCCGGGCTTC(£GCCTACAC^^ 
CAGCTCCGA(^CATCATCCTCATACAGGACTGG^ 
CCGCAGGCATCCAACTGCCCTCACTCGGACTC^ 
GMCmCKmmGGCACATraGCCACAra 
CACCACAACCCTGAAGAGGATCAGCGCAG^^ 
ACTKCTCGCCCGAtt^ 
(^TGCCGMTCGCTGCACGCGCACGO£CAA 
ATCGCCT<^CGTCAGCTT^ 
CCAAACCCCTCCTCGTCA^ 
CGACATTGCmGCCGCAG(OTGC(Xn^^ 
CACCACCCA 
SEQ ID 2272 



QLRHIILIQDWKI^AABLHTAETAIRKHHGNGLNITAOT 

HHHPEfflJQRHLFSAIRQFTRHTPLIIiHITCl 

IAS^IARPWSAVLM^ 

HHP 

SEQ ID 2273 

ATGAAACTGTTCCAACGCATTTTCGC^ 

(XXX3CACCAT0GAAAGCACATO 

CGTCATACAGGGCGAC^ 

GACCGTTTCGGCGAAGMTACCTGTTCTT^ 

TCATCCTCTCCTraTCATC^ 

MCCCGCATCTOC^^ 

CACGTTTCCCACG^TGCGCTCCCXJGCTO 

TGGATACGCRKXmXX^^ 

TATCGCCCAAAAGAACGGACAGACGGTCGCCCTGTC^^ 

GCCGTCAACTACAGCCCCGAAGGCAGCACX^ATCCT^ 

TOTCACCGCTTTCTACCGTGCAGACTOT 

ACCGAACQGTCTGCQGATGCGCm 

SEQ ID 2274 

MKLFQRIFATFttVIVCAIF^ 

DRFGHmFFIKGVro«AQRLPSPIiFIPGLPLAPIWHEFIILSFIirVG 
HVSHEMRSPIARM5AIVGLIQAQPQKRBQYLKRIiEG^ 
AVNYSPBGSTIIJNIGQIHKHWIinVW 
SEQ ID 2275
ATCAGCCGCGTATTACTCt^^ 
TCCTGAGCGG(mTACGATOTA(m^ 
AGGCGACGACATCGACCGAATCATCGGCTTGGAAATGGGT^^ 
AGCGGCEAACAGAACAACGCACCCAACAG^ 
TGCTCGAA6TCCT6ATGCGCCATGCCGGACAGCT 

SEQ ID 2276 



SEQ ID 2277 

ATOCAAGACATTCCAAAAAACAGCAMCCGGTAGGGAAATACOT 

CATACOCTTACGAAAAGGAAACACCA 

SEQ ID 2278 

^DIAKNSKPVGKYAIK^^ 
SEQ ID 2279

ATGT(XGCrGMACGACOT 

CGaXXSACTTOACOm^ 

GGCX»GCATTTTGGTCGCTTCAS6CACGCCT6A1^^ 

GCACACTACXXyOAGAACGCATCGTTACCC^^ 

AAAACATCATCGTT^^ 

CCCCGCAtfmCOHmATC^^ 

(XtTTCGGACAAATCCAAC^GACGGAC^ 

CTGCCGTCGAACXXXjTCGCCAACGG 

CCAACAAACCTTCCTGATCMTGttX^ 

ACGCTTCTGGTTCCCGCAACC^ 

AATATCCTTTGTGGTAT(TO 

CTCCGCCGCCATATTCGCAGG&TT^^ 

COTATATTTCOK^ACAfXXXX 

TAAAATACC«XXXrTACTCCGCCACCT 

SEQ ID 2280

MSAETWSGARPAAKIiPIYILPCFLtflGIIOT 

AHYGQERIVTIiFAWSLLIGSUjQSCITO 

RSDKSNRROT/SIAAAVFL'IAI^ 

ULVAATLI/TGIAGLLKRSLTPASLFLLCA^ 

RHSANSPMLSFYADFSLVNFALPEYPE^^ 

SEQ ID 2281 

ATTGCACTGAmCOTATCOTTTOTGG 

MTCTAAAGAAAAAAA™ 
CTATCCCGCAGAAAAAATTTGCGCCGCGCT^^ 
SEQ ID 2282 

VNRQTAYFLASFSLIAIilALSLSWELWIAPLRPGGSWM 

SEQ ID 2283 

ATGMTATGCTTGGAGCTTTGGCAAAAGT(W^GCOT 

GGCTTnATCCGCC^^ 
GACAAGTTCCMCTTTCCATCAGOT 

TTTCCAACTOXX3TGGCTG<XX^ 
(KXSCAMTTTCTTTGGTTATC^ 
TCGGftACMTTTTGCTGCCGACTTTTO 
CGGACTGGCGGTATTGTCGTTCCCGCTGGT^^ 
TTAAmmTTAAAGTOTm^TCCG(Xrm 
CXTPTGAAACACGOCGGGCTTTCGCTCGCCA^^ 
OTGGCGAAAATGCOOTCGC^ 
CTGATTGCCGTCGGCGGCGGACTGTAT^^ 

SEQ ID 2284 

MNMLGALAKVGSI/RflreHVI/^^ 

DKFQLSISliLRITPraLLII^SFVGSIl^SYHKPGIPAP^ 

AQISLVmiFASYI^GSVSmYADRMMELPa^I^^ 

LIMIXVLASGPYARQJrcKTFVKI^ 

LIAVGGGIjYFASLAALGFRPFHFKRVES 

SEQ ID 2285 

ATGCCGTCTOAAGCCGCGCCGCACATTM^ 
CCCXXXXX^TCGACXjAAACCGTTAAAGCCGG^^ 
C6ATTTCCCCCTGATTACCGCCGACAOT 
GTOTX&CTGCTGTCTGCATra 
CGATCGAGAAAGCCGGTGCCTACGCCGTGCAAIj^^ 
(X^TTTCGGATACCGCCCXICCCTT 
SEQ ID 2286 

lffSEAPPHIKEHTVNTliYI£SGSP^^ 
VLTAVCIHYRGKTSSRVQTNRW^ 

SEQ ID 2287 

GTffPCCGAAACACAAAACGAACAAAC^^ 

TTGCCCAGCAGCTCTATAOTTTATC^^ 

GGCGGOGTTTTTGGTTTCACTGCCGCATAC^^ 

TTTTTCATCGGCATGGCATTTGOT 

TCTTAGGTATGTTCGTTGCGfFTCGGCACG^ 

coKXXBTTTffrc^^ 

(OTCAGAACA03ATG6C(a^ 
SEQ ID 2288

VSETQNEQTVQPLVEHLIELRRRI^^ 

I^IGMAFAYFLVFPVIPKPIjAGVTPVGVNMATDIDKYIjSFI 

RSEQDGDIQPPAKT 

SEQ ID 2289 

ATGTTTGATTT(X3GTTTGGGC6AGCTXa 

GCTTTGTAG^GCGTCAMCMGAACTTO 

TMG(MAT(jCA<&ACAGOT 

ACCKX^AACACCGTATCMiAG&CAffl 

ACAAAGACCGCCX^TGGCGGGAATACCTGACTGCTT^^ 

GCGCAAACAGGCAATAAACCGCAAACGCGATTT^ 

SEQ ID 2290 

MFDFGLGELIFVGIIALIVIiGPERIjPBAARTAGRLIGRI^RFVG 
TANTVSDGI SDVMPSERSOTSAETLGDro 

SEQ ID 2291 

ATGGGCAGTTTTTCTCTGACGCACTGGATTATCGTA 
AGGGGCTGMCGAAGGCACAGACGGCAAAGAAGCCXIAAAAA 
SEQ ID 2292 

MGSFSLTHWIIVIJrVVLIFGTKKLRNVGKDLGG^ 

SEQ ID 2293 

GTGTTCGATTACATCGTCT^^ 
AATATCAAAACGACGATAATCAGTAO^TAATCCA 

SEQ ID 2294 

VTOYIVFLGFFAVCAFVQPLFEVMNRTAEW 

SEQ ID 2295 

GTGCTTTTTTCAGACGGCAACGGAGCCGTTATGGAC^^ 

TCAACCCCGCCGCTCCXCTK^TCT^^ 

CGCCAAAGCTTCGGGACTCy^^ 

SEQ ID 2296 

VLPSDGNGAVMI8jCIFaQAAKEIPAQTVYEIX3EM^ 

SEQ ID 2297 

TTGCXXXAGAGCAGCCGTTPAGCCTATCATTCCGAACAA^ 
ACXXXSTCCGAATCTTACGTCGCACAGCTCC^^ 

AAAGCOGCCCGTACAGAA1CC 
SEQ ID 2298 

kPESSRIAYHSEQTETNRKIGMSD 
KAARTES 

SEQ ID 2299 

TTGTTAATCXACTATATTGTCCGGCATGGCXIAGCCTO 
GCCGCATC^TCCGO^mCGCTCATG^^ 
TGGCAAACGCCTTGTCTTTCATCGT(^ 
ATCGACCAGCGCGGCAGGCA(^CGTTTGCTTTMGCAGCX5 

CCTGAACCGTCGGATCGMOGOGCCTT^^ 

SEQ ID 2300 

LLIHYIVRHGQPNFKPPAFGNRPyQRQIHHGKRRRQGIGNRW 
IDQRGRHSVCFKQRNNLVRDIALRD^^ 

SEQ ID 2301 

ATGACAAGACATTTGAGAGACATTCAAAAAATTATGAAAACCTtt 

ACGMBACCOSAG^^ 

C6GCAAGCACAACGACTTGGAAAACC?rCGGCTA 

GCATCGTCCGCATCGGCGACAACAAAGGCGCGAMTA^ 

CIX^GCGGCATTCCCGGCAGCCCCGMGA/^ " 

TCCGTCGATACCGGTATGGGCTTGGAACGTATGGCX^^ 

CGTTCAGTATGGAA6AAC(XAGCCTGAAAG1TATCG^ 

ttTCCGCCGOXXX^^ 

GCCXJU&TCGAAGAAGCTTOAAA^ 

TCTTCAAACTCTAO&TACCTACG^ 

CGCACGCGCCMCCCAAAGCTTC^ 

GGC(^GCAAGTCGTCGAATTGAACGAAGGCGACAGCGGTG^ 

ACC(XTrTGMGffAC(£GATAC(X^AA 

GACATTTCCCATCCCCMGCGCT^^ 
AAAAAACCGACGCCAT(^T6CTCTTC6G 
CGGCerCTTCAAAATCATCAGC^ 
ATT(XXX5AAACCAMGCCaJU^ 
C^GGCGCCAMCTCTT(^ACGACG^
CAACGCCGTGATTCTTI^^ 
GTCGGCtmAAGGCGGOm^TC^ 
SEQ ID 2302 
WHfflljRDISttHKTSE^ 
EFLTSPEWIdttPKDKUiAOT 
SVEO^liEEmVMQHVHSNYEIDIJ 
AQIEBALKNEESRFAQTI^^ 
GEQVVELNEGDSGAVVIDFTPFYAESGGQVGEVGn 
DISHPQAVTAEEIAEVBRRVNEAIIiANVAVNAA^ 
IAEmOTEKDVLAKIQAGAAH^ 

SEQ ID 2303

ATCAAAATGCCCTACCGA!^^ 
AA606GTTTTTT6T6TAAGAACTGCTATAATA6CCG^ 
AGAACCGGAAATGGTCAAGAAACGCCCTGTCG^^ 
CACAAMTCAGTGMMCC^ 
GTTGCATCCCAATMTCAGTCAA6AGGATTTTCA 
SEQ ID 2304
MKMmS<^PMPHPM^ 
HKMSENPliPDKHUQCRAKO)^^ 

SEQ ID 2305

ATGAAGTACCTCCO?^^ 

GCCTGATTAAAGTCAAACCGTTCAAAOT 

ACXCGCTTACCTGMGCATTGGATGAAAAACGGCAGCCGCAGCT 

CMGGCmCGCCTGOX^CCATO 

GCGGAC6GAAAGGCCT6CACGATGTCGGCAAAGAC0SCC(KMCT 

CAATGCCMCCKmAATGTTTOC^ 

SEQ ID 2306

MKYVRLFFLCTAliAGT^^^ 
QGKRLmDEWEFAGlASA^ 
NLGFRCASR 

SEQ ID 2307 

TTGTGTCTTMTCCTGCCTlTrGTGTT^ 
AAAACAAACTCATTGTCTTTGTTTGT 
SEQ ID 2308 
LCLNPAFCVSGLSRYNHHP!jlMFVFFS 

SEQ ID 2309 

TTGCG1TTCGTTGGAAAATG0CGTCTGAG0GTTO 

AGACGCTC6AAGO?rT(K» 

CAATCGTGCCGCGCACATCCAGTTCTTTCAT^ 

CC<X»TTCGCG«3CTT^^ 

C6ACATC6CCTGOTTC6C6CCGCTGCGTACATA6GCG^ 

GACGGCX&TTTOTOX&CAGGCC^^ 

TCGACCACGACGTGTTGGCCGACTTTM 

GACCGCAGGGCGGGATGAAAATCGGGCCTTCCATAMTTO 

TTCGAT6CGGATCTCGCCTTTGTTGTAAAAA 

AAGCC 

SEQ ID 2310 

LRFVGKCRLS^DGICAVANVIC^WIH^ 
IQHFVHAGTPERHVHAFAVGQFFHFFNNIDFGRVEDWGNAG 
IXH)FPGQAAAAAAQADKVHVFGEVVVAAF 
FnADVAFVVKTCCFHCNATC 
SEQ ID 2311 

TTGGGTlTCGGATGraMGjAAGGGAAT 
CTGCTAGATTCAGACGGCATTCCTACTTGAAAACAW 

CCCGTAACGATCmCACGAGTTCTCCGOT 

aX5G0SAft^GCAMSACCACCT^^ 

CAAAATCCCGTT(&ATCM«^^ 

TTGCTGTTGGCCGCCGTGTTGAAAGCCAAAGG^ 

AAGTCGATGTTGTTGAAGAAGTGAAAAAACT^ 

CAACTTGGTTATCOTATOATCTG^ 

AAACTGGTCXSAAGAAGGCAAAATCAACCTOG^ 

TTATTGTGAATCCAAACCTC 
S<™^mKYIJ{RFHRKRSGmPPPFQPre
[IVH^INLEPFITCRIKIiDKLVSBGFERLIHHMESAVKIIVNPHIi 

GTAACGGTCGGATTGGG 

SEQ ID 2316

IjGIPAKVSRDVFKHAEGPLPASGQIMMPSENEIGFREHLYEHGRIQ 

ATCCOaTTCGCCCGACCtXXTCCTCTGCAK^GCA 

OWfflGCAWflMCIGMGC^ 
TO3QCSGTTCAGQCft6TCGATATGGTCATG^ 

CcromTTCJVA^^ 



TCCGACCGTTACGATATAAATGCCGTC 
SEQ ID 2318 



^ >HMMWT itnTn t^vniim o»i.iifte*iiwTMfT.ifii/wc!^aiT^YQ|(xliJPTKKPFAKPKTGQVI^PTy 



SDRVDIHAV 

TGCGATTGCCGCAAaAGTCGAT 

SEQ ID 2320

HHQAVAQPAPLVLMWFmiHRPffi^ 



TATGAT 
SEQ ID 2322 



YD 

rreGACAC«^K^rGC«^ 
(XXJTATCGKCCCOGTTGATGCAGAGKX^^ 

CTBCKAAACACGTra 



U^TLTAL ^ 

CAA6ATATGTACGTAGCATTGCAACTCCT 

GGTAOTG^^TTGTCAACCTT 

GTACCGGTT 
SEQ ID 2326

LPKHLIFWIVBUHTKTELQKLI^ 
VPV 

SEQ ID 2327 

TTCTGGAAACTGTATGCXXa^ 

AAATCOCnTTTCCCGAAAAGGGTATGG^ 

GGG(£AAATCAGGGGGTTTGATACAAACGG(^^ 

TCGGGCGGCTATGGGACGGTGATGGACAAAGCGGGCAGGCTT^^ 

TTGCCGATCCCTTGTCXaACGGCG^ 
SEQ ID 2328 

imYMJYFMRPDAETPPENSVKETIiKRVGrc 

S(X^GTVHDKAGRLTHLFDFRTGVSTF^ 
SEQ ID 2329 

TOSAACCCCACCCTTT^^ 

GCECAAGCCTT(X»GTTGGAATTCAAAA 

TTTTGTACCCTGTCCGA^ 
SEQ ID 2330 

LEPHPFGGVG1XLFQKRQNPSLPGKTBRQHPEYAALRVOT 
SEQ ID 2331 

ATGATGMCCGTTATCCTITATGGAAATO 

GACAAGCaTCATCATCAACGMCAGACTCM 

CACAGAMC(K^AGOTAAAGCGCGCGACGTTATCGAAAACAC^^ 

ATGTTTTTGGGTTTCGACCTGCGCGGC^^ 

AAAAAATCCmtmCACGGTO^^ 

AAATTCAGAAGGCAGCAATATCGTCTTGA^ 

GCAGAGCCG&TTAICCAACAGTCAC^^ . 

TGGAGGACGATCCTGCCAA(?ITOXjCGAC^ 

GACCGGCGACAACATCAACGATGCGCAACCGAGT^^ 

GGCAMCGCATGGCGATGGTTTTGATCGACCMGGAAAATC 

ARGCCAAC<&TACCTCTrrQCTOT 

(foCTOATGGGGTTTTGCCATCGTTG^ 

ATGCAGGCMCGTTGACOTACCG^ 

CGCAGCAGGCMTCMTCFCGGTTTCCAA^ 

GTATGGMACCTGAAGCCGAAA^ TTTCaTCQGTGOG 

SEQ ID 2332 

MMOTYPLWKYLLIVFTlAVAAVYSLPNIiFGE^ 

MFLGLDLRGGVHFTMQV^^ 

AEPII(X}S6IiDEIVVQLFGVQDTAKAKDIIGBTATI^ 

GKRMAMVIJDQGKSEmA 

MQATLTLPGMAAU&m^ 

VWKPEAETAAGKE 
SEQ ID 2333 

ATGGAACTCTTTAAAATCAAACCX^ 

TCTCTGTCGAATTTA(XXKmrPAC« 

GGGTACGMCAAACACAT(^TGATCCGCCTGCCGAA 

GTCGAATTTATCX5G(XCGCAAGTCG^ 

TATCTGCCATTATCGCCAATATGQCGACATCGTGATT^ 

GAACGAATCCGTCGKXyrcTTCG^ 

ATTACCCACGGTTCGACCGAGGCGATGGTTGTTTC^ 

TGGTTGCCAGCCCGCTCTTGCT^^ 
SEQ ID 2334 

MELFKIKRDIPFMSYQO/rm 

VEFIGPQVGmVNNGUOIflFV^^ 

ITHGSTEAMWSMLVFG(ytftffl^ 
SEQ ID 2335 

TTG(XX>TTGTCGAGGC1TCGCAAACTTGTC^^ 

AACGTAAAGAAGGCGACACOGGCTCTTCTGAAGT^^ 

GTTGAAAATGGTCAGO^^ 
SEQ ID 2336 

LPLSRimVRIPEGVLPIPRSFSIMALTVE^ 
SEQ ID 2337 

ATGGCTGGTGGAGTTTAmAGGCGGAAAAATTATTAGCCCTATAT^ 

MTTTTTAAOTTAAAACGAGTTTO 

MCTTGGCAAGTTGCTATAGAAAAAGCAGCAAATTI^ 
SEQ ID 2338

MAGGVYI^KIISPIYHSSQEFSOT 
SEQ ID 2339

ATGGGCAQGCATTTCGGGCQCAGACGTTTTCTG^ 
AAAAAOGCXIATCAMATGTGTm 
AGTCQCGCGTTTGGAMAAATGTT^ 
GCX3AEATTCACGCGC 
SEQ ID 2340
KGBHFGRKRFLTAAAYA^ 

Aim 
SEQ ID 2341 

ATGGGTTTGAGCTTGTC(KjAAAGCAA^ 
GttrTGGGTftOG<X^^ 
GGAAAGTCAACAG(K&AACT^ 
AATAAAAAGMCCCCAAATCX^TTCGGACA^ 

SEQ ID 2342 

HGIiSIiSESNRKENT^ 
NKKNPKWDSDKFAKPRQRQTPYS 

SEQ ID 2343 

SEQ ID 2344 

SEQ ID 2345

ATGCGGGGTTCTOAfiAACOXTTm 
AAAATCCGTTTTGCCGCTAOT 

CAGGGAAAAATCATOTT6ATGGTCAA6ATATTACCAAACTTC 

OTTCOSTCTGAAACAGGACAAAAT^ 

TCAGCAGCAACGTATCGCCTTGGCGCGCAGTTTO 

MTACGCTGGMCAAGTCGGCGTMCCItX^TTATGG 

CCAGCGACGTGTAOGACTATCOCMCAGCCGC^^ 

TTOGAAMCaCGTCCGTATCGACCATOTTO^ 

TACAACTGGGCGAAAGGCACGGTAAAAGAAATTGCCTATT^ 

TGCffTMCATCAOXm^ 

SEQ ID 2346 

MRGSLEPLFISCCPCASQEFQTASDVATP^ 

QGm^DITXIAPYDRPIN^ 

NTLEQVGVTCIMVTHDQEEAMTC^ 

VIWAKCTVKEIAYLGSFAHHIK^^ 

SEQ ID 2347 

ATGAACOTAATAAAtt^ 
AACCGACTCCGTTGTTCCGT 
SEQ ID 2348

SEQ ID 2349 

ATGAACCTTAATAMCTGAAAAACAAACTGTTCCGCC^^ 

AACCGACTCCGTTGTTCCGT 

SEQ ID 2350 

MHLHKLKNKIiFREFGQ^ 

SEQ ID 2351 

ATGAACCrrMTAAACTGAAAAACAAACTCTO 
TCAQCTOGCCGAACMGAAATCGCXATTO 

CCGACCGCCTATCOGATTTCCCG^ 

GGCATAACGGCATCATAAACAACTTAT^^ 

GATTCTGCCGOTTfcCACtK^^ 

GGCAmTTGCAGCH'TCGATGCTGGTT™ 

ATCAAAACAACTGGCCGTOGCI^^ 

SEQ ID 2352 

Mm^IiKNKimP^RAmVPyiWUiVIM 
PTAYAISRANPSVRNGLIiLAIMLPFWTSFIjIA^ 

GI I AG SMLVFVPAVGEFVI PELVGGSENIJCGKVLWQAFPDQNNWPLAS AVAVVM^ 

SEQ ID 2353 

ATGCAGAMTCCAMTCATCTTC^ 
TTTOGGGCGGCTTTTC^^ 

CTGATTATTCAGGTGCAGATATOITTG^^ 

ATCACTTTGCCTTT(^^ 
ATTGCCGCAGGTTATTTTC^^ 
CKXMCCTCAGATGMT^^
SEQ ID 2354 

LIIQ\^IFLQGSEWLQHLYFDRGFFTI 
LDPQMNVLATILIGIIGTLVIIVNYW^ 
SEQ ID 2355
ATGAAGJ^ 

CACTGTTTTACCGTAAAATCCGCACSOT 
A0GCGGT(X£G(XXXX3GCA^^ 
AAAATCATGOCGGACGGTATTGTTTATK^ 
ACATTAACGGCATTG(mCTTTTC^ 

SEQ ID 2356 

HKITOCKLKKEVQKEPLRSFVPEVTA^ 

KIMPDGmAIEPGSRGKLDAGGFTRCR^ 
SEQ ID 2357 

ATCCGCCACGTOCTCGACAACGOT^ 

MAAGCT(^TTTGCGCTCGATMTCCACGAG 

CGAOTGGGCCCGGm^ 

ATTGTGTCGGCGTGGMTCCGGCTTTGGTCC^^ 

ATATTTTCCTTG(ntn^ 

CCGTAACCATTftmCAGGC^ 

GAAGOTACGATCCGCATCCGCACATC^^ 
SEQ ID 2358 

MRHVLDNGTDKSDRTGTGraSWGY^RFDI£^ 

IVSAWNPALVDQIALPPCHALFQFYVADGKLSCQLYQRSADIFI^ 

BGYDPHPHIKAW5V 

SEQ ID 2359 

ATQACTGQGTCAGCCTCHTFTTTTTC^ 

TTCAA6MTATCTGCCTTCTTATTATTTO 

TACCGCATTGCCGCTG«X3GAACAC^ 

GGG6AAATO6AAAAACAGGTCGSCTTG6A« 

GCGGTTATQCCACGGTTGCCGTCXXSTCX^^ 

ACAGCAGCTTGACAGCGATATGTACCAAGGCG^ 

TTCGAGCAATOECGATCACGCGCATCGAA 

ACCCGATATTCCG<mTOGM(X^^ 

AAAGTTTTTCCGCAGCTTGCCGATGTTAGM 
GTTA3TCCGGGCACGGGATGGCGATAACGGGCATTGC^^ 
GCAACCGTTTTTGttX^AACTCGGTTCTO 
SEQ ID 2360 
MTGSAOTFSTDFLFGKyGPTVCIIPS 



FEQSPMTRIEPYQNGWLmPEGSVECKNVravmVGL^ 
KVFPQLADVRIByS^BCDITANLVPHFGWiTa 
SEQ ID 2361 
ATGCGATATTK£ATATTCAAACAAGG(&TTT^^ 
TCCXSAAAGACGGGCCCAACCATTAAGGAAACTT^ 
CGCCGTCMTTCGGG(X^TACCGCCTGGGT^ 
ACGATGATGCACAGOTTOC^ 
GCGGGATGCAGATAGACGCTGCCGCACGGATGCTGACCX^ 
TACCEGCGCGTTTGCCGMCGMTGAAATAI^^ 
GGCGGCGTATTGGATTATGCCGGCGCTACGGTCGT^ 
ACAATATGGCGAT(^CGCTGATTGGCGCGGCAATGTTGTO 
GTCGGCCGTATTCXSGCGCGGCAGGCTGGCTTGCCI^^ 
GGCTOAOXXmXXHCGGGC^^ 
TACAC(XMTCGGCGGGCTGGTAGGCGGAAT^ 

GAACACGTGGAA 'KnECGCaXXZK^^ 

SEQ ID 2362 

MRYFHIQTRDLFHAQOT'SA^ 

TMMHSFSIATLVGILWAVGYSLAF^^ 

GGVLDYAGGTVVHINMIAGLVAALVIiGRRIGYGREAMPPHH^ 

(ffTGPSGAAAIGILTAAACFVSVTVVKHm 

EHVE 
«Km«3GCGATMGTCAAAAGATWTTGA
SEQ ID 2364

. MOT 

SEQ ID 2366 

y^UiHlIPTGMENRCGSLIQGCAEPRPFLTKFYFIiFPIEHP 

ACCftCCCGGMGTATGCCGCCATi 



SEQ ID 2368 



QSRMDDmEEFWSTGNMELH^ 

SEQ ID 2370

LLAISALFHSKSRRENKIIIilGlKKSSIiTT 

SEQ ID 2372 

W»C»YSPFPEWirawn3IUnCtlA , reA0KVlAQ 

SEQ ID 2374 

VGIQDAGETVIiSDKFSHEKVtflPALYDALYAjaiTGF 



ATTATGATGT00GM3CT 
TO3AAAT(M&TT(XX3TTCGTCC<^^ 
ENPMLGFRGAARYVAESFKIKFA^
SFDERNPAVKVMLHLAI SACRKQNRYVGICGQGPSDHTO^ 
SEQ ID 2377 
TOTCGGCCATTTGTGTGTCCAA^^ 
(HCIGAAAAATAGCGGGT^^ 

SEQ ID 2378

LSAICVSNPKYVKKETNPLAYFKRFTO 
SEQ ID 2379

ATGAAGATAACACACTGCAAAmAM 
CACTtnTTTACCCTAAAATCC^ 
ACGCGGTCGCGGCGC^ 
AAAATCATGCCGGACOTATTGT?T 
ACATTAACGGCA3TOX3MCTOT 
CACACCGTCCCGGCAGCTAAAAATCC^^ 
SEQ ID 2380 

MKITHCKLKKEVQKEPLRSFVPEVTARSAADILGIHPK 
KIMPIXSIVyaDSP^RGKSDAGGmCRlN^ 

SEQ ID 2381 

GTGGTCX^GAAGGTCAATCGGAGCCGGCAGGAAMCGGTCAGC^^ 
rcTTTGAGACTTTTTTGGGACTGTTGG^ 

GCTATCCGTCXXmAACTATCCGCm 
AGGCCATCCX3TCAGGAACGCCGCCXXMTTCGGOT 

SEQ ID 2382

VVHCWRSRQE^RP^ 
GIRAANYPUPIELESADLPRM^^ 
SEQ ID 2383 

ATGGCGTTTGAACC&CAGATTGCCAAA(^^ 

TTAAATACAGAATAAAAATATCATCA i 

SEQ ID 2384

MAFERQIAKLVQQSLADM 

SEQ ID 2385 

ATGCCGTCTGAAACOTTCGTCCC<X^ 
A(aCCCAACTCGCCC<mC^ 

CTATACCGCATGGCGCACX^GMTATCTGGAGGAATACGACAGCCGCT 
G6CATCA3t!ACCAACAAACCCATGCGCT^ 
AACCCATGCTGCACGCCT^^ 
ATGGGGCTATATTTCCGATGAAGACGATACXXsGCTtt 

SEQ ID 2386 

MPSETFRPAIiTOGKmaQAVLPDIiDGTI^ 
GIITNKPMRFTDKIjVPKLGFAVPPATWSGDTC^ 

SEQ ID 2387 

ATGAACCCCGATATTTATGCTTTGCCG^ 
ACGCGCCOXXXXXK&C^^ 
GATTACGCATATCGAGTTCAACGCCGTCAATCTG^^ 
TTCCGOTGGTGCGCGACAGATTGC<^^ 
TGGTACXXXXKGTGTTGGAGGCGCGCGC^ 
GGGACACGTCGCCAIKXjGCAACCGGTGOT 
GTGAACATCGAAGCGCGGGAAAMGCGGGOTCA^^ 
SEQ ID 2388 

MNPDIYALPHIAIjLSGDPDEK^ 

FRLVRDRLRAFGFDYGSF^ 

VyiEAREKAGFSRFKfiDMTtKDFEQGLKQNKHAV 

SEQ ID 2389 

ATGTTTATTTTGTTTCAAACOTXXJT^ 

CGGGCMTC^CTGCGGMCAGGGCGACAG^^ 

CGOECAGGTTGCCG^^ 

GGTTTTGTACGCCATATC(X!A^^ 

ACCCAGTCGCGGACAAACKSAAAGt^ 

CXXJTOTCAlTniG^^ 

TA5PTA 
SEQ ID 2390 

MTCLFQTLIiEIFQHIQFEAAEARFFPRF^ 

GPVRMPQMWRLCTAVIEABSAQSVA^ 

G(3®IJRQPSFnGVARKQGAFRQSIinGVHIjKSAVFGKRIL 
SEQ ID 2392



TO3TCQGTACTGACC6ACGAROCCTAK ICCMC STTCGCCCGAATACCTCAftAI AGGCGC "7^Z^W1 
MCAGGCGCQCGCATGGGGGGCGGATGCCGTCCTGCT^ 



TTGGCXXMGMGTGGGCAAACTC1TC 
LBGKTVVTESGITGKADVKFMRARGVHTM 

SEQ ID 2396 

MPSIJIflKLLTLFFPYFEYMiFIADNQPIIIN 

GTTTTATCCGA 

SEQ ID 2398 

IjEKQKUiRRYSREGRNI^SVQCPISFHSRESGHPGCRGNRPIR 

CXOCQ 
SEQ ID 2404 

LALGAARFAIiBG^^ 
MTTCCGGOTOSGCGTGGMTAAG^
GGCAACAAOTTTTOTATTOK^ 
MTGCGAAAATGGAATTTMTAGGGATATAI^OTATC^ 
AMGAAGGGCATATCATTTAAGGTTTATGCTGTTAAAGT^^ 
CAGGGGGAATATT 
SEQ ID 2408
HHLSIWPnWESnriiB^ 
FLNAVKYIKGRVSYFGEDYQPEKII^SKVEV^ 
KAKMEFNRDISmYSFLSYBYLVLIJm 

SEQ ID 2409

ATGAACGCAATC^AOTT(XAAAAC^ 
OXXXGTGTTGtECGG^ 

ATGATCGGCGC^GGCAGCCTCGTTCC^^ 
ATTCCGCCGCACATTATGTGAAGCTGTCGAAA^ 

SEQ ID 2410

MNAIRTFQNRTPEIHETCMIDEACTO^ 

MGAGSLVPPftKRLBGGYLYIGSTOQW 
SEQ ID 2411

GTGATOCACXTCCCGTACTGTTTCGACA 

TAGCCGCCCTCCAMCGTTTGCGCGGCGGMCGAGGCTO 

TOGGCAGCCGTGCAGCATCACTTTGTGOT 

ATTGCTGCGCGCGCCGACGGTGATGCTGTK^^ 

CAGGTTTCX5TGGATTTCGGGCGTGCGGTT^ 
SEQ ID 2412

VISHPVLFRQLHB^IFQX^ . 

SEQ ID 2413

ATGACTGACCTGAACACOTXjTTTGCXAAOT 

AATACACCCAGCAAAGCCTGCTCGAACGCATCG^ 

GAGTTCCacaTCGG(X:CTTACAAAGGC(^ 

GGCGGCGGCAAAGGCGGTTCCGACTTCGAC^^ 

(^GQXXXX^CATCGGCGTAGGCGGACGCGAAATCGGCTA 

CATCC^CCXX^GGCKSACC^ 

TACGCCGCTC&AAMGCCAT^ 

TGAAAGAAGTCOGCCGCGAACGCGTTGCCACCTACGCCAA^ 

CGMTTGGACGAAGAAGCCGCCAAAACCCTGTTO 

TACGCTCCGGGCAAAGCCTOCAACGCCGGCGG^ 

TGCAAAGCATCCACGAGTCCTGCCTGAAATACGGCAA^ 

C 

SEQ ID 2414

lffl)IJmiPMMQRNPNQEPFHQAVE^ 
GGGKCXSSDFDPKGKSDAEVMRFCQAFm 
YAAEKAIQI^VLWSDSDGF^ 
YAPGKASNAGGVATSGLEMSQNAIRLSV^^ 

SEQ ID 2415

ATGCCGTCTGAGGGCPTC&GCATITCAGA 
CACTAATTGACAAGCAAAAA 
SEQ ID 2416
HPSBGLSISDGIFIXjKRREGFSbPINNRK^ 

SEQ ID 2417

AAACCGGCAMTTCGCCACCTOCGACTO 
<^CACCAAGACTTCT0(X5A^ 
CGCCPGCXmTACGCCGATlOT^ 
GGCCGATOGTATGGGCAAAAACTTCAAGGGCGTGTA^ 
CAATCCCGAATTGGAACAAOX^^^ 
CCAGTGTTCTTCGGC7CTGCGATTAACAA 
AfiCCGAAATTTTCCGGATTTATCTTTAAAA 
OTGCGTATCAACCGCEAAAT^ 
CAAATCGGCGACAGOTCTCCX^GGCGAACAACTGGCGTO 
MGGTTTGCMCAACTCGGCGAAGAAGGTGCGG 
CAACGMTACGGCGTGGAAGCXXn^GTTCGACAGOG^ 
ATCGAC(^(X3CGGCAACCTO^^ 
SEQ ID 2418
VRmTFMISHPDAGKmTEKLUiFSGM 
RIiROTPIVTFKNKYDREVRDSLEIiLDBVEDI^ 
PVFFGSAINNFGIQEILNSLIDTO 







QIGDSFSHCEQIAFTGIPFFAPEI^ 
imcm^YLAmVNIiGLTQ^Dr/FHETEKSVKIi 

SEQ ID 2419

ATGATGGCAAACGTGCG<XX3GCGGCGCAC1TO 
GTACAAAAAMT 

ATCACAAT 

TTCGCCCTTCATCCAT 
SEQ ID 2420

FALHPIJLARTAVIVGKAGFDGFLQGPGVHPRHH 
SEQ ID 2421 

■ITCOTKXXKXATCGCCm 
(ECAAAAACAATCGATGAAGGGCG^^ 

SEQ ID 2422 

LVCAIJ^DAETKRVLMV^ 

SEQ ID 2423 

ATGGCGCAAACCAAACOTTTTCGTCAAAT^^ 
TAGTGCGTTTGGCTTCXXX^^ 
GGAQGCGATGACGGGGATGTCGACGGCSTO 
TGCATTTCGACCGCCC^^ 

TACGCCSXGCCGACGGTCAGCGGGATGAAGACTTGT^^ 
CCTTOGOOGTTG 

SEQ ID 2424

MAQTKPPSSTCjTASSRFLSIFPFRRHSI^^ 

CISTAHSTASKPVGLRPPWVKISQR^ 

PSPL 

SEQ ID 2425

ATGGCACTGGCAAMCGCATCATCCXX^TGTCTCGAOT 

ACG(X£AAGGCGCGGACGAATfl^ 

CGGGSCXX^ ACCX&CCOT 

ATGCCAAAGCCGTCAATCCCGAAAACA^^ 




(^TCGCCTCCGGCGGGGTCGGCAATGTCCGGCACC^ 
AAACGCACTATGCG(XiAAG0C6GCATCGAAGTG0GCX^ 

SEQ ID 2426 

MRLAmiPCLDVKDGRVVK(W 

GSQAIVAAVDAKAVNPENTE^IFTHGGRNPTGI^ 

KRTMREAGIEVHL 

SEQ ID 2427 

TTGmAMTCGCAGCCATATOCAAAAAAG^ 

CCGGAAAAAATACCAAAATCTOTPAAGCGGCGACC^ 

OTGGACGAG(^GTCAMGACGCXTT 

GGCAAAGTCAGACGCTTTGOCCGCGACC^^ 

CCCAAAACAOGCGTTmATTTCGT^^ 

ATTCGCCACGCMTTTCACACCGAAAAAAK 

SEQ ID 2428 

LLKSQPYCKKEEEAMQTA^ 
GKVHHFARDLRDPQGCRLKVPHMGWNTVR^ 
SEQ ID 2429

AJTCACGCACCKACC^ 

GGCCGGGATTGAAGCGGCG(X^AGTCMCAGCGG<^ 

ACCGACCGCTTTTTACGTTTTCTCG^ 

TTGCCX&MOTATCO^^ 

TGACCGCACGAATATTTATTAIK^GGT^ 

AAGGTTQAAGATGCGGCGCAGTTTTTGCGTGAAMCGGAl^ 

mTCGTGGT6GC6ACCGTGGC6¥T^^ 

CCGGGACGGGCTGCCTGCCGTGAGTTCGCTGOT 

CTCGATGCCATGCTTira 

GGTTTGAOGGCAOGGTGTTGGTGCAAAAAI^ 

CMCCGGCACGAGCAACTGTCCACATTCGQ 

CAG(XATT(£AACTGACCG^ 

CCGAACGCGMGAACGCCIX?TCG^ 

AATGCCGCAAAACCTCAACG^^ 

GTCATOCGTCCGCAAACCXjAACGCGMCAA^ 






CGGCAITCGATGAMCCGCCAAACACAAACGCTGCC^ 

CGCGCCATTGCCGCCAAAC^ 

TC 

SEQ ID 2430

MTHRPTARQIUffiVPGYPEFRGRQEDVINTL^ 
TDRFLRFLDQQTVSLFAIDEAHCVSRWGHDFR^ 
KVEDAAQFLRITOiNAIPmGLSI^ 
LDAMLSVCETMCRRVLLLKHFG 
QALQLTEAAKKmGH?TEVMLRP^ 
VIRPQTEREQQIjRQKLEAWRY^ 
RAIAAKQPEGLAEIjAAVYGVGEEKAARYGAAVIAVLEK]^ 
SEQ ID 2431 

TTGOCGGAGOTA(^G(mT(X£ 
OTCGTCCAATTCGTTCTCGGTCGC 
GCGGACTTCTTTCAATTCGATCAA 
TTTTCAGCGGCGTATTra 
CCPCGCm^mGGCTCXX^CCC^ 
GATCH^CCCXXEGGAACATCGGTGTCCGCGC^ 
CCTTTGCCOXGCCCATAG^ 
(XsATGGCGGMCTCATTTGCACGGGGTAi 
TTGCTGGGTGTAT1TCGGAT 
TTCflGGTCAGTCATT^ 

SEQ ID 2432 

VDALHnAEQSAWFARPRQAI^^ 
ADFFQFDQGGELRFGHTBVGEED^ 
DVAGRNIGWADVAVEF(HKGIAEAHYFGIGFAF(^ 
LWWRIPCQKRIRAHEDFFNRIJ^^ 

SEQ ID 2433 

TTGAACOTATGCCGTCTGAAGACAGATTCCCGT^^ 
GCCATTATCGATTACGGTATGGGCMCCTGCATTC^ 
GCGCCGACAAAGTCAl^Ti"i<XX^GGGCAGGGCGCGATO 

CTGAAAGTCCCGCATATGGGCTGGMCACCGTGC^^ 
ATCCCGAMCCATATTGGGCGAMGCGACTACCCGTCCC^ 
GTOAAAAACrm^AAACTGGTAAGCCGGA^ 
TGGAAGAGGCGACCGTCTTTTCCGATT^^ 

cxrrcxxxKmoxmGACATCCTCGcra 

ATTATCGGCACGGCGGCAGTCAAAAACOCCGACTTG^ 
CAACCGTMCXGGGCATCATGTAATTGATTO 

GGCCGCGCGATTTAK3AGG(£AGCATCG^ 
SEQ ID 2434 

LNPMPSEDRFPFSDGIFISPAHLFVKIM^ 

LRRCATFIRPQRRRKHRMiGLVRRQSQTK 

VERLFKLVSRTRPRTRWCTMIiLIPAID 

IIGTMVKNPDLVmCKAFPGRIIVGLIMI^MAAIDGWATTO 

GRAIYEGSIDFAQAQQIiADSLD 

SEQ ID 2435 

TTGmOTATATTCAACCTGCtXmXX^ 
AAGGATATATTATGGCAAAAGTACTCGTCGTACCOT^ 

ATCTATCTCGCAGGCAAAAACGTCGAACTGGCATTGTCC^^ 

AGTTCTTOGtEGCGGOGCC^^ 

GAAAGACGTATCCGTATTGGCAGGCCGCGAAG^ 

ACTCTTTGGAAATCATCGATCCCGOTCA^ 

GGTACTCGGTACGATGATGATGGCGCAAAATGATGTO 

AGOTO3TAT0CAGCGTATTCOT 

ATI€CGCAAMGCCTTCGGCATCGACCCGAA^ 

AC(XXra^CCTCGCCATCGAC^ 

TGGAAGACATCGTGTTCACCATCGCOT 

SEQ ID 2436 

LFAIFKLRAPSQTRHARHAGSRFCPYAAPSNffl 
IYLAGKKVELALSLDAAAWAVRSDNTDAD^^ 
EPRTVQAAAICH2KGIARCVLLAKREEVEAVAKEM 
SLVSSVFmiPNQVLVFGIX^VNPNPTAQQIADm

SEQ ID 2437 

ATCACCGCCGCCCTGCACAOXXS^^ 

TTTGGGTTACCTCGTACAGGAAGGCGTCCTn^ 
AJGTTGGMTTGftCCGGCATTTCCGAACTT^^ 

CCEGCAATACGCCGMXX^ 
ATCGTGTTCCCCGOCGCGC^^ 
TOGCCTTO^mTTC^^ 
OTCOTTCCGCCOCCACCCT^^ 

SEQ ID 2438 

ffiAALHIGHIiSKSFQNTFViaflHSI^ 
MLELTGISELAOTYPHHiSGGQQ^^ 

iwPMuaarrsmLGRLPVQS^ 

SEQ ID 2439

ATGTCTCCTAAAAAAATAC(£ATTTGGC^^ 
TC(^CTGTTGTTCC6CCCGCGTATGT(^^ 

GTGATGATTATGAGCCTGTCCTCGTT^^ 

CCTTTOTTCCGO^^ 

CACTGCXATTTTCCAAGMTAOG^^ 

GCCAAGATYTACCACAGC(^CAAAGGCGTTGCC^ 

TCCCCTTTCOX^ATTC^TA^ 

TATATTAIOTGCCTTGCCOT 

TTGGTOTATTTCAGCATCAACTACACC^^ 

AACTCCXXyU^GGGATGGAACAGGTCGGCGCAACATTG^ 

OTCAAGCTGATGAAAGAGTTGACCG^ 

GCGCTGATOTTC^TTATTT^^ li 

SEQ ID 2440 

MSPKKIPIWWKftlLIiIALPLT^ 

VfflMSISSFPUYLPVEAALKRISI^^ 

AKIYHSGKGVARPYPVKTIiKLPGQIGAIVPI£ 

LVYFSINYTPAVYQTFIWILAYPMLYLPl^^ 

ALffljVLFSGIPVFLLKKYAFK 

SEQ ID 2441 
ATCAAAACATCTATOX^ACGCACTG^ 
CXrmACC056(X?EACCG^ 
CXmCACTCGCCACCOT^^ 
GGAOGTTOGOGCGTOGTCGTTTAC^ 
GC36C61TCTTGGAACA6AIITCT0GCCATC 
AGCOnTGAAAACGGCGAAATCGAT^^ 
CCCGGCGCACTCGTTACCTATTCCGGCG^ 
TCCtHXXICGMTATCCTTTGAAT^^ 
GCTGCTTGAGCAAGCOGGTATGAAA 

SEQ ID 2442

MKTSmALUtfUO/FAA^ 
GRSKVVVYDTRKLSEKDIiEKSVLNyATPKWKNRIGy 
. PGALWSGAAVLKSSQMBAKKFVA^ 

SEQ ID 2443 

GTGTA1'1U.Y1WTO U £ATAAATGCAMTAATTATO 

SEQ ID 2444 
VYSFFFnmNNYFIJniLLSDPVIVCSDIiYyFRASRP 

SEQ ID 2445 

TTGTCG6MTGCTGGAGAACCTCTCCOT 

GTTCXKSCAGCGAGGCGGOTTIGTro 

CGCCCGAI^TCAGCATTO^^ 

TCCSQCTOCQCOCGCCAAATC^ 

0GOTTGGGTATC(X3TATC^ 

SEQ ID 2446 

LSECWRTSPFKTPSBQGOTSIAIPADTVCRIMNIV^ 
SASARQIKTSSCSRPSffliSSIiPStrSKSI^T^ 

SEQ ID 2447 

ATGAOCGATACGGATACCCAA(XX^^ 

C^GCGCAATGCTGATCCATCG^ 
SEQ ID 2448

SEQ ID 2449

AT(^0GACW^ACCTGGTCC6^ 

(XEACGCGCAGATGCTGAOCCGGT^^ 

TTTGGAAGATCTCCACATGAAC^^ 

CTGCGCGACCAGATTACCOT^ 

AGO^GTCAGCTTCGGACACC^ 

OaJEAOtamAlXm^ 

60CTC6CTGGT1IATGGTTCACTTGAG 

GGTTTTGCCACCGCCACCGACTTG& 

GO3AATT(XKX3CTa^GC0CT6C^^ 

ACAjGGTAOXXnXX^^ 

SEQ ID 2450 

JOJDRTWSGRFNEPVSELVKQYTA^^ 
LRIXJITnQNLIQNLQTALIjDLAEQNAEAV^ 
ASLWVHLSHLSEELI^^ 
GFATATOLADYLVKKG^ 
SEQ ID 2451 



(mMAGACAAAlTCTOGAACA^ 

ATCGGCGACGAACOOTCGCC^^ 

CC(nCGAACCGTCGCAAACOKXm 

CGTTGTCGGACOTACATCC!^ 

(HCTTGC^XX^CCCCTTT^ 

TGGAAAAATACCGCACCGAA 

SEQ ID 2452

MKPIKKAVFPTAGMGTRFIiPAITK^^ 

IGDEPFAVIIJVDDIJDAPKGAIi^ 

VLAHPPE(nraX^SKLGYl^TVAYGli^ 

SEQ ID 2453 

AAAATTCCTGTTTTTATCAAAIA^OT 

A'^GTgmTTTOTC^ 

TCCGGTACAGCTCCXKXGCTITITrGGC^ 

(XACGGGTCCAGCCCGTCTTCCT^^ 

ATGTCCTCCGCGCCXSAA^ 

GCTTGG6ATA0GTTTCCGACAG6TTTTGOT 

A(XKX3CGCCCTTTCCGGCGGTACA^ 

TCGGTTATGCGCGTATTGCTGAAATCE^^ 

ACXGCGAAGACAGGCAGGGAAACGCCTT^ 

OTCGCCG 



CACAGCGGCGCGCTATTC^^ 

SEQ ID 2454 

nPWIKYFKQFFLTOTRPmQVQYRF^^ 



TRALSGGTDLAWLDFAFQVQAVQA^ 
AFaSFFEOiSLFPGYIJPSQLTVHmVAPKRG 

SEQ ID 2455 

TTGGGAGOTATGAATACCGTTCAGACGGTCT(K^ 

CGCOGCCTTGTCCTGATTTAAATTT^ 

GGCGTTTGCT 

SEQ ID 2456 

I/K31NWQTVWHIRMYRYSELNLNQYGV^ 

SEQ ID 2457 

ATGGAATTCGTAOTATCCGCCAC^^ 
AACTGAAAGAAAACGGCTATCA(H!TCGACAT^^ 
ATGGAGGCTGAACGAACGCCACTACGGCCGAC^^ 
CTOTGGAWAAGACGAQjCGTTTT^^ 
GGGAA(^CCAAATCGCCCCCGC^ 
CTOGAMTCXX£ACCG(X^ 
SEQ ID 2458 

MELWDUiGQSEWNAKNLFTGWRDVKLSEQGLA^ 
IiLDKDDAFSAHKDRRYAHL 
SEQ ID 2459

AT6TCTGAAAAACCGGAAAMATCGTTOGGCAAG0GG 

CCGAATGCCCCGAACOTATCCCAC^^ 

AftACGGCGCGCCGGGCATCCATT^ 

GTCTGCGTATTGGTOTTGTCEGT^ 

ATOTTATCTGCCCGAACACG(^ 
SEQ ID 2460 

MSEKPEKIVLASGNAGKLKEFGNIiPKPyGITO 

VCVLVFVRHKDDPmiAEGIWHGQMJTPLGQNGFGro 
SEQ ID 2461 

TCCTTTTCGCATOGGCGCGCTtn^ 
SEQ ID 2462 
MNRHOYLLSVALF^ 

SEQ ID 2463 

TTGTGCTATACTTTCAM 

CCGCCGGCAGCMCCACAAAATCGAGTGGAACCGC^^ 
CGAAGACGGCGATGAATTGGACGTGTTGCT^^ 
AAAATCGTCTGCmCCTC^ 
AAAAAGCGGGTACX&CXIAAAGTCG^^ 

SEQ ID 2464 

• LCYTFKLQTFPNKG^ 

KIVtroABDMilKMKTLSD^ 
SEQ ID 2465 

ATGCCGATGGCMMCCGCTCAAATACCCXXnTTC 
GCAGCCTCGAACXXKKM2GAAACCGTCGCCCA 
OTACCACCACTGGCGGCACOGCTACCCCAAAGGCG^ 
GGCTOGTTCGGC™ 
SEQ ID 2466 

HPMAKPLKYPVSALVVLYSGDGGIIiL 
GWFGLEEAAEKVFSPSNERAILRLGRFH3CR 

SEQ ID 2467 

ATGOTira^TGCCGTCTGAAAGGGAAAC^^ 
AAGttSaAACttGCCGTAGGAGACGTCTTCCTCG 
GC(^GTG(HX3GTAGATTTCGTAAACCGTGCTGTO 
CGGTTCGAGGCTGCCC&TTA 
TTTGCCATCGGttTCTTTCGGCGGGC^^ 
GTTTKXJGGTAAATTTTCCTTTGTGTCAAAAAGA 
SEQ ID 2468 

MRPMPSERETGRVLPFAQKPAQFQNRAPVGRGKHFFRRIT 7 ^ 
RFEAAGYRLPKSFRMRAFDEQDAAVPAIKD^^ 
SEQ ID 2469 

TTGAGCGGTTTTGra^ 

SEQ ID 2470 
SEQ ID 2471 

ACTATGATGTAGAGTACGCGG^ 

SEQ ID 2472 

VWRSI>VAHIiH<WQGVEGSNP^ 
SEQ ID 2473 

ATGTCCTTTGCTTOCATTCCG^ 

GCGTGGTCAAAGGCCGCGCGCTGCCTGMGTTTCA^ 

GCGCGTGCTPCGGCGAGATTTOGGTAAATACCAT^^ 

AACTKX3GTTCGCGCGACGGCGACGGGGCGGC^^ 

ACTACGACGGCGCCTimACGAGCCGCTGC^^ 

CGAAGTCACGCAGGCGGCGATTCCACTGTTGMGAAACC^ 

GAATTQCGCCCTATTTACGAAACCtmAGO 

ACTCCGCCAAAATCCTTGCCGAMTCGM 

CGTGCGCGACGAGTCCXSACGGCXjAACATOT 

GrcTCCATGAACITCGTGATGATGG 

GTTTAAACCAAGTGGAAAAACG^^ 

GCTGTTCGGGCTGACCGAAATCCMGCC^ 

CGTCTGAATATCTTTTTGGGCGACGAAAA^ 

CCGTGCTGACACAAACCGCC^^ 

CCGOCTCAAACAAAOCOTG^^ 

GTTTCCTCXrrTAATCGAGTTQCAAAACGG 
A(X£ACCAGTCMAA(X^

TCTKXGACKXXmATCCT^ 

CXXJGCAGCCTGAAACAGCTTOTTC^^ 

S HtfAI^ 
NFGSRDGIXSAAAMmE^ 

VSSLIELQNGAKPVAMLTGLPE^^ 
SSEPEIHESEGRRGAAHKDRLFVAIJI^^ 

ATCmATAMG^GGAGTGATGAAACTCCTM 
CWCKBTTTIGGC^^ 
MGCCGCAGCAGGATTTTTTGCAAAGT'^^ 
AACGCC(^CG(^TGCCGAm^GC«^ 



CTCATGGCGC^^ 

ATCAGAGCCGTAACO^CGCGGATACTTT^^ 

KPQQDPLQSWQEUiGKHSNWBQDVPOT 
I^QHRVLYEAISNRRKSEASEAAKAHUfYVAS^ 

SEQ ID 2478 

KGSRARATGDDKPSVRVMG^QDIPLSYI^TPW^SSLQ 

SEQ ID 2479

TTGCAGXKTGGTGCGGATGCCGTAGCGGAGGM^ 
CTGTH^TCTTCCGT^ 
GAATAGTGCGGTICTAGCCGGGTTCGTT 

SEQ ID 2480 

LQAGADAVAEBOJiraVAR^ 



ATACCAATCGA 



TTGGTAGAGOra'GCTTT^^ 
GAATA&CGACGCATCGATOTTAACAAATTGGAAAG^ 

SEQ ID 2482 

LVEHLUXQGVIGSIPPASTKT^ 

ATTTCGAACCCGACGGGAAATACCACCTATT^ 
<X£QGCCATTAAAGGAAATATO3OT 

ACGACATAAAAGGCGTTGCCCAAMTATTCGCCT^ 

CGGATTCAMCGCGCX^CCCGATACAGCCCCGA(XTO5 

GGCGCAG<X!GATGOTTGCAGGGTATMGCGMGGCTC^^ 

AACTCAAAGACTATGCCGCAGCAGCCATCCGCGAT^^ 

AGCTCTCCGGGGAAAATACGGCTTGGGCGGCAT<^ 

GCATACGCCmm!CCGTCCXOT 

AACTCGCAGACCAACGCCACCXX^GACAGGOT 

TATACOTAAGGCTAAGOTGTGTTTGATG^ 

(^GAGTAGTCAGOTAAAGCCCATGCGCAACGAGAATGGGAAAAT 

TAACCCGTGGTGATGTACGGGTGATACMCAAACC^^ 

TGGGAAAGTGATGACXIAAGCACA(XATGT1^XCAA^ 

SEQ ID 2484 

LGISRKISLILSILAVCLPMHAH^ 

GAGDAVQGISEGSNIAVMHGLGLLSTENKMARIlffi 
AYAKYPSPYHSRNIRSNLEQRYGKENITSSTO 
QSSQFKAHAQREWEN^^ 
TSRSGIKIEGFTEPNRTAYPIYE 
SEQ ID 2485 

TTACGTCTACmAATGA£XX^TATO^ 
TTGOCTAGAATTTCATCATGATAAGGTCAIT^

SEQ ID 2486

MMKHKCFCDKFPSGOTFRlCIIIiD^^ 

APLQLPRSIHSWETVI EB 
SEQ ID 2487 

AAGMGATGGTATAATCTATGGCTATACTATTTTACTm^^ 

SEQ ID 2488 

MTNLKLDPYS^IKDSCPN^^ 

SEQ ID 2489 

TTGAAGGATTCTACAfX3GCAAATGA^^ 
MGACAAMTCCCTAOTACACTOGCA 
A 

SEQ ID 2490 

LKDSTGQMRIAOTLAEH 

SEQ ID 2491 

ATGTGGAAAATTTATAAAGAAAATAtX!^^ 
ATTTACATGAAATTCCAMTTATTOT 
AACGGGTATfl&TTTTTTAAG^ 
TTTGTAGAGCTQCCPCCGCTT 

SEQ ID 2492 

MHKIYKBHSTOIJIFAIGSI^^ 
FVELPPL 

SEQ ID 2493 

ATGTTTGACGGCAAAGGMCATCTGAAATACCA^^ 
GAGCGATGAGTAAAGGAAAG 

SEQ ID 2494 
MFIXKGTSEIPNYTWHHH^^ 

SEQ ID 2495

ATGTGGAAAATCATAAAAGAGGATAGTGATGATTTAGGATTTGC^ 
CCATCGAGGACATCCCTTTTTATATTT^^ 
CIT^CAGGC^TTGCCTTCTTAAGGGGGATAGATC 
COTTTGTAGAGCTTCCGCTTCTT 

SEQ ID 2496 

MWHIKEDSDDUJFAIKOJSQSIDI^ 
PFVBLPLL 

SEQ ID 2497 

ATGCACAAAATCCTGCC^TCGCCCACGTCCTOT 

CCA0CAC0606AC0GTTACCCTTTCC6GOT 

TGCCGCTAITOGCTBCX^TGCCGATGTACC^^ 

GGGCTCGCCCCCTCCGTCAAC^^ 

AAGCCGAAATCCCCG<X&TTGACAAAGAAAGCAAAATGTC^^ 

TGCCGGGATGGGTTGGTTCGATGCCGTCT^^ 

ATCCTTITCACT^^^ 

GCAGCA3OTTGCCTCCOXX^ 

(^raSACTKmCAATGO^^ 

AAATTCAGCCTGCQCGAMTCATGGTGCTGC^ 

ACTTTAT(^(XXnXX3TCCTCTTC^ 

GCCCXSCCGGCMTTACGCCGGTTTGGACGTGATG^ 

AAA 

SEQ ID 2498 

mhkilpiahvlsrlgmlfsfillipaalswsix^ytafatt 
glapsvnfwrhmlnv^^ 

IVFTIFGGIHFASHFAAMSRSIjICnV/KDEEro 

KFSLREMMVLLHPKAVRTVKISGKAIPDRIALTVMSFff 

K 

SEQ ID 2499

CGTTCTCCCOT^^
CCCAAACCGOTTCXXX3TT^^ 
TOTGTAGTCCAACACCAGCTTC^ 
TTCTTTCAGACGGGATTTCAGTTCCGCCAAOT 
GCCMCAGTCCGAGGfPOGACCACAACATATCGGC^^ 
G(XiCGGCttXXX3CAG0CGCCAAAAAAT<^ 
AT6CCC6G6CGCGTT6CG6ACGAGTGTCAGGCGTTGGTCK 
ACAGCGCGGCAAACGCTTCCGCATGGCTCAAAAA^ 
CGCGTCFTTGCOXXXIAA^^ 
SEQ ID 2500

FFQTGFQFRQGIAPEAGEKQQPVRLQRARPVVQHGKDIiFR^ 
MPGRVADE£QALVG&F(^ 

SEQ ID 2501 

ATCGCACATTACtmTC^^ 

GCGGCCCGAAATO£TC£AAAC(OT 

ACGCAGCGACACAATCGAACCCATAC^^ 

CrcxXXmTGOB^ 

ACGAAGGTTTGGAAG<X?rATGCCCGC^ 

GCOTTACCTGCGCCCGTGGTTCMAGOT 

SEQ ID 2502 

MAHYAIGDIQGCFDBLTALLGKIGFNHGTDTIjWLTGD 

IiPQWRIAKAESIAGRAEAEIi^^ 

GALWGGQLTAVNLETBEITQVQAAGGIIWKSFAR 

SEQ ID 2503 

TTGGACTTGGGTMTTTCTTCa 
CCGMGATGATG(rroTGGTCGAGGTTTTGC^ 
GGGCGC3GCATCCG0GTGAAGG0GTTCAl^ 
(XXXjCGCAGTTCGGOTCGGCTTCTC^ 
C(X^GCCAGTCGAGCATTTTTCCGCCGTC 
GGACGATTTGCACGCT(?rTTTra 
TTTGCOGAGCAGCQCGGTCM^ 
TTATG(XGTOGAAOTGGMTTATGCGTTCAAM 
SEQ ID 2504 

LDLGNFFRFEVDGGQLPAPQRAGVQRDDVVGIEVAQRGPVSEDI^^ 

AAQFGFGFSGERFGFGYAPIJIQDAGVYHHAAALAY 

FAEQRGQFVETAI»DVAIX!VMCHC^ 

SEQ ID 2505 

GTGGAATTATCCGTTCAAAACGAGGACQGCTT(X^ 
TCGTTGAMfeCGtXSAAAAT^ 

GGOTTCTCGGTAT(X5XTGACGGTT™ 
SEQ ID 2506 

VELCVQNEDGFRFDLHAFGQRGNAIXjGAGGERIiG^ 
GFVGMDDGPGHFGPUK 

SEQ ID 2507 

ATGT(XAAAACCGTCATCCATACCGACAAAGCCCCCGOT 

!K^CTGTGGTCGGCAACGGCGATTTCCACACC^ 
GAAGCOGTCCTCGTTTTGAACGCA 

SEQ ID 2508 

MSKTVKTDKAPAAIGAYSQA^^ 

EAVLVIflA • . 

SEQ ID 2509 

TOTGTAGTTTTAGGCGGTATATGTGCCG^ 

ACGTTTTmCAfTGTTGAOT 
•K^GCTGTCGCTGCCGCCCAftGTGGG 
AAGTTTGAAATCGGTGGATGGGGCTTGTTT^^ 
CC 

SEQ ID 2510 

IiCSFRRY^OKQKTAEGSAVCGITRISEFDAHAGGQFAGTDVFDrTO 

KFEIGGWGLFIVFVARVIDGEAEVVDAVACKMBAEAFOT 

SEQ ID 2511 

ATGAAAAAAGCACTTGCCGCACn^jAI^^ 

GTTCTGCX^AAACKK^TrcAGCXJC^ 

TTACAGCATCGGCGCGTCCGTCATTTAC^^ 

AGCAAAACCTCCGCCGCXXTCGGCGTATT^^ 

TCCGTTCXXSGCGAACTG^ 

SEQ ID 2512 

MKKALAAIiIALALPAAAIAEGASGFYVQADAAHAKASSSW 

SEQ ID 2513 

TTCCX3GTTTTTTCCAAADCGTTTGC^ 
CT 

SEQ ID 2514 

UlFFPNRIiQVSPIRI^PPFKGNMVNGFAVGKAAGCRRVS 
SEQ ID 2515

AT6OTCMGATACGCXXXX3GC&TCCX3GTO 
MAACCGCAMGGCGGGGCGGCGO^^^ 
SEQ ID 2516

SEQ ID 2517 

ATGCAACGTTGCTTTAATTGGTTTGACCA 
TGACTOXXJCCGCCGCACTCTTACCC^ 
GCCGA06GGTATG(SXGlTrCTTC^^ 

SEQ ID 2518 

MQRCFNWFDHYCRRLEKYFRR^ 

SEQ ID 2519

TCTCTTCGGAAAAAAACGCACCACGT^^ 
GCACACGAACGCGCGCmTTTC^^ 
TAACTGGATCGTAAAAGCCGTTTTCGGCGAG^ 
ATCGCGCCCGAAGCAGGTGAAAAACAGCAGCCCGl^^ 
GCACACGCTTCGGGCAGCGGCACGGTC^ 
GGGA&TCGTATTCTTCAAATACCGCCAGCG^ 
TTGGKX^ATGTTCGGCAAGATGGCGGTGAAOT 
SEQ ID 2520 
SLRKKTHHVLPLROT 

IAPEAGBKQQPVRLQEARPVVQHGKDIiFRRQQSEVRPQH 
LVGMFGKMAVNIiPAFVLRIHVKPPSTKSRH 

SEQ ID 2521 

TTGCAAACGGTTTGGAAAAAACCGCAAAI3GCGGGGCGG^ 

GGCTGTGCCGTCTGAAACGCCCGGCGGGGGATCC^^ 

SEQ ID 2522 

LQTVWKKPQRR(^ICQIVAGAGVRPFVC^ 

SEQ ID 2523

ATOACACCATTTCCTTTCCAAAC^ 

TGAAAAA(X5GATTGC(XmGCCGCOTAT 

OTX^TraGGCGGAATCGATO 

CAAGGATTTAAAGACtXIAGGCATCACGCGGCTTTCT 

CCGCCCTGAAATTATTTGATAAAGTCMTATCGAOT 

C&aTATCATCTGACGATGG^ 

GGTTTTATCCACTACGAAACATC^GCTTTTGC^^ 

CCTATCCC(»C(mTCGA(XXX^CCGTC^ 

GTTCGMTTCATGATGAAOGCOCTGCGOCTGACCG^ 

(X3CCTGCTGGAAACC<&CCC^^ 

SEQ ID 2524

OTISFPJffiTRimPPI^YmFVTCIKK^^ 
QGPKDAGITRLSIGVQSFOT)CMLSRLGRVHNGKEALTAI^ 



GLLETDPTVFRPTEKGRLFLNDLLQCFL 

SEQ ID 2525 

AT(^CCCGACCGCACMCGCATACACGAACra 

an^cmcarrTTTccccGm 

CCCAAATTXX5ACX3GCCTCCmra 

ACATCCCt^TGOGGCTGCACGGCGAAAACGTGC^^ 

ACOTTCGCCAATCCGCGCAATG^ 

ATTGCGGMGAACACATCX^GAACTC^ 

AGCAAAAACaXECGAACra^ 

ATTCCCTGCCGAAGAAGCCCTG^^ 

AACGCCACCCKX^CAATCAC^ 

(XXXWGCAGGAAACCGCCC^^ 

CCTGO^GCCCGCTGCCCWT^^ 

GCCTCGCGCAAAGCGATGGACATCGACGGCT^^ 

AAATGAAGGAAACGGCGGATAAAGGGTCATCTGAAAACGAAM 

ACGAGCTGCTCGCCGCAGGOGTTGCCCCGCAA 
AGCCCAAGCCTTATGGGAACTCGCCGGAC^ 
CTAAAMOTTCTTCGCGCAAATGC^ 
G(X3ACCAAGCCCAAGCCCTAATCXzAAGCC^ 

GAACTGCTTACCCTGCTCTGC 
SEQ ID 2526

PKFDGIMSIiLYBDGVLVQAATRGDGTTGHJVT^^ 

IAEmQEIAYFQALGFSLPlOT(X^ 

NOTINQDEVSRKD^^ 

TIoERVRRAPEPVLACIi^ 

LKTPFAQMPSEDEAAQGSDGINKAVAGKTFVLTGT^^ 

SEQ ID 2527 

GTCCGGTC^^ 
SEQ ID 2528 

VRSGSVrnm3SlX^KCKI^GKTC™SD^ 
SEQ ID 2529 

AGATACTCGCXXATRCGCAGCGGTGTAATCGCGTCra^ 

CCTgTAAGGCTOAGGCGTG^^ 

ATCCtfPTCGGAACGCGCC^^ 

SEQ ID 2530 

CQTPFDKVAAQRGKRFRMAQKKRVCRLFH^ 

FFGSIKHEPSLLLAAHRRRAH 

SEQ ID 2531 

6TGATTGCXAT6ATTTACATCGTACTGTTCOT 
MC^TCCCCTGCTCMCAGCAAAAC^ 
CATGCGCAACCTGCMGAACAGGATGCXXJTCTAQ 
GCCCACACCGTTTCCGAACCCCm 

CSGCTQCACCATGGACGACCATTTCCA^ 

TCCGCA!HmCCGCC^^ 

ACCAGACCATC(K:CATCCATrTGGTTT^^ 

GTCGGGCTCGACCATGTTCTCCATCTGCTC^^ 

GGCGAAAAAACCTTO3ACGATTTGTO 

GCACTTATGTATTGGCGCGTCAGTCCGAGATGCTC 

SEQ ID 2532 

VIAMIYIVIjFIiAAVLAVVAYNMYQE^YRKKTO 
AHTVSKPQTGHSArcPADAPAKPVPVPQTPAKPLITLKELSKTO 
SAmQADAFAQSMGGQTLHTDLAAFIEVASAl^ 
GEKTFIH)LFMDIjAVRLSGQIjNI^^ 

SEQ ID 2533

jroOQQgmM^^

SEQ ID 2534 

MAITLNFGMQDAQSAGTAVKSGLRFRQSDGTAGY^ 

SEQ ID 2535 

ATGCCGTCTGAAAACCACAGATTO^^ 

SEQ ID 2536 

MPSHiHRFQTAFRKISAYPimiiSDPAimiQTL 

SEQ ID 2537 

ACGGTTTCECTGATCGTGCTGCACAACATT^^ 

TACACACTTTGCACGTGTCCAGCCATTTC^^ 

GTCCAACGCGTTTTCCATCGGCATCGAAC^^ 

GCG(tfAACGGGACACCAAGACAT(£CA 

AGATTTCAGACGGCATTCGGAAAATTTCCGOT 

SEQ ID 2538 

I^AGQEEHHGQSRRSTIiAKRLA^ 
VQRVFHRHRTGRLRFRTVO^ 

SEQ ID 2539 

ATGGACAATCACGCCCX^GCACACreGC^ 

ACATTTOCTGCCGCCGTTO 

•TTTCCTGATTGCACGCGACGG^ 

GAACTGGAAGGCTGCC^TTTCGAACCGTTTGC^^ 

CACCOmXKAAAACCGACra^m 

SEQ ID 2540 

MDNHAGAHV^NGWLQSIRHTPSPNFSPR^ 
ELEGCDFEPFAEAQYRSLETUjAALCRRyPITAW 






SEQ ID 2541

TOKOTmCTGTCGCCCGAATm 

CGCTGCTTTTTGTCCCTAAAGAC^ 

TTTGACGGCGGCGGCCTACGTTTTGGGTCTGCAC^ 

TCCCH^CCGTGCAGAmTCGA^^ 

TTGCGCn^TGCCTTCAGCGGCMTOT^ 

CX^PGAAl^GGCATGGGCAGGCAGGCAGGACGGGCTGCOT 

GOTCOTCTTCGTCAACCG^ 

CGCCGTAOUOCrrATACGCGCX^ 

AARTGGACGGCAC^GQC^^ 

SEQ ID 2542

liGYWARILRRIPNAGNPTMLKKML^ 

SmQIIEGSRPSHHRKVnttTTOIGHDTKOT 

ASVFVNRLKIQIRI^TDPSV^ 

SEQ ID 2543 

TTGCGGT1TCATGCTT6CA6TTCC6GTM 

CATGGCTGAACTGGCTCMGCCCGTGCCGT^ 

GCGGCAAGCCCCCGC6C6TATAGCTGTTGTRCGGCGOT 

GCCCTTCAGGATTGCCGCTGAAGGCAT^^ 

AAAACG(WCCTTC<&T^^ 

TTGTGCACACCCAAAACGTAGGCCGCCGCCX^C^ 

ATGCCCT(XXX5TT(nCm 

AATACGTCT 

SEQ ID 2544

U^CSSOTGKPSEimiFSDGMV^^ 

HTDFQAVDEDGSDMVAVGLMPEFIj^ 

KTRTFIMjHGNGIRPAAAHFIiQDIPSRHFRRQSVRPRMQPVVH^ 

NTS . | 

SEQ ID 2545

TTGACCGCGCATTACAAMTCGGCAAATCCAC^^ 
(KGCAACCGTCAAACATATAGTGGAT 

SEQ ID 2546 

LTAHYKIGKSTRIGLDFSJVFNKRCRI^^ 

SEQ ID 2547 

ATGGT(MTTCCAACCCCCGCGAAOT 

CC6QCAGCCGTG(mCCTGCA0GC(^ 

GGCTTTGW^TA(XX»GTTTCAGCGCCG^ 

TCCT<mACTCAOTACG«^ 

AAACCGTCA{X^TACCTTCGTCGTCGTAO 

(EGCCGTATCAGGGGCTTGMCCAAATC^^ 

CTTGAAftTCGGAOMXATACCTOa 

TGCCGATGAACAAAAAGCAGAACAAGCTGCTCAWX^CT 

CaCGAAATCCGCMCCCGATGTCCGCCATCC^ 

AAAGCCGTGCG(XXJPATO3GCGACXHX3^^ 

AC(X?K^imTGTCGTCAOT 

COXm^mCAGATGCTOCTGAATTACAOT 

T^GATACAGGACTATCTCG^ 
CTACCGTATGGAACGCCTCAACATAGAC 

SEQ ID 2548 

MVISNPRELEOiKmPlfolNIIK^^ 

SCLLSYGRYPUiYASVASIUJPNAIADSNIN^ 

LEI^HTSLFDPVMLWDKTSSRW^ 

KAVRRIGDATEQPVDVRIVCATHKKIiEALVESG 

QIQDVHHKmTMAVPVAmPSEIAAAPSRL^ 

SEQ ID 2549 

ATGAAACCGCAATTCATCAOTTGGACGGCATAGA 

GCGC^UiCGCCGGTCGGTGAGGCCTO 

CCTGCCCGOXTTTCAGACGGCATCCATG^^ 

GS^GGGCGGmGOXXXrGATO^ 

TTATGCGTGTGCGCGGCGOTATCTCGACCGAGOC^^ 

ACATTTOGGCTGT 

SEQ ID 2550 

MKPQFITUX3IIX3AGKSTNIAWKA^ 
VQGGLEPDL'MLDVPLEVSMARIGQAREKD 
CGGGCGTAGCCGCCCCCTtnOTGGA^

ACCAGCACGOSWGCC^raC^^ 
CCTGAAC(MCTCCTO3A^^ 

GCCGTTGMGACAACGGCCGGCXXX3TACTTGCCGATGCGGTTAMGGCAAAGAtt 



jOTCTGCTTCCCGlTCarcP^^ 

axaXGTGGOACCCCTCOG^^ 

SEQ ID 2552 

MENSIjKEAAIjKPHELPVPGKISVTPTKSIA 



AVEDNGRKVI^VKGKDIFI^^ 
EVVJAYGKKFEFGAEYLIPTPFDSRLL^VATAA^ 

CGTTCGGCA6AGGCAACCCCAMXX5TG^ 

SEQ ID 2554

LAIYPKLQNKPPPVHTTGQWI^^ 

ATMGAACAGT«?mACAGAM 
CCCTGTCGTCATGACCGQGQGCGGTTTGTO 

SEQ ID 2556 

LAYSIQIFEAESAGTLIHRraCRPOTQ^ 

GCAATACCGATTTGGCAAGTTTCCGCACCGAQ^ 

GATGTG66CATCGCTGCATTACGMCCGCCCGTCAGCM 

CGGCTTTCAGAOGAATGTTTCMTTTCGW^ 

GGCAGATACTGOCCCGACACACX^ATTCCGACACC^^ 

AAACTGTMTATTTTOTAATCGACGATGTC^ 

CCAATGAAAAAA 

SEQ ID 2558

FMKK 

SEQ ID 2559

TTGTTAATCCACTATATGOTGACGGTTGCGGTCAGGC^^ 

GTGfflTTTGCCGATTTICTMTGCGCGGTCMGTCC^ 

CoxAACGGCAAGCtmTCAGCC^^ 

UiIHYHFDGCGQAARRAVHVNVGlDlAAAFVKDVFKVQTO 

SEQ ID 2561 

ATGSra^AGAGCMC^ 

SEQ ID 2562 

MVGQSNRLVKAIMVCQSIEETIiRPGDTGNRCDAI^ 
SEQ ID 2563
ATCCCAAATT^ 

GGGC(^TTTTTTATC6GGAAAAC6GCAAAG(XjAAAA 

OTGOCGAMAAG^ 

CTGTTm^AAAACAGGTGCOmC^ 

A(XXXCTGCAACGCGATGTGGAAATCGCG^ 

TGCGGTG6TGGTCGGCGGCAGGGCGGCGGATG6TTO 

TCGACATT(m»6GGM(mX3T^ 

CCGATATTTTCMTCGGGATTTGCC^ 

AATCGAACCTGAOTGGCGGAGOT 

SEQ ID 2564 

KraPFKFHTO 

IjpRRI^SAPTAVGSSRVEAGRALLAAHPE^ 
STLSGKRVAAVAGIARPQRFFDOTH^ 
SEQ ID 2565 

ACCGGGATATAGAAAAAMTATGAriGMGGCTTTMTGCCGGATITAAACCGTATG^ 
TTATCGTGAGMTGTTrajAAAGATTTWTTACGCCCGAAGT^^ 
TTTTACGGTTCGCCTGTCXKn^
TGCCO&OTTACG^^ 

SEQ ID 2566 

VDMRLKTLLLPPAALAIiO^ 
FYGSFVGQ5VVAKHPIfoIKK^ 
SEQ ID 2567
TTGCCTCGCCTTCCCGTACT&TTTGTA06GTO 

SEQ ID 2568

LPRLAVLFVR5AASP PCPDLNLIHyTAADDAP 

SEQ ID 2569

ATK&AAAAAAATIOTAC^TCC^^ 
GCATTCCCTMATGCTGG^ 

SEQ ID 2570 

MHCKPLDILVCPVTKGRLBYHQDKQELWSI^AKI^ 

SEQ ID 2571 

ATGACCGAATTCGTCGTATTGATTCCGG^ 
GCAAAGCCGCGCGCGTCGTOGTCGCCACXGATCAT^^ 

GTCXmTGGCGACXXXXXKXX^CGAA 

CGTCTCGCCGCTGGAAAOCATC^ 
TT(^CAGGGTTCGCOCTGTATTTCAGACC^ 

SEQ ID 2572 

MlWVVLIPAMiDSSBLPGKALADIHGKPMVVRVA^ 

Vg^AGHEUmFDEUQIPNAVKVVIiI^ 

LDRVRAVFQTV 

SEQ ID 2573 

ATGCAGCAACATATTGAAAAGIXXOACACTTGAGCCGT^^ 
ACGCACCmTGAAGTCA^ 

CATCCTTTCCGTCATGATT^ ! 
SEQ ID 2574 

MQQHIEKWQHIiSRE^ I 

SEQ ID 2575 

GTOTCCTTCTACCAAATAATTGAf^ 
ATTCTAAGGTCGACmAOft^ 
GATCATCGTraXXIACGAGACCCCATACTTO 
SEQ ID 2576 

VSVYQIIEGGNFLRPKLRYVGTO 

SEQ ID 2577 

TTGATATTTATGOTACCATCGTmATATCAGCCG^ 
CATTGGATTCCGOMAATGAAACGCGC^ 
GACCACCAAAGGCACGGCGCGCCCCGATGAAACCGCXTI^^ 
AAAATCCGCCTGCCGATCCGGCTCGGCM 
SEQ ID 2578 

LiFffliTmYQPEiPPimMnu/^ 

nRLPKRPGSRSMLSNTVSVILFEAWRQHGYAGGV 

SEQ ID 2579 

ATGACCGACATCCTCAEPGAC(&^ 
ATACCGTCAGCCTCAGGCTTGTCGGGGAAAAAAGCMOT 

TCAtaCGGCATCCffXXX^^ 
(XX^TC(?mATCTCQAC(X^ATOT 
TACCGCACGCCAAACGGCAAAAAAACGCGTCGTCGTCAAAC®^ 
CTGCCCTACGGAGCGGACAAGGGA 
SEQ ID 2580 
MTDIUDDTATEAVRTLIW^ 
STCIRRALLNPETQDTAARINLHF^ 
LPYQADK6 

SEQ ID 2581 

TIOTACACGGACACAMGCCXXn^^ 

ATGccaacrmc^^ 
SEQ ID 2582 

LVHGHKARAMLVQQPDKAAEAHEBH^^ 
SEQ ID 2583 

ATGCACAAAA6CATTTTGCTTGA<3GACXjTG6C^^ 
SEQ ID 2584 

MHKSILIiED\^VRNISIQEEKKCIIGYRQGKSYPISKD 
SEQ ID 2585

TTC^CEAGGCAAAAATAAA^ 

SEQ ID 2586 

IJ^INGKHI/JLTKFRIKTGQYHTGFAPTRYKIA 

SEQ ID 2587 

TTCOCTCGTTCAAACGGCGGCTCT^ATCTGC^ 
TAGGATGGAAGAGACAAATGAGCAGMTCXXKjCAGGCTTT^ 
GCACGGCATGGTOXIA^GGTGCGGATATTT^ 
CKXX30GATGTCTTGGATGTCGTC^^ 

GACGGAAGACCGTATTAAAACCATTGCCGAGCTGGCAGGCG^ 

TATTTGCGTCACTATATCGffi 

AGATTGAAAACMTGCGGGCAACGAGQCT^^ 

SEQ ID 2588 

LRDVLDVVRKFRETDTQTPVV^ 

mopoDipicroFGisNAESA^ 

SEQ ID 2589 

TTGTCCATGAGCTGGTTAGATAAAArcCT^^ 

fflCGACCGACTTTGCAGCAGAACAATC^^ 

TMTGTCAAACCGACG&TCCTTTGAAG^ 

TOTCTGTGTG(£GGC^ 

GGTGTTGACCGATCCGACTATGGGCGGCGTATCT^^ 

(?KK^3CGAAACGCTCCC6^ 

TGTGCCGTCAGGAC3AAAGTTTCCGCCACC 

SEQ ID 2590 

LSMSWIflKILPPKimGKI^^ 

lXSLPVVVAAFBFRnGGSBGSVVGERFVQ^^ 

VRETLPEOTQRAEFLIiEKGM^ 

SEQ ID 2591

TOGAGAAAOTATGCGCCGCGCCATCCT 

AAGM(£CGCCGCCC(KGMm^ 

CCTOCT^CAQCCTGCAAATTATGCC 

TCCATCGCX3TKX3(mCGGCAAA^^ 

(XSCO^CGACGCAmCGCGCCG^^ 

GAAGCCGCACGACAAA 

SEQ ID 2592 

tKIlCRRAIUiILTLTVOTSIAAGPSQKDT^ 
SIGFGNGKLSSYKVFGYGDTVNSRH 

SEQ ID 2593 

GTCCGGCTTCATCGACGGGGCATAGCGTCOGGCACXn^ 

Tam^GCCCAAGTGTGCCGCCTA 

ACXKXjATGGAGGAGCGGGCGCAGMTATGCCTTCG^^ 

GTCX^GCAGGATTTTCGGCTGTTTCCCCT 

GCGGCTTCTTCGGGCGTTTCGCGGTAGC^^ 

GTTTCTCCAMTTGAAAACGGCTTTATTO 

SEQ ID 2594 

VRLHRRGIASOTFQAW^ 

VBQDFRLFAVIFIHINVKPAVRVRIGQRRQFGGMFFKFAGGGFFGRFAVA 

SEQ ID 2595 

ATGMCAGCGAAACTTTAGACCTM(XXK&^ 

GCGCGCCGGGGGACTTTG^^ 

SEQ ID 2596 

MNSETLDVTGLKCPIiPIIiRAKKAIAQMRQGEV^ 

SEQ ID 2597 



GTAGGQCA 
SEQ ID 2598 

SEQ ID 2599 

TTCATGTTATTGCTCTCftjCATO 
TGACTOAGGTA6TTTCGAACCX3GTTTCCM^ 
GTGGAOGCTTCAMX3CGAT6G(X3GCAW 
CAGCCGCCGAAAAATGCCAAIATA 

SEQ ID 2600 

IJCAI^AFVLAACNAQNRQSGVR^ 
QPPKHANI 
SEQ ID 2601

TTGACTATATATOGCMTTrT^ 
ATAAATCCCGCCATC(£OT^ 

CAAGCACAAAIPGCOGACAGCAAT^ 

SEQ ID 2602 

LTTOTSAAVGQALRNVPKARHaPAR^ 
QAQMPTAITSIETSAFQTAFICA 

SEQ ID 2603 

TTCTACGGCAGCCGCCff^^ 

GATTOCCGACTGCCMCGCGTITCTC 

OTGCGCTTATCTTOGTGTCOGCAC^ 

TTGGCTCAiU^TOGGTGATrATTTC^ 

CCAACCCGATTCAGAAAGTCTGGGCGA^ 

SEQ ID 2604 

LYGSRRLPCA^ADGAQKKTMyKGQDPAKVTHYLTR^ 
LAQKSVIISDHARFRRMGALRGTAEKLGHKVIJ 

SEQ ID 2605 

ATG(K}ATACTCTGCGGACTTAA6AAACAAAGCTTT 

ATCACOTAGGGAATCGTTCCOT 

TCAAGAAACACGCTTTACCTGCGGAT^ 

SEQ ID 2606 

MAYSADLRNXALNHS(&TKIRTRR^^ 

SEQ ID 2607 

GTGCAGl&ACCOTTGAAA 
TGAACCTAACGCACCGTTCC^ 

SEQ ID 2608 

VQAPPETRQISYAWRESGSPEETVLTAQTAK^ 

SEQ ID 2609

TTGTCA!ITCC£ACG(^ j 
SEQ ID 2610

SEQ ID 2611 

ATGAATACTACATTGAAAACTACCTTGACCTCT(^^ 

ACACATCTCTTTOT^ 

ATTGAACAGCGAAATCTACCGCAAACM^ 

AOTACAATGTCGGACT^^ 

ATTTGAG1CTGATTGCCA(XX5TTGCC^^ 

6CG6TACMGGCAG6CGT(^T1TO^GCCGTCG^ 

(X^CCTTGATTAACCGTC^ 

ACATCXXXmaxmCACGCGCTCA^ 

CGGGCTGTTCAAAAGCGGCAC(X3GC^ 

ATTGTTGCCTATGMTCCGCCGTGCAATCC6CXIOT 

CGCCGAAMOrrTGCCGATTTOT 

SEQ ID 2612 

MNTTLKMTS^AAAPAIjSACTM 
SYNVGMAASYELDIiF(m^ 
ATLIHRPIPEDLPAGbPIiDKQrFVEK^ 
mYESAVQSAFQDVANALAAR^ 

SEQ ID 2613 

ttgttcctttttagattt™ 
gggtaaatccx3gggaatcggcggtttct6cggtcaa^ 

SEQ ID 2614 

LFLFRFLVSRRHADFMPSEV^ 

SEQ ID 2615 

ATGCAMCCCTGACaTCA"IXXX3ACCCG^ 

ACCTCAAACC(mGTC6TCAGCCT 

CACGCCCGMCTCgTO^ 

TTGGAAGAAATCXKIAAAACAGGGCATCCTGTTCC^ 

CGCMGTCCCGMTCTCAMGTCG^^ 

CCGCAACGAOCOTTG^ 

GCGCGTKXaACAAACTOAAGCCTIC^ 
TTACGGCCftCGGCGAACTTGOT 
nfiTDSAPHftKSAmaCGC^



SEQ ID 2617 - pmr Twwrif , ijvG(5C?Fft 

ATOGTGGTCBCGCCTGCGGGGTAGAGTTTGMGGCf^^ 
ACmATGCCGTCTGAAGTCAGGGGAQAAT 

nausea 

G^amTCCTCGACAAACTCGCCGCCCAAATC^^ 
VNGILEKLAAQIRPDKPKRR 

SEQ ID 2621 nwwwMwnHWiPVA^^ 

ATCGGGATGTTG 

igml 

GTCCWTCTTCGGCAAGCTCCCACGCGCTGC^^ 
TCGCCGTAGAATGOKmaTCGTCAACCT^^ 



AtntffiVAUKNIPIAHAVLTTENDAQAIBMGB^ 
CGTTCATTCCGGGAATAOnTGGGG^
TTCTGTATTOTGGAGTATGGAAACGCA^ 

SEQ ID 2630

LGKQCGGFRVGFGDVGEADDAEVVGVVGV™ 
FCIFGVWKRIRAVFCGRR 

SEQ ID 2631 

TTGACTTTARCCO^^ 

GCTACAAAGTCGCX^AATTCACGCAAACCGG 

AACCCTGCCCATCCtt^^ 

GCCGCGCAAAAATTGGGCTTTGTCAGCCMG<K^ 

TCGGTACGGMCACGAAGCACAAGCCTATCTGAAC^ 

SEQ ID 2632 

LTLTRKTLFLIiTAAFGTHSLQTASADAVVKPS^ 
AAQKLGFV5QGTAHTO 
SEQ ID 2633 
TTGCCGTTCTTATGCAAACGAGAAACCGGTTTP^^ 
TTTTCMGTTTCAGGCAACTTCCAAA^ 

SEQ ID 2634 

LPFLOTETGFCVSTVI^ 

SEQ ID 2635

CCCCCCCCAAOTATTCCCGCAA 

SEQ ID 2636 

MRmPiaQKHUCAANGAHCTTKPPPSADNAGIVVKSASF^ 

SEQ ID 2637 

TTGCGGAAGACGGTAATGAAAGACGATGTTTTGAAACG^ 

GCCATCATGCGAAGCACMCGMCGGTTCGAGTTTC 

GAGGGCGAGTCTGGTCAATGAGGGCGTGCTGGCGGAAATG^ 

CKKXKGACGCGATGGAGGCGATGTTTGCX^ 

AGGCAAAAGACGGCAAAACTGCTTTGCAGGAGGCGTT^^ 

CGATTTGGGC6AACTGGGTTTCGT6TGCCGT6CCAAA6GG 

AAGAAA

SEQ ID 2638 

IMTVMKDDVLKRQAHTMQK^ 

lADAMEAMPMVSFDADFKIAEKVVRHLFA^VRRADFQN^ 

KK 

SEQ ID 2639 

GTGGATATGAAAGCGMT^^ 

CTTTCAAACCGACCaCCGmCGCGC^^ 

CGCGTCGTGTTGAAACAACTGCC^ 

MTTTGAATTTGCGGCG«mGGCGGTCM 

TACGGACAAATCGGCGCGTTTTTTO 

GGTTOAACCGCATCTACATCGCCXjT^^ 

AACTGTTTGATAACAAAGTATTTTTGAAGGTC^^ 

SEQ ID 2640 

VDMKANMDIETFLAGERAAGGYRCGFVAI^ 

RVVLKQLPKHTPVILVIMIDKDKAKDRYALEAFVAQVRAEFKFAAA^ 

GUniimVLVDKESQKAILIGKGGERLKKISTEAR 

SEQ ID 2641 

TTGCCCC«XXXXK3CACCAAAGAGGCG^ 
GGTPCAGCAAAGACGGmGGMGGGCT^ 
CCGCCaSACATA^ 
• AACCGCTGGGTAAAAGAACGTATTC^ 

GCAGGATATTGCCCGATCCGATGGATTACCGCAfX^ 
CATCO^TATGA£X3GAAAACAGTA^ 

SEQ ID 2642 

LPRPGTKEAADUSDGKSRGIQAKTAYGSKNR^ 
NRWVKmEQNQPLSAEEEA^ 

SEQ ID 2643 

(nxaftOSACGGCGACGAm^ 

ACGAACAra^COXX!GCCGTA 

CAACACGCTGAAGCTGAATTGGGCCGTGTATCC^^ 

GTTTTGAATG(XTCGTTTGAGAAGWXX^ 

CGGCAAGGATTTCGGMTCXyiCXXjGATTC^ 

A3XXX3TGCCAAGAAAAATCAACGG(^^ 
SEQ ID 2644

VYDGDDFRDGLYFVW&EE^ 
VtNASFEKSLKNKSfTKHHLTLG^ 

SEQ ID 2645

TTC^CGMXX3TTmC^ 

GGAACAGCGGCATCGTGTTCAMCCGAACCGGCAI^^ 

GAAAGGCTGGCAGCffTCCCGCOCTGAMTCG^^ 

ATGAITCCmTGCC^TCAAAm 

GGAAMTCGACTCGAACXXSCGTATAC^^ 

TGCmG<^TGCGOTACAGCCGTCGCOT 

GCTTATCrGGCACX^GATCAAAAA^ 

AC^ATCGGCAACTACmrPACGT!^^ 

CAGCCTCGCGCTCGA&ATGAAGTTT 

SEQ ID 2646 

LNDRPSIGKYFDFSLGGRY^^ 

MIAVADQKTKLPDSAGRLTEIDIRD^^ 

AYI^DQmSAGKVTSSWm^ 

SEQ ID 2647 

ATGQCTAAATTCTTTATOGACCTO 

CGAOATOACTira^CGCCATTTATCaM^^ 

(XXXXjATTCGAGCGGCAGCGGCAGCGTGAGCCT^ 

GTCCAGCAATACGGCGTMCCGTATCCMGGCGra 

CCGAATTGCAGCGTATCGAAGGCGTGGGGCA^^ 

CAGCGCGTTGTCGGCGCAGAATATCC^^ 

GAATTCGGCMCX3TC^T(X!TCCGCGCCM 

mCCACCGGTATGGCGGTMTGTTCTC^ 

CCCTTACGATACTEa^TTTO 

CTCATTCCGACOTCCTCOTAC^ 

TCGATGACGCGATTGTG(nt3GTT6AAAACGTC6AG 

CGCCGTfCTGATITCC^^ 

ACCOTACCCCTCCTTOTGTGC^ 

AAGGCCGGGTTGCCAAAGTGCTGCGTAAGACT^ 

ccaaggcttcgtpatggtcacxe^ 
^ctttccggcttc^^ 

tgacgggtatgatgatcgggacgotaaagacggtttcg^^ 

CGGCCATACOXOTOT^ 

GftCftTCAACOGTGOCQC^ 

GCGTGATOSTCammCGA^ 

AAACGGTACGGA&CAGAGCGTACGCTTCAAC^^ 

TTCGGCGGCGGTOCAGCTTCGAAl^^ 

TTTATCAMGCTGGTCGATTCCGCTGGCGGT^ 

GTTTGCCAACGACATOACTTCCAAGTCGGTTTO 

'GAAGCC(3CTTTGGAGGCftG<XX^^ 

QGCCSGTCAGACATGCATCAAAAGCCK^ 

SEQ ID 2648 

MAKFFIDRPIFAmSiniAAGIFGIKSIiPVSQYPSVaACT 

VQQYGVTVSMfflSMPUnVMLSSD^ 

EFGNVILRAHTIXSSNIYLKDVAKV^ 

IJPTIWPISLLOTAFISYMGMSI^^ 

TLTPAI£ATMLKTIPRGHHEEKKGFFGW^^ 

TVSOTSFSGSGQNM^FAIItKDWWEROASGS^ 

DINWAAAAQGISFADIRTAIASALSSSYVOT 

I/SGGYSFEtfGGQSREEAKGGSQTLILYGIAVAAWLVL^ 

EAAIiEAARLRFRPIIMTSFAFIimPLyiAAGW 

SEQ ID 2649 

TTGGGTAGCGGCAATOTCAAMGTCGTCTG^^ 
ATCTAGTCCGTTCGG^TOIAGT^ 

SEQ ID 2650 

I£SGNLQKSSEKSETCjDSRPHG^ 

SEQ ID 2651 

ATCCKXBCAAGCC^^ 
AA 

SEQ ID 2652 

HGASLXQFCfiVSEBEMVVKDFCQGHKGSOQYRRFB 

SEQ ID 2653 

#KXXX3CTGGMGCTTGATCT 
ATCCGG&CATTCAMGC 
SEQ ID 2654

MPSGSLICTPKVGLNRQIJKVQVFIilQYKQDFR^ 
SEQ ID 2655 

GTGGGMTCX^GGTOTTTGAGOTCAGTt^^ 

CTGCACCT 
SEQ ID 2656 

VGIRJFBFQSFPDniSIECPDSRLRGNDGGNLVYIESK^ 
SEQ ID 2657

ATGAGGAAAATCCGCACXIAAAATCTGCGGC^^ 

ACATCATTAAAGCACAAAAMTCGOXX^^ 

CATCCMTTCCAC(m&CGAAGACGATG(&^ 

GCTCAGGCACTGCTGTTC(^TGCCTAra 

OCCCTGAAAACGTOGGCGAAimTCCGC^^ 

CGCCAACCGCCTATCCCGT 

SEQ ID 2658

MRKIRTiftCGITTPEDALYAAHAGAM 
AQALLmYHPSEYGGmRFDTOI^ 

SEQ ID 2659

GTGCGCTTGTGCCGTTT^ 

GCTTTAATCAGGCMCGCGCCAACGGCXSAMT^ 

CG<mXXm^AAAMTAC™GTCGGT^^ 

6C03TAAA(^TAGTCGCCGTOT^^ 

CATCXX3GOGGG(X3TAATGTAGTTTT^^ 
SEQ ID 2660 

VRLCRFRRHSRSVDFITO 

AVKmVFAAVAVAOlPVDDUEF^ 
SEQ ID 2661 

ATGAGTACCGAAA0CAAAAACTACATTACGCXXGCCGGAT6QCAG 

0^c66Cx»ccGCA(mmcx3(mcTATcrm 

ATTGCGCGAAGCGACCGACCAAGTATTTTTC^ 
ATTTCTTGGATTTCGCCGTTGGCG^ 

TCGAC
SEQ ID 2662

MSTBTKNnTPAGWQALKDELYQLVNKERPEIVQIVNWA 

ISfflSPLARCLIKAREGDEWLNTPEGREEIEILSVEYIKID 
SEQ ID 2663 

. AT(rreCX3GCGTATTAGGTTTGGTCAGra 

CCTTCCATATGCACAAAGGCAAAGGGATCGmOX^ 

CAGCAGCGCGGAGGCGCAGCXTTTCTACGTCAGCTCGCC^^ 

CACATCAACACCGOmGAT^^ 

TTGCCGAAGTCCACCGCCTAGTGCGCGGTGCATA 

CGACAGCGAAGGCAGGAAATCCTATGCXX3TCQOT 

ACAATCATCGCCCGACAATGCAGCGACCGTGCCAAACTC^^ 

CAAACOTACCGCGAGGGT^ 
GCAGGCAAAAGCGTGTTGCTGGTGGACGACTCCATC^^ 
MGTGCGCTATCCCAATGTGTACGGCATC 
CTTGGG(^TTTGGAA6CCGTOT^ 
TCTGCCGAGAAATCCGGCTG(XXXX&TC^^ 
SEQ ID 2664 
HCGW/aiVSHEP^IAYTC^ 
HIimJSDSEVlMWAHEIiRREVSKNADPHRI^ 
nmQCSDRAKLSPOFEYV^^ 
AGKSVLLVDDSIVRGTTSREIVEMVRAAGARKVnASAAPEVRY 
SAEKSQCAGLKIHPSRMEHSISISDAGBEE 

SEQ ID 2665 

GTGGCGTTTCAGACAGCATACCTGATTTA 

AAAAAGCX^OCGTTPGOTPTAm 
SEQ ID 2666 

VAFQTAYLIYAVRHAFSFLDTP^ 
SEQ ID 2667

GCCTCHTOIATCCAGAAAATGCT^ 

mcccramTCATCcmM^ 

. CGCAOXXMTCCXmACGACGACTMTCXm 
SEQ ID 2668 

MGGFLLFCQTLCRTLRRI^ 

RTRIPRRRLimGVSDSIPDLC3lPKRIiFFP 
CGWftTOJTCAAAAACG(&CA(S»AATACAAA^^
SEQ ID 2672 

MLiSIKWSEHSRKSGHPDHWVAAIFKSRIJfflQKF 
TCGACGACGAC 

ggaaaccgcogccm^^ nMRMCCB0C ^^ 



rwiWMCAACATCAAAfiGTT^^ 

MGGCGGGCAAAAAAGCCGCXMTCAGGCAGGTTATGCCG 
ACGGCAAAGTTTACCGCGTCAAA!KAACCXACTATAAAiX 



SDGKKHmCKTOKA^^ 
SEQ ID 2677 



252S™ 

AAGACGACAGAMOGTCGTATTOXSCTCAWCATACCG^ 
S !2IL 2 ^L H ^JtM3LEBVSBVKKRmTPQCPVV^^ 

VPRGmDAimifiQHHimgimVRD^^ 



S mratoia^ 8 ^itotvs^^ 

ESmiKVWVIiffilQPKKTAErOTSD 



AATATGATG 
SEQ ID 2683

ATCGCGmCATCATATTGTC^ 

TGCX5GCCATTT(7PCCT^^ 

CGGAACAGTCGGCGCGAAGCTGTTCTATGGT^ 

TTCGGGTTGGTGAAAAAOGC^^ 
TCATCCCGCTTTC^^ 

CCCTTAACCTTaTCGGTGCGA6G(^ 

CGCmCGTTGACGC^^ 

TTTTOXIATTTCCAAG^ 

SEQ ID 2684 

MALHHIVEHGRIJVQIBFra 
imSQSSC<mCMiNVRFSIX3IAS^^ 
NRHALLLWGKI/3REGVTO 
PFDFQVYAVKWQVFaiFTYVFE™ 

SEQ ID 2685

ATCGACACTATGATCAAATTCAAAAAiOT 
GCGGCAAATCCACACTCATCCGCACCGTC^ 
AGTCGGCTTTGTGTTTCAAAGTTTC^ 
GAGCTTTT(XjAA(X£CTC^ 
TCTTTGACGAGOOGACTTOOGCACTCGAOCCC^ 
CCGCGMOTGCCGACC^^ 
ATGACCCAT 
SEQ ID 2686 
MDMKJTCNVHKHFKDIiHV^ 
EIJLERVGLAHKKDAFPSQLS(X3Q^ 
fflB 

SEQ ID 2687 

ATGAAATACACAGCCGCACTCCTGACOTCT 

TTTTCCATTACGGTTATTCCCAAMTCC^ 

CAA&AACGACTGCGTACT6GAAAAAGCGGAAAGGGG6G6TC 

m«X3!ACGTCWUG^^ ' • 

SEQ ID 2688 

DCVLEKABS<X3VTYYTC^ 

SEQ ID 2689 

ATGGAAA(£AACCTGCAGAAAACCGA(^^ 

TCAAAMCGTTCCGAOTAACATATTTTGTCTTATO 

AGAAAGCATACATGATAAAATCAACCGCTAT^^ 

ATmAGAAAAACTTGACGCXTOGAAAAC^GAA 

OfflSGGAATTGCAGGTTTO 

SEQ ID 2690 

MEINLQKTERLIREINRIflAQYSQ 
ILEKIiIXjHRTEYGIiKNWYUX 

SEQ ID 2691 

Afly^AGAACACmGCXEG^ 

OGGGTTTGGCGQCMTTACCGAACCTTTGGC^^ 

TCAOGAA<X3CGACG!H3TTGC^^ 

GATGTCGCC^TATGCACTTrATGTPGCAAAAAGAAGTGG^^ 

TGCTGATTGACGTQCC3GCXX}GM 

ACTGGTGAAACTOGOCTTCOGCCAA^ 

SEQ ID 2692 

MKEHKARKRFGQHFLQI^IIGDIVNAVRPQADDVVIEIGPG 

DVADMHFMLQKEVVERMVA^ 

APEKYVALSHHAD&AV 
SEQ ID 2693

ATGAOXX3UmTO5GCGGCM(» 

GCCAAACACGGCAAfiOCTTCAAACiUX^TO 

ACCGCCCTATCTQ1O3T0C 
SEQ ID 2694 

mAFGGOTGSKG^ISSBFSD^^ 
SEQ ID 2695 

TTGCACCCGACAOTCAG^^ 

CGATGCAGACX^ITCTACAACCI^ 

CTTCC«mCTGATTAAGGT^^ 

TOCGCIOCXKXjAAKXSGCaAAAAAA 
GCTGAAA

SEQ ID 2696 

LHPTLQDSAHMISRLTGKLVEKNPPQIVI^ 
SAPGI6KKTAERMVLBI1RGKLVAHTVTM 

SEQ ID 2697 

ipK^CATATCCAAOTGACMTM^ 

W5GCTTTCAOmTGACCGa(XM 

(X^TGJMGAMSIGCTTGMCTTTTJIAAGGAGAAGATMARGAGC^ 

(XyTATTCTTTCATCAICTAACATCTTCfM 

SEQ ID 2698 

UKSKLTFDATAMV^^ 

RLSRMTAAYAQIKSECAAMKNIfflHA™ 

TKFGNRKLRAVIiFMPAKVA^ 

TTTTTAACGATGCGGCGCATTTCCG^ 
(^CCGTGTTGAAGGCGGCAAGGGAGCACGGCAATGOGTCC^ 

TGGKATEILU)IQGGGIKMPKTGEWPFPFAGRDI^ 

ACGATCCTGAAmTGGGC^ 

G<XCGAA(aCTTGAACCACACC^^ 

GTGGCTTCTGCX^CXIGTCGCCGCACGTTTOT 

TCGGCGTTGACAGTGGCAGCCGTACTTTGAMGA^ 

TCCCGAAATGG^^ 

GCGTACTGCACGGOTCCGCAGCTAT^ 

CGACATCMGCGCGTCGMTACACCGTTGCCMAGA^ 

GOjGTGAAAAACGCGCCGAAAATGGGTAAAGACCAAGTG^ 

SEQ ID 2702



VASATVAARFGMTCDVYMGADDIQ..^. , 

IGLFyPYIEEENVBLVGVEAGGLGVDTPDHAAPITSGAPIGVL^ 
AVmPKMGKDQVILmSGRGDKDIim^KLKGIEL 

SEQ ID 2703 

TTGGTGAGAGACGATGTCGATTATCAAATATTCCGTGA^ 



TTATTACGGCCMTACAACGGTCATAATGATCTAGCCGACAAAG 

GAAGATOATMTATGGCGOnmAATAAGTTO 

GAATCGGTGCGGGCAGGCAATTAGTTTATGAAAMGGGGTTTAT^ 

TTATWtfGAilATEftATATO 

MTTAramOTrax^^ 

MTGGAATATCTATAAAAAAGMTTTGCAGATAAAATCAAACAACACGATAA 
GACAGCGGTAAGGOTGCCMCAATGAAGGAGA 
■JTCAAAGGCGATTACACAGTCAAAGGCGCAM^ 
TGGCAAAMTCGGCAMGGCA(OTIK3GAAATAAACGGC^ 

AATGACTTGAOTITGAACA^ 

GCCTTTCGGTACATAGTATACAAAATGACTATGACG^ 

CCTAAAATCCGGTGGCAGTCTTAAC^^ 

AACAATCAGCGTATTAGCGGTTTTAGCGGTTTC^^ 

GCGGCGCCMTTTAAACGGAAAAATMGCGTAACTCAAffi 

CTCCmAACAATGAA(mX3 



AGGCTTTAAACAGCTTTGATGCGACACGGA^^ 

TCTCAGCCTAAACCAACATAGCAAATGGCATTT^^ 

TACCACACGATCAAAATCAATCAmATCCGOT 

TCCATGTTCAAMTAAAACAGGCGAACCTAATCAGGAW 

OGCATTGCBTTATACAATCAAAACAGAAAACGGTATT 
C^AMGGC^CACAAACGGACG(?I^CAM

AAGTtMGCGTCAGCAAGtt^ 

TGCCGCACCGCAGGATTATATGGCA(mTCC^ 

CAGGAAGAACGCAAAGCG(K!AGAGTTGTTG6CAAAACA/^^ 

AAOTGCACATCGCCAGGAAG^^ 

ACCGCAGGATTATAT(XK^GTTTCCCMGACCG^ 

CATAATCCGGACTGGTATGAAAATG^^ 

ATGAAAAAGTTGAGGCAGTGAGTTTGCAACXl^ 

CACG(^TCTATCTTGTT6GATACA6K 

GCTTCCGCACAATATCGCCGGTTTAGTTCGAMOGC^ 

OTTTGATCAGGCGGGOGGCAAAAAIAOTTTC 

ACAAA(X£AGCAAAAAGCAAACTTTAA 

CTGTCATCTGCAGATTACAAGTT^^ 

AACarTTGTTATCRXa^ 

GTACCCTAATGTTACATTA^ 

SEQ ID 2704 

LVMHWOTQIFW)FAm6KPFVGAmSVKlM 

EDYMMARFNKFVTEVAPIAPTI^ 

NYGVLGDSGSPLPAFDKQKNQWVFLGTTOYWAGTC 

FKGDYTVKGANNDITWLGACTDV^^ 

NDLTFEHIRNVDEGARIVNHNTDH^^ 

NNC^SGFSGFFGEENGKCmNGAUMFN^ 

AlflTATDNAKVNLGYKNGDEVCVRS^^ 

YHTIKINHLSGHSHraYLTOIiA^ 



QEERRAAELLAKQRAEAEREAQAL&ARIU^ 
HNPDWYENDYEEIPIiDAIiEDE^ 
ASAQYKiFSSKRTQTQIGIDRSL^^ 
LSSADYKLGDDSVKVSSMAWTL^ 

SEQ ID 2705

ATGGATAAG(&TATGGCGTTAATTTM ! 

SEQ ID 2706

MDKE^ILKRLAFIRFYRFNSDNVLHFTPVGX 

SEQ ID 2707

ATGAOTACACAAAAAAATACTCAACAC^^ 

SEQ ID 2708 

MSYTKKYSTPPNRRKMQNI IAIKTVKPYESQTF 

SEQ ID 2709 

AflXXXmEGAAGGCGC^^ 
TGTTTCATCCTGfC n 

SEQ ID 2710 

MPSE(X3AAFALTKPCVSAPRCLPIX3FSRKM^ 

SEQ ID 2711 

TTGGATATGTtmTOTACGGTCJTTACGC^ 

ACTTTAAGTTTCTC^^ 

C 

SEQ ID 2712 

LIMSNPTVIiK^PLSIQHI^ 

SEQ ID 2713 

AT©3CTTTTATGCTTCTAAGGCGATGCGTGCGG^ 

CATCATCCAAAAAOGOTGTTC^^ 
GCACAGGCAACGCTTtmAAGCGGATGCGGATTTG^ 
CGGG(XOTAMGCGGCGCAG6CGGC6ATCAAATCCGCCGG^ 
GAATG(XX5GCGATACAACTGTTTTAGCCACX^^ 

TGCGCGCCGCCGTATCGMCGATCAGAATATCC^ 
TGC6AAA6ATACCGT6AT6ATTGTCAATGCCCAGGG^^ 
GTGGTTGTGGAAGGCATCAGTA'TCGCCGGTATQACGGGGGCGAA 
AAGCCAAACCTGCTTCTGAAGC6AAA 

SEQ ID 2714 

MAFYASKAMRAAALAAAVAIALSSC6KGGDAAQGGQPAGREAP 
AQATLAKADADIARYKPLVSAIMSKQEYDAAWAKRSAEAG 
IAADGmVGIKFDDGTWEKGRIiLFADPT^ 
VWEGISIAGMTGAKKVTPKEV^^ 

SEQ ID 2715 

TT6AAAATGATAAGAATGCAAACGAAAAGCCTCAGTATO 
AJmOCGAACCACAAAAACCGGOTC^ 
GGMfflSJO



GATTTGTCGGGCXK 
SEQ ID 2722 



ITCCCCTTCGACGGIA 
FGOTAVTHVIWIiERYI^
EEWQDKFGADDDDIMeH)YDVBVVYV^ 

SEQ ID 2727 

TTGAAAGGCGGGGCGAACGC^ 
CAGACGGCATTATGATGAAAGGCGTT 

SEQ ID 2728 

LKGGANAGSRTGAAGQSSPPISNKGRLKG^^ 

SEQ ID 2729 

TmCGCTCAAACOTCCGACTTCGAT 

TGTTOTACCCTGAinTCCCCATT^^ 

ATTGTTTTGGGTTTGATTGTC^ 

GCAAAGGCMGATTATGGAAAGCGGCCTGCTGATT^ 

AAAAACGCTGGCCATCGMGTCGGTAIIOT 

AATATCTCC(K£TCGCTGCT 

SEQ ID 2730 

UliKPSDPDILPKHPKAVIIGmQPAIMPATAWLPSKLI^ 

IVLGLIVHKVLGSKTERPT^^ 

NISGSLLATYWAAKAGKHKKTLNAPHHNAV 

SEQ ID 2731 

ATCK^TGCTTGTTTTTTC(^ 

AGOT^ATATGOTCAAC^GGGGACGG^ 

GGGGATAAAGTC£A!TTT(XX3&C^^ 

SEQ ID 2732 

MDACFFVIPAQAGIRIWGIVFKRSGRIIi^ 

SEQ ID 2733

AAGGTGCCTAT<mGTTT^^ 

CCACAGCACCATTGTC6TCAATGCTGCC6AA6CAT^ 

C^TTTTGCGACAGGGCAGGTACGGCGGTTGCC^ 

CGTATCTCGGCCCTGCCATCAGTGCGGATGC^^ 

OTTGCCGACCTCTACGCGCTCGCXXXjCX^ 

G(»GCGACamXff ATGG^^ 

SEQ ID 2734 

MKTITETLNLAPKGKNFLTAEWPAPANVKT^ 

IJCDRAGTAVAAAHAGWSGLAG(MiQNT 

GATGRUASUWUXaiAV 
SEQ ID 2735 

ATGAACAAAATATTCCTTACTGCQXiAGCCTTGGTG^ 

AGGCATlPGCAGTTTCCTTTGGAAACCXiCGCTGTAT^ 

CGTTACCCGTGCGGCAGTCATCAACGAATA 

TATGCCX3ACAACGAAATTTTGGGCAAACAGGA 
SEQ ID 2736 

YADNBIIXKQEEEETLWAEMRQDWABQIVRRLTFLKAE 

SEQ ID 2737 

ATCCfflCCGATATGTGCCGCCAO^ 
AAMTTTCGTTGTCGGCATAATCCAA^ 
TGACTGCCGCACGGGTAACGGTATAGGTTTCC^ 
CAAAGGAAACTGCMTOOTGTCC^^ 

SEQ ID 2738 

MM»ICAATSFRI^KGQAADNIiFGrai»PHFRPQGPOT 
QRKLQOSAFDWAPVGQHBR^^ 

SEQ ID 2739 

GTGGCGGCAC^TATCGGACGCATCGATACGGAC<XXXX^ 
AGCAfiGGTTAITCTCAACCGCGMGCCTATAO^^ 
GCXmCGGAAAACCOGGCAAAAACGG^ 
AAATGGTTTGC(XXAaTGGC^G(m 
ACGCATTGGCACTCTTTGCCGAGC^ 
(XJCCGTTGCCAATGTTGCCCGOT^^ 
CTGCTTCTGTGGG0GGTTC(XX5AA^ 
CGCTOCACOCT^^ 
CAAACAGCTTGTCGTGTCGCTGGCGGAA 
SEQ ID 2740 
VAAHIGtlDTDAPIiKPLYVIHGEH^^ 
KWFAALAAKGEVWEAKFVGAAALW 
IiU^VAHJVRTLIRLAAAI^ 
(yTTTCAAAGGCGCGTCCGTATCGATGCGTCC^

TTTCTTOTC 
CAATATCAAA&GATATTCGT 

SEQ ID 2742 

L6ITALFRQRHEKI1FBYRPCVAV 



SEQ ID 2744 



CCGCAAACCCACTAACGGAG 
SEQ ID 2746 

LYVQENMmKRGiaiCKDWV^^ 
GAMTCMTCmTCGCGGCAAAACA^^ 



j£IFIWIFHI*YAGlMPI/GIPPKGAAGA^^ 

GGCQGCAAGCAGGATGGCAAIKXXX^GGAATCGACTATCAATAD 
GGKQDGRRP6IDNQYDNE& 

===== 

CGACAACGACAGGCTGCMCAGGMTTTGCCCWSTTATC^^ 



TOGWtfXGATGOGOGMCAGGTGCAGAAIPAKGAA 

GSSGBSLDFI^EUffiEANTLGS^ 
SEQ ID 2756 
AATGGATAGCC<XXOTAGCTGCOT^^

CGAAAAAGGTTCGGAACTOXXSGC^^ 

TOjAAOTCCACOSCTGATGCTGAACCWjSGCCTCGGTTOT 

TTGCCGATATGGTO3TTTATGTGTCGGTGTATTTW 

flOQOHOCIBMIOOCMBtlBQGBCRMODWOQCM^^ 

GGCGOXXKGMGCeGim^ 

OSMflCOOCOOOeCWmG^^ 

ATCAAGTTCGAACACGTCGATT^^ 

CCACCATCffH^ACCTGCT^^ 

jf^Q»TACCTCGCTGCTGCACCGTTCCG^ 

fflOCmOCQMGOCW^^ 

CACCA' 

OGCIKXHTCCTCAGCGftACACQTCGAGTGQCAGCRCQftC 



J^^sL^DPYPEAAP^^ 

VMPHRUlIJIQSimQDmGKVSAK^^ 

GAREAWAKQSHEHWTVRAQMHATIAH^^ 



GGFLSEHVS3QHD 

SEQ ID 2759

CCTCGTCAAACCGAC^^ 

SEQ ID 2760 

mfer™qvipvfkvemqsqfygfhytytiaw 

SEQ ID 2761

ATGlTOKaGCTTAAAGA^ 
aXKGAAAAAGCACCAAAACnTTMa^T^ 
TGTT 
SEQ ID 2762 



C«3ACGAAAACGCCAraTGCTCGCT^ 
CCAATCTGTCGGAAAAGCAGC 



SEQ ID 2766 

AGGAAKTA'ICGGTCTiaTGAAGGaXSTCAAACGCTACW 

GTGWaAACmCCGAAGTTCTCGA^ 
CGATTCCGMCCGAG(X^CAACTATCGMACAGGCGCATTAC^^ 

AAGOCGAAGCGGTT 

S ^^A«LGSI£QmrVNSI 
RLVRVATTKPQRKLPFffijRSMRKN^^ 
LQDDGGLTIilQLAAEYGVSAERIRQIERKMiQKLRGFLTBEftEAV 

GGCAGGAGATATACGCTAAGGTT 

SEQ ID 2770

HAGSAKALFIVreiSFRIiKLMGGLVCWDAVYHCIACILyVWQEIYftKV 



KKTGSSWVWWILTLAMjILFIFRHKEH 



TCOTTTGCCGACCTCT^ 
GGXXSTGMGTCGGTTTGTTGGG^^ 

C<SAGAMGGCGGTTGCCfiGTGTMCCATGT(K3ATGCCGCC^GG^ 



CTCAACTT 

SEQ ID 2778 _ TMWT „ n , eft T 
LmiMAFSFRVGLTRADY^ 

SEQ ID 2780 

I£DPIJiAQHWNAOTEYSmVVGPLS(^IACAPPCR 




iTGACTGTTTTGGTGTACG 



GACTOMCXSGTKGCAGAAMCATCATCATCXjCG^ 
GAATCGTCGCCGTCAICGTO^CTCGACGACTTGQGCATGCIGC^^ 

ATTCCGCCAACCTCCTTCTCG®^ 

SEQ ID 2784 

U3TPITSGIIASEIGTAP8KPHQLIYAFSBRL 
GGCWSSjGOTTTTGGCGGTTOACCTPGCCGGCA^

(^TAICGGGTCTATGGGGCTGAAAGTACCQCGT 
SEQ ID 2786



FmCIGVBHiFDPFGmVSPAA^ 

GCOTCTCCTATATAATC^ 

SEQ ID 2788 

KKI£SKDCNTAFSSaETUWWffiIiQTAFSCPHP 

S 5g\^!ctac^to^^ 



ACCAAACAGACAAAATGCCGTCTG^^ 
PATHSDADmVAmSHAFTO^SVTEKD^ 

S aScSScacocc™ 



OCGCTKXSMAOa^^ 

GUfflWAlfflLPHm'imi.V^^ 
KXXXXnTOATTTCTGCCATATCGA^ 



GACAAGTA/W 1 A * WWVWWJU ^uv.\-vuin ww* - - - ~ - - - - — ~ - - — ^ m mid * m rrvf/i/i/W* * * * «mi 

TGTO\AAAACGSnTGCOTTCMTCOT 
^AWPEHSOTJtMDFVPTliEKPESKPI^ 
MTrcOCGCTTTTATGA'TCATGC



TTTATCACCTGAMTTIHXATGAAAATCCAAKOTTTTGA^^ 



AAACCGCGCGCGCGGCGTGGAC 
SEQ ID 2798 




SEQ ID 2800 

LQERCi^YATLWQDI/SIPSBKPWEFIYHAVF 




IiERVKEDIAEVAQIESFPKMEGR^^VMMIAPK^ 
SEQ ID 2806

VMiIfiI*VFVFTVIHQFA^ 

SEQ ID 2807 

JTCCGGCAAWUUICOGMGC^ 

GCGOQAAJUAACGCTra^ 

GGTAAATGATCGCGA!TTTGGCTTTO 

SEQ ID 2808 

SEQ ID 2809 

TOXK6ACCG6GGC66AAAAAATGCCGTCT6A^ 

CCCC^raSTTT^ 

CTGCCAAACOAGCTGGCCCAAGTCGAACAGA^ 
SEQ ID 2810

SEQ ID 2811

ATGAGAAAAACCAAAACCGMGCCTTGAA^ 

AGGAGGTTOTGGACGGTATTCC^ 
AACGCCGCCGTTATCGCCATTCCCCGCAAGCATCAGGC^ 
(XMTOTOTCATCAAATCGACGTT^^ 
AAACXATCCCTGCC^^ 

SEQ ID 2812

MRKTKTEALKTK^ 

NAAVIAIARKHQAIWREnTAVLTEAVE^ 

SEQ ID 2813 

TTGCTCJITmCCAAGG 

CTAGAAATAAAMGCAGCAGGAATTTATCX3GAAATAAC , T . 
SEQ ID 2814 

LLLFQGPDSSIiRGNDBSIRrETCTTSFM I 

SEQ ID 2815

GTGCGAGGCCCAAGGCACT^ 
ATAOTTTCAGT^ 

SEQ ID 2816 

VRGPRHSHLSVICPVKERCEIimFRLSn 

SEQ ID 2817 

ATG(XACGCGTAAMCGCGGTGTTA0CX3CCCGTQ^ 

CXKjTCTGAAACGCXXICTCTATCGAAATC^^ 

SEQ ID 2818

MPRVKRGTORARHQKFAIAKGYRGR!^ 

SEQ ID 2819 

TTGCGGAMTTCTATGTGATTGATTTTC^^ 

SEQ ID 2820 

LRKFmDPLSLKSmJIKPALmKFIQW 

SEQ ID 2821 

TTGGCCXSTGAAGAAAATAAGGTCGTCTGAAGAGTCT^^ 

CTATCGT^GA^ 

GCAAATGTCGCTOAAGAGCGCAAAACCATAGGTGCGCATATCA^ 

CTTGCCGCCGAAGCCCTCt^ 

TGGCGGACGGGCCTGAAATCGMGACGATTTTCAC^ 

TACGOMCTTC^ 

CATCAGGCXX^GGGTTOTGGGTASAAGAGGGCTAACTTTT^ 

CTOCITCCCGOTACCGAGCCTT^^ 

(XCTGAAftAATATACCGGTTTCGCrrTC^^ 

GAA 

SEQ ID 2822 

IAVKKIRSSEISDaSGYTGGLWSGGia 

LAAEAlDHIiPGR^BGGSUffTO 

HQAEGIOTEBGV^^ 

E 

SEQ ID 2823 

TTGAATAAATCTCAAATAMCCCM 
CITOGGGCTCAGGTATCXMAAAAAT^ 
ACAGAAAAMGCGOTAATAAAGGGAMTCGTGAGTTTTGGATT^^ 
TTTTGGAGTAACGATGTCGTCAAAAATACAACCT(^^ 
FVJSNDVVKNTTCCAEKVKEn^

AOCTTATCAAAA 
TLSK 

AAMTCTGTATTGTACGATTTCCWATGGMMTCAGAUGATAGC 



^^^^^^^^^ 

AGAGAAAATATTTTTC&QAGT 



^^^^^^^^^^ 

AGGTCGGATTGCGTTTGGGTCGTCTGAAAACC 

===^^ 

"" ,, " Tmn ™ ' " " - * ;gglwygaampbqwgek 

TRNADPYDIlWWENLLKmV^^ — * 

LVQBISVPIWRGXGI^BGMKSVAVKW 

CATTGOTATGCAOGCATOaAAAAACTGACGTTTGTC^^ 

S Sa!S^^ 
SEQ ID 2836

l^FKAEI^ILV^^ 

SEQ ID 2837 

ATCACAAACAAWTTGATTTQA^ 

TAC(£TTCAGGAMTC(&AGA^ 

SEQ ID 2838 

ffiNNVDLTIPAKRYFT^^ 

SEQ ID 2839

SEQ ID 2840 

LRDFTIJRIiEAHGPVIIYRQIiAHNPVRAVOCI^^ 

SEQ ID 2841 

ATGAGATTITIXX^ATCGGTTITr^^ 
(XSGTGTGCTGATGCTCAGGCATACGGGGCTGTO 
GGCGGTGTGra^TGAGTCOTGGATTCGTATCC^^ 
CMTCGGGCAGAAAAGAGGGATTTITCCACGATGACGATATTATC 

SEQ ID 2842 

QSGRKEGFFHDDDIIBGEY7VEEPYGGNRSRNAIEHKKDB 
SEQ ID 2843 

(X7IGCA!KX3G(^GATGCT^^ 

SEQ ID 2844 

MSVVroiVTORKVEIYHKYKKPCLVIiAVDPVM^ 

SEQ ID 2845 

TT6ATTGAAQCGTT6G6CGGA6GCTGTGTGAAA(XjGTA^ 

GTTmC<XXXX3AAACGGATGGAAA» 

ACATCATCATTACTTMTTTCGACC^ 

SEQ ID 2846

LIEALGGGCVKRYWAIiGRMPIRIiEECET^ i ; 

SEQ ID 2847

ATGACCGATCCGCTTCCCGTTTATCCT^^ 

ATCATCCCGTTTl^^ 

GCAGGGGCTCGACCffEATTT^ 

GTACGGCQCGGGTATCAQjGCGATACTTGGAACGC^ 

ACCGWTCGACGA3t^TGGGACGGGGCGGAW^ 

GTATTTTTATCATOCGCAGTATCTO 

GAA(^CGCGGAGGTCGTGCCG{^TATTATGTGTATCGGCAAG 

CGGGCGTGTTTATGCACGGOCCGA^TTTA 

ttTCTTAAAAGOOT^ 

GATTGT6TGGCQCAGGGCATTTGGGTACGCCCX5T^^ 

AG6AACACAGCAAA 

SEQ ID 2848 

MTDPLFVYPVKRAEGVFlEIADg^ 

VRRGYHGDTWNAMSVCDPETGMKHircSAI^ 

HJAEWPDIICIGKGLSGGVOTjAAAITSQKVTETISRGEAGVF^ 

owcamPFGRLwuippnisre 
SEQ ID 2849 

TTGAATCCGACGAATGATTTGAAAGCGTGGCAACA^ 

GCTATGCGGCTTOGACQGOGGCAAGCGTCTGC^ 

(XSTCTATTCTTTGGTTCACGAC^ 

CAAACCCMGCCTTCGACXnX^^ 

TCGATTTCGCAAATGTCGGQUUVCAAATGGTTC^ 

GTCCGACGCGGMCTTGCCGTATTGGACGOT 

AAAGATCCCGACAACGACAAGCCGACTTATGTGAAA^ 

SEQ ID 2850 

UJPBTOIiKAViQQR^^ 

(^AFDVLSW>TELPAARQIJ^^ 

KDADNDKPTYVKLMGLE^ARSYAHKLVAEAVALI£ 

SEQ ID 2851 

TTOTTTOCAAGAATGCCGGAGAAAA^ 

ATGATCCATGGCATGAAGATGATTT^^^ 

ACAIFrTTGAOCAAGACAATGTTC^^ 

SEQ ID 2852 

tfARMPEKHAVHRHVLtt^ 
SEQ ID 2853

AMCCGGTTOXAAGACATOKX&AA(^CATC<^^ 

GmXSO^CG^GCTCTmCCATOATCAra 

TCAMCAATACGGCATTCGCCTXACAGCCKXM^ 

CGiiGCGGAMGGKreGMTOjGCAAAAAC&GGCGCGGTA 

SEQ ID 2854 

MKGWFVSGIWDIGKIVMQOiRKl 



VPLTEKLIiTIDHIQQQHPVILVTSGRI/jSINHTIiIiSFVVL^ 
CCGAGTAITOCTGCCCGAACrGMACGCTTTATCGGA 



ATCMATAMAGAAMAATAAGTAOGMTOSTTTICT 
flAACAAGlMAAAGCCTTCTGTAGAAATCGAAAA&TGGGGGCTC^^ 
GCATAACAATAGAAATATATGCCMGTAGAAAIAAAGAACCAWjAAGATGGTCTAAAUT^ 
TOCAAGGGTGTAAGACTTATATGAAAGAGGGTTATGAATCT 

FQGCKTYMKEGYBS 
cpn in 985S 

M^TCTTAAATCCOCTTTATTTTTAOGACTGTCC^ 
TGTGGCTGGCITCAGACGGCATTCCCGATTTGGCGGTATTGGTO 

AATCGGCATCMAACCTCCGCCGTCACGTTO3GGCGTO 
TCTGGTTTGCCGCTATTTPTGCCCATAC^ 

A1WIMPIVLIiLQ«l8HAimVRQI» 
SEQ ID 2861

CCGTACKX»HUiCCC(mm 
GGCARGTTTTCCCAAAAAA(X^TCAGAGAATCGCTGATGAC^ 

GKFSQKSIRESUfPVASAEEABAIliTEB 

AICCGMXX3GTTO3C(XCTACGGATGAAC^ 
AAGSCKTTWAAAAOllWGCATGGGACAATGAGGATTOGCGTT^ 

SEQ ID 2864 



MLFWENLPA 

KISPIRLISFSFQTSQHBRIVPTAGAHIiNPAYGTG 

SEQ ID 2866 

IiLVySMVCQKYRRWEGKAKMRYROTTVA 

rfATTATTIXATTGTGTTATTGCTGATTGCCTCTGCK 

TCWSlGOTGCTCGATGCCGAOGGPPrGAT^ 

mrasjQOT(&TTTCCGCATTMTGCGCT^^ 

TO^MTCOTTTGTOXKmCATC^ 

MTCGATCCGOTOT^GGCATCKa^ 

ATTQGTTATTTHOTACTGAAAAMTCGTQjAAC^^ 
GATTWffTTCGGCAGQCGTGGlGTTT^^

AAAATCGAroTTGTTTTTA^ 

ACmGGGACTTOP^^ 

TGGGCGG(^GTGTGTTGTraTCG6TTTTATT^ 

G(X£GGCTACGCGCCCGAAGT^ 

AAAG&TGCGGGQTPAGGCACGCTGATTTC^ 

(^CACCCACATTCTATCCGGTGCCT 

SEQ ID 2868 

MHSIYFFKEKQMSQTDA^^ 

KSG1JSAD4RLLLTKSPRKLTTFMVV?TGIL^ 

IGYFVTEKIVEPQLGPyQSDLSQEEKDIRHSNEOT 

TLGLYLVIIFFAAQFVAFFNWTNI^ 

KDAGVOTLISflfliPYSAFFIiIW^ 

SEQ ID 2869 

ATCKXX^ATCTCCGTCCGCCGCCT^^ 

TCGCCC^TCGC^CCTGM 

GTTCGATATTTCCA!^^ 

CI^TGGATGTGTTGCGGATTTACCTGCAAC^^ 

AGAGCGAATTGGCATTGGMCTGATTTCGOGCGGCCATi^^ 

TTTTTTGGAAGTGCGCXK^TO 

(XXjTACGCAATTACATTTTC&M 

SEQ ID 2870 

MPSISVRRIiFDI^YKI£IAHA^ 

LMDVLRIYLQRTLAASSVKH 

lKQU3RIiSIRTErPESIUlVNVRSVTLm 

SEQ ID 2871 

TTGCCCGAGCCGGAAAGCCCGCTAATCAGGAOGAOTTC 

ATGTMTTGCGTACCGCCGCCTCAACCAAMC6G(^ 

GGTCGAGCTGCTTCATATACTCGTCGTCCGCCTCG^^ 

GOGCACTTCCAAJUUUkTCGCGCAGC^ 

AATGCCAATTCGCTCTTACCCAAACCAGAATG^ 

GCMCACAKX^TCAGATAAGGGCTTTCGAGTT^^ 

CATGGAAATATCGAACAGGTCTCCGMCTGATAACCC 

TGTCCGACCAGGCXXym^ 

CGGA6ATACTGGGCATATTTTATTCTTCGGTCA 

SEQ ID 2872 

LPEPESPURTIMLPWFKLR131AFQKFARWPM 



HGNIBQVSEWOMiPRF^ 
SEQ ID 2873 

TGT(X5TACCATATCGAACGTG(XK^CGAAACCGMTO 
GGTGGMGTTTTGTTTGTCGAGGCGGAAGAGGCCGTGC^ 
CGO^TGGCTGTTCCC(X^ 
TGGTTTTGGAGTCOTKXjGGTTCAAA^^ 
CAAGCCCGTTTGGGATTATTTGGACGGACAGCCGTTG^ 
GTCGCCATCX36CTGTACCGGCGGGCAGC^^ 
GCCGC 
SEQ ID 2874 
MKmiSGLSGSGRSVMiR^^IiGYPCVDNI^ 
REWLF pLKEIAYCIDTSKMNAQQIi^VRQWIiKVERTGL^ 
VMGCTGGQHRSVYIVSKLARRLKGRYELLIR^ 

SEQ ID 2875 

TOACG(^TTA(K3GGCCCGAC^ 
GTACAGCCGATG606ACGGTAACGTAGCTCCTGCTO 
AATCCCAAACGGGCTTGTCCATACCGGTGTAAGGCOT 
GM«^CTCCAAAACCACCAGCAGTCCOT 

— OTCOGCCTCGA 



CAAACAAMCTTCCACCCTGTGTC^^ 
ipfCGATATGOTACGACAttMCG^^ 



SEQ ID 2876 

LTCIAJfcGVLRLPV^ 
BGLQNHWSGTI^PLPIXMAEIiWIH^ 
FIMVRBQRRQHFQGQVVHAKIAQIFH^ 
SEQ ID 2877 
MXXEGTCTGAAACX^^ 
SEQ ID 2878

MPSETPSAVSDQISHIEKKBQTOQSHGPPAI 



IPNVGKSTLIatv*** *- _ iwiii^meci 

GAVLSGGKVKXQKAAENILTOHiH^GMT^^ 



SEQ ID 2881
SEQ ID 2883 



GFAVGVKHLTMVQIAQAAFQDIiPVRRYGWKRGQFGAV 

gVERSCTiKNIFQTAFFHLPKASALCALIiFWLHEWL 
SEQ ID 2887



CMWGGTCTGCTGTTGGGCGATMGGCCGATTACAGC^wra 




ATATCCGCGAWTtmKAC^^ 




TOGOLIDIHGQiniHHSIiHQ^^ 
SEQ ID 2890

LLHTSNRAFSSIiPYLTGGT^^ 

SEQ ID 2891

GTGTTTTCGATAIPrraCCGTGAGGA 

SEQ ID 2892

. WSIFYREK?YVYU?MIFCGRDGMRQRLKESGSAF 

SEQ ID 2893 

TP(^TCA6(XaU^CCCAGTGCC^^ 

ACCACATTT(£TGGCAGACC^ 

A(XX^TTGCCGGCGCATOA^ 

AAAATCTCGCCGCACGACACCGGACAG^^ 

ACCTCMTTACAGCCGCACCGAACAA 

GAACACGCCCTACGCCGAACCATTATQCGG&CTCTiOG 

ACGTTM(X(KXATTTCCCACGA£^GAG 

ACCTCGCCGCCCTACCCCT^^ 

CCK^CTGCACTGCTACGGCGCGAGTTTTATTC^^ 

TTGTTTGCCQCCAACCCAAACttX^^ 

ACG0CAAACTTGCACT6GACAAAAACTACCG(^TCG^ 

SEQ ID 2894

fcDQATQOiQFDSIl^^ 
nSPHDT(KJTEUiIAGYGyTKRI^ 
TLTGISHIXjQWIASTPNG^ 
LFAANPNPQKHQGKWVIRCDSPIHjPWGAIiGDIAAMR 

SEQ ID 2895

GTGCCTTTAAACXXXIGCGTATTTTAACAGCTTTA 
CGACCAAACCATCCGCAAACACGCACAC 

SEQ ID 2896 

VPIiNGAYFHSFIPIPIFKPTIdT^^ 

SEQ ID 2897

TTCTATGCCGAACCGAGGTGTT^ 
TACGCGTCTATTCGC(XAGTGCGGAAGTGaK^ 
AATCAGTTTTOGGWXJGGCACGACAGCGGTCi^ 
TCGCGTTTTGAAAAGCAATCAGATTGT 

SEQ ID 2898 

LYAEPRCFLNNIFVEIRLIjKNRAV™ 
SRFBRQSDC 
SEQ ID 2899

ATGGGOGGACAAAAAACGCATTTCGGOTTCAG^ 

GX^GGOTGCACCGCCT^ 

GGTCGGCAAGGAAGGCGAGGTTTGGCTGACCGATATO 

CTGCCTTTCCCXGACMTTATTTCA^^ 

•K^TTGGAGTTTTC^^ 

<X3CC6AA!TCCATCC6TATCCA(XXX36ATCA^^ 

GTGAAATTT 

SEQ ID 2900 

MG^KTHFGPSTW^mGKVAEVraSVAKN^ 

IJFPDNYFNIjVSVAFGLRNMT^^ 

VKP 

SEQ ID 2901 

TTOTTTCOTGTTOATTCCCT^^ 
ACCGCTCCC(XaCQ(^TTCGAQGATCT 
TGGTCGTGACX^GGGGGCGT^^ 
R£<&AACGGCATATCTTAAC^^ 

SEQ ID 2902 

IiPACLFPWMWPnPPHBTRBGENEVQADD^^ 
CGTAYLNKNGSL 

SEQ ID 2903 

ATGOTTACCmACCAATCCAAOXOTCGM 
.GGATGCGOCGCTACCTOUICACCTCOT 
GGAACTCAGCCCGTTTGCACCC^ 
GGCAGCGQAGAATCGGCGGATTACCAGCTTGCGGGACA^ 

CGA<KGTTATTTCGTGTTCGGCATTT^ 

GGCAACGfTCATCGAAGCGGCGCAAATO^ 

ACGAMTC£AAATA(MGAAGAAACG(XX^^ 

CGGTA(XK^TGACGGCT(^TATGCAT^ 

ATCGCCGTATTAACCCCGAATATCGMCCCTMACGC^^ 
GQCAACCGTTTTTTTATGCG^

GGACGMTOXXE^^ 

ATATTGGGCTC^^ 



CGGATATTGCXX3GGAT.-^, ^ -~ - 
CGAAAACGAMCGGT^ 

CGACATCX^GCAAAGACGAAGAACTG^ 

0CXKHK3CAAGCCTTCTCGCGC(^ 

TOACCAACOX^^ 

CGGCGM<XCT^G^^ 

GCGGAAAGOTGC^^ 

CCTtMAOTTGCGCXX^^ 

GM^TamrmmTATCGo 

OCCAAACCGCCTftTTAOTGCAGCAAAG^^ 
SEQ ID 2904

CSGESMHOU^^ 

GNVIEAAQIIjKGGGDPDLTQAGHPIiLASLGKQGRDPFM 

IAWTPNIBPYTPFIEAVFGQAQPGAQAI^SVSDVKISRSQPFFYAL^ 

ILGVMiPEGGNI^DVSAWYADVNOTAMFGW 

FLROSITFCSMVPMRSLPF^ 

PLQAFSMIYFQEGG^ 

EWKftFm<#NRPIJ>FFAK^^ 

SEQ ID 2905 

Tfl^TTGGAATAATra 
MTACAAAGAAGAAATG<MGCCAAA^ 



OCAATCCCGAACCCGACCCU^ 

MGGAMGAAGGCGAAGACGGCGGGCTTTCGTGC^ 

GACGATAAAACATGGTCTTCACATAAOTOT 

CXXSAAAAAATCCGGTTTO^ 

SEQ ID 2906

PRPBLOTASAEAPHAQPLPEVSP^ 
mTWSSHHFLPSNGVCPQPKTFHVFGRQYRASYEPlOT 

SEQ ID 2907 

ATGCCGGAAACTAOTTCAAACAGATTTCX^TCC^ 

GaGQGAGCTOfflCOBCaAGa 
CCGGAGAMGGCAGATTTTC^^ 

SEQ ID 2908

MPESIFKQISSDnWflR^^ 
raGRPSVGDKEVWISRNPLVRALYPVGYSHRQAVLQI 

SEQ ID 2909 

atcc^ 

CKXaACACTO^^ 
AA(^AAAACGGCTGAACCGCCGTATGCAGCTOT 

SEQ ID 2910 

MPSEHRFQTAFFLCRQHACOT© 
pSSHBQOSFHLQSTKYLPQIjPPPRVRmQmCR^ 

SEQ ID 2911

ATGAACAACAGACOTTTGC^^ 
CCTCACTGTATATGACCGCACC^ 
AGCCC»TTTC(K3ACGC^ 
CA03AACGCAGlTOX?rcA 
CGGACAGG 
SEQ ID 2912 
MNmSFAVIAIiGSNIiDMPAQQIRGAIJ) 
HERSFVIRPIAKIIjPDFILGKYGKVVEI^ 

SEQ ID 2913 

Am^CTATCTGATTGTCAMTAC^ 
CTACAT(X£ACTGGGCATGGTAATGAT^^
CTGCCATTC 

SEQ ID 2914

IQYLIVKYSHQIFVTITILVFNIR^^ 
LPP 

SEQ ID 2915

TTGATACGAGCCGACCATGCAGTATCTGATTGTC^^ 
TTGGTCGGCflTreGAAAGCACTGCCC^^ 

TGCCAAAACCAAAGTCCTGCCATTCTGAA 
SEQ ID 2916 

LIRAEHAVSKQIQPSJ^YIffl^ 

C0IOSPAILKHCHBQQTPCKHRLGQQP 
SEQ ID 2917

TTGCCCmXrOATCX^GGACACGAAW 

GAATAGCCCACCCTArcFTTTGGCATTO 
SEQ ID 2918 

LPSSDIDTRTANRRQKPSDN^ 

SEQ ID 2919

ATGCTTATTGCCGTCCTCTATCCGGGACACGAA^ 
ACXXXMGAATAGOCCAOCCTM^^ 

SEQ ID 2920 

mliavlypghengkqbaeavg^aknlpqeqfavlrygf™ 

SEQ ID 2921 

GATTCGAGGATGTAGGCAAGAGAGGCGGTCAGCGCGAAACC^^ 

SEQ ID 2922 

FPVAKVPKIPPHETREGQfEVQADIXX^^ 

SEQ ID 2923

Mmxmnaci^^

SEQ ID 2924

MTRCRLKRSDGILFAGGIiPEKLYKSASliSP I 

SEQ ID 2925

TTGCTaXXMTOKX^^ 

SEQ ID 2926 

LLRIRGAAALtfGHPLALIANP 

SEQ ID 2927 

GTGTCCCmTAGAGGACGGCAATM 
CAGCCGAAATTGAAAACGGCPGCATCCAGOGGCTCCXX^^ 
TGTTCAGGGCTTGCGGCTGGATGTCGAATGCC^^ 
GGGAAGTGCCTCCX^GCAAAAGGCMrcGGCGM 

SEQ ID 2928 

VSRIEDGNKOTFFQQGEGCSKGGNAGFLVGGQIL 
GKCLPQKAHGMCD7FQ 
SEQ ID 2929 
TTGAAAAACATCCTTCCATTCX3C0CATO 
GCAACAGGGGGAAAGTGTGGGCATTCGAC^TCCAGCCG^ 

SEQ ID 2930 

LKNILPFAHCLLREALPEG<2I^^ 
SEQ ID 2931 

AT(XXXXXjOGGAAJ&CCQCC(^ 
ATGACAGC 

SEQ ID 2932 

MIXSGKPPKHKKIGKFSYSRLQFDTHRASARFHN^ 
SEQ ID 2933 

GTGGAT6CT0CAGCAAC6CTCCTGACATACAM 
ATTTCACAATCATAATAGC(X3AGTTAAAAAATATGACAGOT 
TATCAMTTGCAAGGTCAGGTCGAATCTTTCG^ 
AACCTCCAACATGAAAAC^MCCXXAAGO 

SEQ ID 2934

VDAMTLLTYKCP AENRPNTKKLVI F PI PGYNSTRTGHL PDF7I I IAELKNMTAKGOMLQDPFLNALRKEHVPVS IYLVKG IKI^GQVESFDQYVVLIMI^VTQMVYKHAI STIVP ARSV 
NIjQHENKPQAAPASTLVQVETVQQPAE 

SEQ ID 2935 

ATOTCAAACCACAAAATATGTTCCGATWAA 
GCTGTOTTTCAACAAAATAACGCCAATACTCAAAAATAC^^ 
GCGCGMGGTCGTTTTGGftTGCAAACITOa



CGAGGXaGGCAGGTraATGGWGTCAAAGCCMCATO^ 
ATGCTGCAGCAACGCTCC 

TQGLIWSQSAtVVNNK^IL^^ 



S T^wSmCATGGCTTGCTmTC^ 

SEQ ID 2938

LVQVKHGLLFP3SDAELVSFSGKPPANKMPSERPBRHEVI 

S S5LgCG«^ 

SEQ ID 2940

LBEARYKBGYIUjLKMGCIJSIYTRIGHSHQA 

CAGAACATCTGCCTTTGffKKTGTATTTO 

TGGAAACGCTTTCCCGA 

S !^^ra^GSfflfflmYmiJSSSQADFHSAE^ 
KKLYGKESI/jIERSTFVLNKDGEIVRiSffiKVKVAGHAQ 
SEQ ID 2943

GCCGGCTGTCTPTGTGCGGCAGAACGCTGAAGf^TGCGG 

CCTGTATGTGTGGATGGMOGGGAAGGGATGAGG(KMGACAA 

CTKraSCACCGGATACCGACACGC^ 

AGGCAGGAGTTGCGAO^TTCITICTCAGTCAGAAAA 

ASCCTGCGCCACGCCTTTGCCACGCATCT03TGCGGC^^ 

TOMGCTGCATAGCGTGGTTAAGGAACACCATOCCGAAAC 

SEQ ID 2944 

fcVWNAMENGLIDRI^^ 

SLRHAFATHLVRHGIiIMVVQEMLGHA^ 

SEQ ID 2946 

HFVaCHOTD^IKETI^ 

GGCA3TITO3CTGTTTT^^ 
GIFAVFGRT 

GGCTGGAAGTGGTCGATGTGGATGAAMTC^ 
GAAACAGTTTCTCGAACGG 

SEQ ID 2950 

VCKiraCETRmTLMFRBYCS^ 

S 1S™^^TA(^ 

ggttgcgcattgtgcgccgtcagaaacgaa^^ 
SEQ ID 2952

MSSEKMIIJtSCQKQQKCI^ 

SEQ ID 2953

CCTGHXTFCA^^ 
AACCCGTATTCATCCTGAAAAGGTT 

SEQ ID 2954

VLPLPAGHQPHKNVKNVDLKKMPSEVLLQ^ 

SEQ ID 2955

TTGACCCTTTCOTATTTTA^ 
TGATG6AACCA 
SEQ ID 2956

LTLSLFYLSVGIOIMiLIPSIKIIUCF 

SEQ ID 2957 

ATGAGGCAAACGTGCCTTTTAMAM^ 
ATGCGGGTGTAGATAGATTGGCAGCCCATTTT^ 
TCTTITCGGCGCAATCGCCTTT^^ 
SEQ ID 2958 

MRQTCLIiRKGEQMKLIi^ 

SEQ ID 2959

ATGA6AATCCTATTGACAGQCnC6AAAAG(X!AACTGGCACAOT 

CamTCXy&CATGGTC^^ 

TTACAACCTTGOCTTGGCAG 

MTGTATACGGACMTCCAAAACCGCCGG(mCTGC^ 

TGCTGM(XTTGCGCGGGAACGTTC^ 

TTTCCAAGCTGCTC 

SEQ ID 2960 

MRIIiTGSKSQI^LRDRLPEDWETIA 
IMGQSKTAGEIiLMjSAHPDSIJIjRTSWIiFSEYGDNFVM 

SDEGSAAAAPRPAYSILIXIR1QEI©FGIKPSD^KAIiAQWS^ ! 

SEQ ID 2961

TT6GT6AAA6TGTATTGTA6GACTTGCTGTCG6T 

GGTTTGCGATTCAMCCTGTCGGGiraGT^^ 

CGATAAACGTATGCTGATGGTOGOTT^^ 

OT(XGCATA™TGCCCMCCATTTCACCGra 

TGGAGGC^TTGTGCGCGGTTATCTGGCAOGCAGCXaGTTGGAA 

GATTTTTACGCCCTCAACT 

COTA(^CCGMGCGGCGGAATATGCCAMTCGCG^ 

ATTCGAGCCttTTCT^^ 

TAAAGTGCCTGCCGATGTMTTCGGAAMCGGTCGAGAAATA^ 

SEQ ID 2962 

LAHIMPNHFTGDTVYDVLPENKAKAIiEm 
LYTEAAEYAKSSGIIimOTEFGLDEEGTLTLMDm 

SEQ ID 2963 

ATGTTTCAGACGGCOTTTTATTGTAT 
GGTCTGAMCXCTAACTATGCACAlTCGGATTra 
SEQ ID 2964 

MFQTAFLLYQTOLRWAFIIRN^ 

SEQ ID 2965 

TTGAAAGGAACAATGATGTTCAAT^ 

GCGAO^TT(mTTTGTTGra 

CTTCTTCAAACGCGAAGGCAAACAAAG(X5AAA^ 

GfTGTGTCCGTCGATOCCGAAATC^ 

TAAACGGCGTGTAGGTTTTGAAC^^ 

CGAA(^(X3TGATGTTGGGCGCGGT(K3TO 

ACCAATGAAGAACTGGTTGC^^ 

MGMGCCTTGATT^GAAGAAACCGACACGC^ 

CG(mCGA€ACC(mCC(mXMTCCGC^ 

ACTTCCXXXX3ACGAACAAATCATCGA 

GCCGTGAAATCGGTCACGGCCGTTTAGCTAAAjOT 

CTOTTGGCTTCTGTCTGCGG<X3GCTGT^^ 

ACCGACATTTTGGGCGAOGAAGACCACTTGGGOGAT^^ 

TGcmmswaxacaa^^ 

CAGOOUUaCAW^TOCGT^ 
ACTCAAGAAGCGGGCGATGCAGCGAAAAAACGCATCX^G 
TCAGCK5W3ATGCCGGGCAAAGACGGTTTGGTACAtt
GGATGACAGAGCXXXSTGTO^ 
SEQ ID 2966 
LKGTMMFNKYVKTFQYGN^^ 
WSVDPEIDSDIPAMLGASAALVLSGVPPAGPIGAA^ 
TNEELVAKVRGIAGETIKEAFKIRQKQARSA^^ 
TSRDEQIIDALSGEYTDRFMLHYNFPPYSTGEVGR^ 

TQEAGDAAKKRIEEITAEVEVGKVYEGTVVKILDNNVGAIV 

SEQ ID 2967 

ATGTGCGGCmTTAAWAGCGCGTT^^ 

TCGGATTMTCGGACMCTCGGCATTTC^^ 

CTTGGCGGCAAAAATCGAGAAMTCGGCAAAOC^ 

TGGCTGCCGTGCGGACTGGTTTACAGCGCATCA.CTO 

TCGGCATTTTTTCOTGC^ 

SEQ ID 2968 

MCGGLSSAFALQLPPHIMlFVnilliLLNTGRISOT 
WLPCGLVYSASLYALGSGS^^ 

SEQ ID 2969 

TTGTTACCCCTTAmGAAACACAATGTC^^ 
TTCACMCCGCACCGTCCTCTAT^^ 
TAACAG(^GGGGAATGCCCTCGTCAAACOT 
SEQ ID 2970 

SEQ ID 2971 

TTGCTTCC(^TCC<^TCC^ 
GTTCCTGCTGOTT^ 

GGCAAACCCGCCGACAGCACAGCAAAGGGCGGATTTTACAG(^ 

SEQ ID 2972 

I^PNPKSPSPKGl^ISRRLRIKMSHYTVOT 
GRTAGNCPFSP1HI 

SEQ ID 2973 

TTGTITO3AAATCCCGTATTGTCTGTOCT1^ 

SEQ ID 2974 

SEQ ID 2975 

CACXXHSTTCTCCAACG^ 
GCGGGM?(KXX3CGGCC(^ 
AAAGCGCCATAATMCGCCGAATTTGGCGTAAAAACCCAQ^ 

GCGCCCGCCATAMCGCCMCAGGATGAAACCCATATGC^ 
CMGCAGGGCAAACATCAGAGACCAGTCATGATGCACGGTTC^^ 
AGTGGGCGCGCCGTGATACACGTCGGGCATO 
TCATTGAAGGAAGAGGCGAGCACGCXXjGCAAATTCCAGCGAA^ 
CGGCTTCGGCGGCAAAACCGGAATCGCGGCGCAGGGCAA^ 
GCCCAA6AGGGCAM(^GCGACAG6GTG!TAAAACTCXX^ 
AAACGCGACATATCGTCCGCAATATACATO^ 
AACGGCATTTGTCATCACTGGC^ 
SEQ ID 2976 
QRVIiQRLGAPVENGLRHNAFQHQQQRVCA^EG 
QIADVFALVVmQHHQHAEPCRRHQRV^^ 



AQBGKQRQGVKIAFENTAHLVVGLGIDKQDKRQGI* 
SEQ ID 2977 

ATGGATACACAAATCAAAACTGAAG(XX3GTCAGC^ 

CAGGTTTGGGTATGCTGTTGTTTGCCAA^ 

TMTCCGGAGACTOGTCCXnOT'CCM 

AAAGTGGTCMCACXMGCCGOGCAATttQGGTTGCT 

TTATTAIWTTGTCAGCGGCTTGTAOCTTTG© 

MCTTGGGTGTCGTTGATTTTAOT 

CCGMCC(XGTTTCAGTCGTGOTA(XC^ 

ACGGCATTAACCCCAGCGAGCCGATGACATTGGA 

GTCGCAG(^rrCTATGAGTTATGACATGATCAGC^ 

AAATTTATGCXXSGCAAGC^^ 

GCTTGCCATTGCCGTGATTTG(X^^ 
SEQ ID 2978

KVVtflMPRNQGWYKTMDEIHGD^^ . 

PNFVSWPTHGEVIfflXaCV^ 

KFMAASIAUffiGTLGVWSVIiANVVFClA 

SEQ ID 2979 

ATGAAAACTTTGAMTTTACCAAAATGCA^ 

GGGGCGTGGGCTTOGACCJU^^ 

TTGTTTTGCCCGTTTTGTTGCAGACM 

GGCAAACCX^GTTTAIPGCXX^^ 

TGGGCMTCCTCATGCTGTGATTGTAf^ 

GGTTGTCGGACGGACGGCMTCCGTTTGCGCX^^ 

GMGGGAAAACX3GTAGAGGTGGTTTTGCCGGG 

a 

SEQ ID 2980 

MKTLKFTKMmjGNDFMTmVSQD^ 

GKPKFMPSEIPPVPESGBGDDACIYGVHIiESGIQPVSCV^ 

EGKTVEVVLPGGTIjYIEHACG^ 

SEQ ID 2981

ATGOTTTGCTGC^^ 

TCT(XK3TTTTA6GGGTAAAGCTGATGCCGGG^ 

CCGGAAAACAGATGGAAACGGATGGttGGCAG^^ 

TGCTTm!GGGAACGCTTCAA g 

GCGOTATAGACCGGCAGCCGCCGTCMOm^ 

SEQ ID 2982

MILIflLDFLSAIMAAVFLFIJFI&G^^ 
CTSGTVQVFWAAUjKLYALKPVYWFV^ 

SEQ ID 2983 

TTGCCGAAATCGGCAGATCA3TGTGATGCCGOT 

TTCTirrATGCTTCGGCGCAAATCXXOT j 
SEQ ID 2984 . ^^^^^^^^^ ii^ TC YK^iwi>LFR^ 

SEQ ID 2985

ATGAAMTTGCAAACAG(^TCACCGMTTGATC^ 

G<^GCGTCAAAGACCGCATTGCCGAAGCAATGATC 

GGTATGTGCCGCACGCXSGCTACAAACTGGCGAra 

ATGGCGGGCGtt&TTGCCAAAGCGCMTCOT 

AMTTTGGAACX^TACCGACGGTGAAGTCGAT(^ 

SEQ ID 2986 

ffiOANSITELIGlWLVKUEUi^ 
HAGAIAKAQSLVDAHPDTYmPRQFimANPEVHRm 

SEQ ID 2987 

TTGAGCXSGCXaGCGAAAAAGGTCCGCAOTGATT^ 

GCTGCTGCCTTCTTATGGCGAA(XX^ 

SEQ ID 2988 

IiSGGHCGPHPIQGIGAGFIPTVLMCITO^ 

SEQ ID 2989 

TTGCAACATAJ^AAAAAGATT^ 

SEQ ID 2990 

LQHM}Oa)SIlMQTYKPSESIKCCCPTPNCTIL 

SEQ ID 2991 

ATGGCAGAAATCTGTTTGATAACCGGCACGCCC^^ 
TTACGAACATCAAAGGTTTGAAGATACCGC^ 
TGAAAACGTCGGCGCMTCGTTATTGTCXsATGAG^ 
GATATATTTGTATTGACACAAGGTCCM 
TA3$CGCGGATGACCCGGTAAAAATGGCAT^ 
TTCAAMTGGTmATGCATOCCCGTCATC^ 
GCGGCAACAGAACAGCAGGCAGTACTTCCGGATAAAACAGAA 
' TTTATMCXJGTGTMGGCAGGTAAGGAOTTTCMT^ 
GATCTGCAAGGACTATGTAAAAAACGGCTTGCroro 
GGCGGAAAAOGC^XmACCTMT^ 

SEQ ID 2992 

MAEICLITGTPGSGKTLKMVSMMA^ 

DIFVLTQGPKLLDQNLRTLVKRHYHIAANKMGI^ 

AATBQQAVLPDKTEGBSVKN^ 
SEQ ID 2994
VU3BiaEPPPLSR(^PYRniIiIBHIVLKHRPBPDIIRPSTS 

GTGTTGCCGTCAG&CTGACTTCX3CCGCXX5GAMXKTGGM 
IAEQIjPHPAPGFTHAAFTPSLFFGRFAVGQSQWUIFPFGKIiCGKSSGT^ 

TCAGCAAGGGTGTATGTTTAACATACAC 

PCRSIAVHMMVtoiflHuuvpviu.orvntiwv^ v „„„.„ ^r^r.^ «ij^svEmVDEK*FLWFCCHACFIiIIAIiVF*1JQWGI 



UJIjKHFKNIAKAKPRFSPQDIiSTDIIiF^^ 

OjHOTXamAGGAITTGCGCTT^ 
SEQ ID 3000 



)RAPNBKTGRNARQHAFFFHH(?KKFKSRFVRHF^ 



OTAAAQYGGSLSFIlGNTIiIWPVFTQHPiHAG^AEADAGHEDOT 

GGCAGCCATATGGGGMGGGAGGGGATATTGTCGTCGGTAACGGftAA 
SEQ ID 3002 



TmtXCTICAATGATGATlTTGAGTrTGGGCGG^^ ~ " 

GTftOGGGGCGTGG 

SEQ ID 3004 

VFGI/3SLIVRSVPVGPYAIm'mvLMj*w«w» »> — ♦» • w«— »— — 

AEFGYNGNAVWGFASGLVSGIiMIiALSMVTV^^ 

GATTGGGCAAAAG^TTGAAGGCGTGCAGTGGG^ 

SEQ ID 3006 

VVMQCFAWftMmMH*^ 

C0GCAG6AAT6GCAAT6CAACTAAAA 

SEQ ID 3008 

U)WQAVKTGKTKHAD^^ 
SEQ ID 3009

ACGGCWmATCTGATGCCGACCGGAGTC^^ 



CAGCGAACCGAGG 

SEQ ID 3010 

VGGIWKY(KYFSCIMPAEAAFLILPA 

SEQ ID 3011 

ATOCCGTATTTCCATATGCCGCCCAC^ 
(X^TGATTTAAACACACXJCAAATCAM 

SEQ ID 3012 

MSWFYAAHTVSRTKCPIiFN(M3 

SEQ ID 3013

TTGMTTTMTCXIACTATATGCACCGCCT^^ 

SEQ ID 3014 

I^THYMHRIAQIIIMaGTYGLPGKKMRSGCT 
SEQ ID 3015 

. TGATATTTTTCTATGGTTTAAAGMTGGTGCAGTT^^ 

SEQ ID 3016 

LSDAEIRIHIESGFFIIJOlPVPDim^ 

SEQ ID 3017 

ATGAACACAATOmTGTCGGGCGCGGC^ 

ATATCCmCGGCGGTC<KTGTGTTTGCCGC^ 

GACTCATCCCGAAATCTCGGAAAGCAGCCACT^^ 

AGCTCGATGCGCCCGGGATTAATCMGGGCGATT^ 

ATGTCGTTGTTTITAACTATCOT 

TTCX^ACATTCCTGACGO^ 

GGACAGOTXXXGTTTCCCTGCCCGTATTG^ 

(jppTOGTGTGCAAAGTTCCXX^ 

CATTTTGATGAAOTCGGCGATTTCG^ j 

SEQ ID 3018 

MNTMLMSGAAAALLAGIILYFKS 

SSMRPGLIKGDFILVGKFSYGLRVPVLNNVFIPTG^ 

<$PAVSLPVLGmSD^ 

SEQ ID 3019 

ATGAAAAATATCCGAAATTTCTCCATCA 
TCGATTCCATGGACATCGAAAAAGAGCGCGGCA 
CGTCGACTTCTCTTACGAAGTTTCC 
TTGGG0GT(KiAAGTCGT^ 

CAGCCAAAAGC(X3CATCGGCGTGGAAGACGTTTTGGA^ 

AAATCGGTTCAAAAACAACMCTAAAAGCC^ 

AGCCGCTGCCCGGTTTCCAAGAACTACAAAGCC^ 

q^TGAAATTCGAGCCTGAAGTTTCTCAAGCA^ 

TTACCGCGACXIATCCTTGTGCCGCAGGAATAOT 

OTGCOGATGAACGAAGTCOTAATGGACTT^^ 

AmTGGTTAACGO^AAAMGTCGATGCOT 

ACATCGCCGTCCAAGCCGCCATCGGCAGCCGGA^ 

fGAAAAACAAAAAGCAGGOUUVCGCCGTAT^^ 

SEQ ID 3020 

MKNIWSIIAHnHGKSTI^ 

I/3VEVVPVUMDLPAADPERVBQEIEDIIGIDAVGAVQCSAKSGIG 
KSVQKQELKAGEVGFIiITCVKEIX^AKW 
ITTAPrVVYEVVIiKSGEnEVENPSKLPDIGSIETIIjEPIITATI 
IMVNGOTDALSIJVHRQSAVHKGR^ 

SEQ ID 3021 

GT«n«TCTGAAACCGTT^ 

AAACAGTAGCCGTTATCGGCGCGATGGAACAGGAAAIK3GW 

ACGCATAGTGOTGCATTGAGCGGCAft^^ 

AAGGGTTTGAAAGTCGGCGACGTGGTGATCGGCACGGA 

ACGGCATATTGATTCAAACGGCAAAAOGGGCGGOGOGGAOGTTTGAft 

MGGCAGACATCAOTTTGMGAGTTTTTGAAAA^ 
SEQ ID 3022 



KGIiKVGDVVIGTETEHHDVIWTAFGYARGQV^ 
KADISFEKFLKTAAASSAKMVAEUVKSL 
ACCAGCCGCTTCCGCGTCAATGC6AT6ATA
TOCGCT6AAAAAACGCGG6CTGGTTAT 



ACC0CGITTTG6AACTGCTC 

ISELIBmHEIKEVHKKSTTIiGMSTFDQ 



HIiYQLYEKGBISL^IMiKNADSAHDIjEIAVQIilSREAQSSDPDI£LL 



GGACGAACXX^MMCmCGCAAACTGTTG^ 

<K£(^GTAT<MGI^ *™ 



====== 

GAACAGG 

VFVGWMQKWbVDI^^ 



TATCAAGGCTGTATAAA 
SEQ ID 3030 



AACCCGTCAGCGTATAUAtKiAAUAiA.xi\2flu-uKi-uufii ™/».v*.v»v.^>~w — . ;_„„. v,»..„»ww t i«rarri iTaccfinfJiOOJiGCGTAATA 

rcc^xrrecGCGGCJ^ 

GGGTftrCCATO 

AAAGAAGGAAT6AATACTCTCCCCCCTTCTITAOT 



LGHHIDHAAVI 




GYPYCQAE*WGKHSG*PVVKIKNIVVWVGLyRQRFRKS 



GCGGATA' 
CGAATCTGACXX&TAGCGATTGGTTGTGG^^
GAAAGACGTTOTCATTAIfln^ 



CCGCTTGGGC^TTTCTTGGA 
AaXmCCATCTBCTC^^ 
TATGAAACCGTTT(n^CGCCTGCCATGGCAATGCM 
TCGAA©mTTAATACGATGC^^ 

SEQ ID 3034 

mTSQFTSMFWNimVIVLLSFIALAWL^ 
AZ^IETOKDPQAKQIAQ^FNTYCIQCHGSDAKGSKGPPNLm 
OTGQGIQGLGPNLTTOVWIWGGTQKSim 
YETVCMCH(MIPGIPHVGTKADWA^ 

SEQ ID 3035

ATOCIATATCATOGATTCGCOT 

GGGAAAGCTTCTCCGMGTCTTTGC^^ 

TOCCGCCCATCCGAAAGTGGTCGGCATCGGCGAGAC^^ 

GGAffKXXCGTTAaCGTCCATACG^ 

CTGCCATGGATTTGGGGCTTTATATTTC^^ 

cxxxyrrccT^cccG^ 

ACGGAAAACTmACCGGCFGTrTAAAAM 

SEQ ID 3036 



GLFVTVHTRDAAADTLAILKE^ 
TENPYRIiFKKVPDMRW 

SEQ ID 3037

GTGCCGACGATAAGGAAAACCATGAAGGCAATTCATCCGTAT^ 
(X^U^GGCGGCAGACGGGAAGTAAAAAAAGACGAAAGCGGCGGCGGAT^ 

SEQ ID 3038

VPTIRKTMKAIHPYACPRCC^ 

SEQ ID 3039

ATCOTTOCATCO^ 

AC^OKX^(XrPGCA(X^ 

TGTCGGCGGTTCGGACATCCTGATGGAMTGTATGW 

SEQ ID 3040 

MASIHDQIKEVVTTHRWLFMKGTC 

SEQ ID 3041 

TTGCCQAATCCATCA6GOTTCAGCAGCTCTTGCAG 

TCGCTCTATTCCTTMTGCCTTG(^ 

GAAACT(K^ITCGTACCTTTCATAAACAATA^ 

SEQ ID 3042 

LPNPSGFQQLLQIiACLIHF^ 

SEQ ID 3043 

ATG(&TrCGGCAATGCCGTCTGAAC 

SEQ ID 3044 

MDSAMPSERVSWIFFSGKLKK^ 

SEQ ID 3045 

ATGMTATMTCGGGCCGGACATCTCAAAGGACACC^^ 

TGTGGATAAAGGGAAACAGMTCAGAAMGTCTATATCGGCATGGAGG^ 

AAAMTCMGGACTACGGAAAAAGCAGGTTTAACCGCACC^ 

AAAAACAAAGCACTGCAAAAAOTATTAATCTTAAAAACCAAW 

TGATAGATACCATACAGGACAAGATGGMCAGGTAAAAAT^^ 

CACCGCATCAGTTCTTTATGCGCAACTGAC^ 

<KJCA(»TTCAG<»»T^^ 

C^GATOTMmTCGTTtmTCA^ 

SEQ ID 3046 

MNIIGPDISKOTIDATtHRTM3SIHYIKPKNNDK 

KNKALQKIJMiKNQLQQQQKQlKNRIiHSTEEDFlRN^ 

©UiSRYGHRRUCS^ 

SEQ ID 3047 

TTCCCMTCAATATOTTACGATTGCG^ 

TATTTGGAGOTTOTO 

AAACTGAACCAOGAGTGGAAAQCXX^ 

AAAATCTGAAGAmACCCGTCCT^^ 

TTGTCTATTTTTCCT^^ 

CCATmTAACTTATCCATCACTGATTGra 

AGCCCAAGATOCACTTTTATAAAAT^^ 

TTOTATCATTCTTCCAAAATACAGCTW 

TAGACAATWK5AAAICTTGTCCAAAAATAAATO 
SEQ ID 3048

KI*RLTRPFPRrcLKTIAYPFSTCFAFPRP*FIKFPIH 

SPRFDFraLLGNFSQNICFIIAIKRSMSQI*RTD^ 
SEQ ID 3049 

ATGAAGGTTJKCOTJ^ 

ACGAGQCMGCCGTGATTTTG^ 

GGGTATGCTCGACGGCGTGCTCGACCTGATTC^ 

GACGAGCGTCCGGCTITGATTATCGATCC^ 

rcCCTGAGGGCGTGAAGGCTGTTAATGATGCGCATCC^^ 

CAAGATTTTCGGCACGCGC 
SEQ ID 3050 

DERPALIIDPMIATGGSMVATIDLLKEK6CRNIKALW 
SEQ ID 3051 

ATGAATTTCCAAGACTATCTCGCCACAT^ 

CQCTCMGCTGTACAATGCCTTGGCGGM 

GCATCCGAACATCGATTTGCTGGAAAATGTCG^ 
SEQ ID 3052 

MNFQDYUATFPSIIXII/3GLIWQDAEGKTVHHIPA 
SEQ ID 3053

ATGCCGTCTGAAACGMTTTCAGACCXjCATGGTCGAAM 

CCC(^TGG<XX^<£^^ 

CGAGOTGCCCT^^ 
SEQ ID 3054 

MPSETNFKMGIUttiLRGKRFEQ^ 
SEQ ID 3055 

TOATTTTTGCACTMTTTTTG^^ 
SEQ ID 3056 

LIFALIFDnPYNKIiNYKGFYllVGRNKRIEHFKOT 
SEQ ID 3057

TTGTCCGATACGGATAAACAGGCTTTTTTGACT^^ 

AAAAT(XK}AMGATTGTGGCAA(X1^CGTTTTTTC 

<»WCATTOTAT^^ 

GMGTTGAGTTAGACGGCATTGTGCMGCTGATGAAACTrArrc 

AACGTCGCATTTCCAMGAACAAGTTTG^ 
SEQ ID 3058 

LSDTDKQAFLTSVSSKEQVKm 

EVEUEIVQADOTSTISYKGlfflK^^ 
SEQ ID 3059 

ATGTTTGCTCTIXKXXXXIAGTMTATGATOT 

GCTTAAAACGC(XAAAT6TTGGCTTTCTC^ 

AGGCATTGCCATTATTCCAGTCAGTTGCGTGATTG^ 

GGCACGGTMCGTGCATTGTCGTATTTGAAGCCC^ 

GCTACTGGCGTGAT(&C(&TOT 

AGGAGMTCGGTAACAGCATTTGTAACT6ATMC6ATGM 

TTATTTCMCTACTGGGAGCGGAAGTAGGGCCGACTG^ 

SEQ ID 3060 

MFAIAAS!WILRGDGKA1^QSSCFMTDFQDLIKNPK^ 

GTVTCIVVFEAIOCPHFQTVVIDPDTQEEISTKKAC^ 

LFQLLGAEVGPTENIJMSYEDDDNNDFGDIffi 

SEQ ID 3061 

ATGGTTGAATTGCAA<^^ 

CAGCATCTGTGGATTTATTGAAAAATACAAAACCTO 

TCnTOTGTTTTAAATCC^ 

CGACAACTGTTTGTGCCATCTTTGGAT^ 

(mAra^CGACTTGATGAAATTm 

AGCMGAACGCTACGMGTTATATCGCTCCTGATAG(X?TO 

ACTTGTGMTTTGTAGCAAATAGTGATGTCGCCCT^ 

ATGGA(BTAMC(£AAAGGC(^ 

SEQ ID 3062 

MVELQEIFDVSyGSiaj)LNKMSSFNPTINFVGR^ 

RQLFVPSLDEIPSWVESVNI^ 

SEQ ID 3063 

TTGTTAAAACACAGCCTTCTATTTCAGGATACGGGCMT^ 

SEQ ID 3064 

IiLKHSItLFQmXMWSmTTHKAPPyVIiP 
SEQ ID 3065

AT 

TCAACCGCMTCTTTACGAATAOT 

CX}G6GCAACGATGAAATTCCAAAACGGCGMTTTA^ 

CGGGTAfHHraC^^ 
SEQ ID 3066 

MMKKMPLSAMiLLSAMQTVWADTVFSO 

KVGCTPKTAQANFNDBDFSR 
SEQ ID 3067 

TT6GTCTATCTGMTGTA&AAATAACTGTGTCA 

GTACCMTl%TltXy^CCTTTGACT 
SEQ ID 3068 

LmNVKITVSSLLTPLYUiRIiPKKFRK 

SEQ ID 3069 

ATGGATATTMCGGTATTCGCGCTCTTTTCACGCT 
. TTOTGCTGAAAACXAAGATGCGCAAGATAAGAA 

SEQ ID 3070 

MDINGIRAIiFTVWIFICFIiLVIiYI^^ 
SEQ ID 3071 

TTGTrATTTTTATTTACGTTTATO 
SEQ ID 3072 

UiFl^rn^QMHGYTNIFAQPFNFVEFY 
SEQ ID 3073 

OEACGAAIXy&GCCTC^^ 

AGTCTGCCCGAAGGTGCGCGCCT 

ATAAACCTTTGAACTTTCAAAA 

ATCOTTCTATACmAtt^ 

TCAGACGGCATGGTCTTTCCCGGMCCTCAAACCAA . 
SEQ ID 3074 
METAKPVMLIVRPSGRAKDEWEVCR^ 
SLPEGARVLSVRGHGGRDFLMNALQ^ 
SDGMVFPGTSKQ 

SEQ ID 3075 

GTGGGCGAACCTGAAAACAMTCATCCGA^ 
CTramATCGTCAAAC^ 

CCAAGAGCTGGCATTCMCCAGAAAATCGACAAAGC^^ 

GACAGCGGTGTCAAAGCCAACXjG(X3MCAMTCCTGA 

CW^OGCTGGTGTTGACCm^ 

CGATTTGGCGGAACTGAAAAACCGTCCCTATGTCGATA 

GGCGTACAGGCCCGCAACGMGCCGTTTCCGCTTC 

TGATTTCTCCCGAA(^GGCATATTTTGTGOT 

AGCCGCCGTCAGACAGTATTrcGATGCCAAGT^^ 

TTAAATGCCGTCCGCGCCTATCGCGACGGT^^ 

A 

SEQ ID 3076 

VGEPENKSSEPVREIQASKBOTSETSS^ 
DSGVKANGEQILMTQKAYRHMG^^ 
GVQARNEAVSAS^QNVWEKSLGTI^ 
mmYRDGTRm'AAEKQEAEQAASEPANEOT 

SEQ ID 3077 

ATGAAAACGGTAGTCTGGATTGTTGTCCTGTT^^ 

TGCACGCCTTTGTGTTAGGTTCGCTGAT^ 

(XXmGGCCGCGCTTGOTTGAATMGG^ 

TTGGCATT6AT6CTGGGCGCGWCGCGGCA(^ 

TGGCGtMTCGGCGTTAAACCGGCGCGATTAC^^ 

TCGGGGCC&TGCGTTGCAGGTTCTO 

GCCGATGCCGCCOOTimAACCI^^ 

GCTGAAAGAACAGCCCGATAAOGOGCTTCTGC^ 

CCCGT 
SEQ ID 3078 

MKTVVWIVVIiPAAAVGLAIiASGirPGDW 
LALMIX^AHAAGQME^ELRDRYIiAEIAKLPEK^ 
ACAAALKTCLKRIPDSL^^
IPARLVLAKVFDETAQSQKAEAQRNLVIASVAGEN^ 

SEQ ID 3079

ATGACTCTTTTAAAAAAC^ 
CGAAA(X!GGGCAGCTTCCTCGATTT^^ 
TCCCGATGCMTGGGCTTGGGGCTCTATTT^^ 
TJUX3TTTTCXSACGCGCT 
AATTCCGCACCATCAAAACCATGATGTAC^^ 
GGTGCAGATTTTCGA^CT^ 
CCTGTTATCGTGTTTGCCAAAGGCG^^ 
iVCCMGTCGCCCTGCMGGCAACTTC^ 
CCTC(^CACGGCATCAACCAACAC^ 
SEQ ID 3080
MTIiLKMOTLRAIMQ 

YVmWSIRKALDGKVPLIGFSGSPFT^^ 
PVIWMGGGLWIiESMAQIGADALGLrWT^ 

SEQ ID 3081 

ATGATGTTAOTTACGTTTTO 
GGAGTACCTACTTTACGCAAAGCCTTC^TGT^^ 
Ga(X?TCGATGC«5TGCCATTCGTC 
ATGACX^GTAACGC^GGTTTCCGCACGGAACGGA^ 
CCCG(KX5Ca3GCT^ 
GTTG 
SEQ ID 3082 
MMLFYWQSQTQSLQIGDYSIQLGFV^ 

SEQ ID 3083 

MOTGCGAACCTACTreAACCAAT^ 

TAITOCTGGGCMTGTTC^GTGGCAA^ 

GGATCAAACAGTCAAACGAGCCAAAGCCAATCTGAA?^ 

MGAAT6CCCGTGTATTGAAACGTTGGCGTMTTO 

ATMGCCGGCTAAAGATGCCAGTACTATC^ 

AGAATCTGATTGGCAACCTGAATOATCa 

GATGATATGAAAGACGAGAGTGGTGmTTGCCGAAG(^ 

ATCCAAGCTATAAAATTCAAGATCATCAAGCCTTO^^ 

TTTGGAlTrGCAAAAGA(XMGCCCGATATTCATCA 

GATTTTGACTTAGAAAAAGGTTTCAACATCATO 

ATACTTTCTGGCAACATTCGCGTATTTTT^ 

TATTGGGCAGGCAAAAAATTCAGGCGGCMTATTCAGGTTAm 

TATTTTCCACTCOCCCAAACGAAGATMTTTA 

TTCTAGATAAATTGG6TCGTTATGTTCCAGATGA(TO 

ACGTAAACTTTCTC^GCAACCGGCACAATGOT 

TGGCAGGGAAAAGATTTTTG^ 

SEQ ID 3084 

MLRTYIflQLTPPELADSVK^ 
KNARVtKRWRNLFASQSSI^OT 

DDMKDES GE I AEGAKQALLS FLI TCAEF ALCDKANCNP ALHPSYKI QBHQAFS KKIQ AFLNDLVQAVNNG EDLAG SFKESYLDLQKTKPDI HHFDEIYEKLTALLENKQ I STL WNSQTET 

DFDIiEKGFmi(X^C^GLTIPKLQTmSRTAKK 

YPPLHPNEDNIiSEINKILPSILKDEIQSDLYQIDIEDIjFLVI^ 

WQGKDFWliPHIKLPHNGLVYQSAK 

SEQ ID 3085 

AIGAATAttGTTTTTTCXiAATATCGCCAA 
ATGACGCAGTGGTTGAGTTACATAAAATraGGAGCT 
TAAACTAAATGATTTTTTGCGACATGAGAAMGAG^ 
TCTGCTMCTTAA(^TGTTTTTGGGATAGCATO 
GCAAAAATATC(^GAAGTTGAAAGACCMGTAAATTTATTC 
GTCAGAATATCATTTCTCAATTCC^ 
GAGTTGATTGTATCCAAAGACATCACTTCACAAG^ 
ACTTC(X5eiaGAAA« 
O^TCATTTOAGTTACGTTCAACCG^ 
SEQ ID 3086 

MNTVTSNIANAKITEKSriNAVWMDLFKSM 
SANLTCFWDSTERTYET^ 
ElJVSKDITSQECTPVLKSF^ 
SEQ ID 3087 
ATGCTATCTAAACAAATCTCAAATOT^ 
G(X3C(MCGATTTCTCCCATI^^ 
CATTTTAGGCG(&TTCCCTTGTCAAGA^^ 
(mTOGTTGCTGAGMTGTGAAAGffm
CAGMTTTGGCCTACCTCAATTTC^^ 
ACCATATGTMCAGCAGOTAG^ 
TTTACCGATOTOTAAAGAmTCCm 
GGGOTATCACTTCOTX^^ 
TGCTGTTOTCCTCAGGGCGTGGTTGAA^ 
. GACCGACTAAGCAAAATTCGAGGGGGAAAACM 

SEQ ID 3088 

MLSKQISNLNSSSNKPKILSLFSGCCX^ 
VFVAENVKGLLTANKKmQ^ 
raiITOHPLYVK(mSHVYRRMHRNEPSm 
DRLSKIRGGKQ 
SEQ ID 3089 

TTGATAAAAATCCTTCTATGGATOT 

ACGCAACGCMCGCAACGCMCGCAAC^ 

CTTGTTTGTATOQ 

SEQ ID 3090 

SEQ ID 3091 

ATGGCAAAAACGOTAAAACCCTTTACC^ 

CCGAGOCGAAAAACGCCCGATTCCAATCTT^^ 

0CTATTG<KXX3<XX£^^ 

TACGTTTCCGGCGAAGAATCCGCCCAAC^ 

AACAGCATCAGCCCGMGTCGTCOT^ 

GGCGAMCAMTGGGTATCGCC&TGATAOT 

CATTCCAACTACCGCATGATACGCGC^ 

GGCGAACCGGCGGCGGATTTGGCGGTCATCCTC 
KmCGCGGACAAGAGC(KXrK^GAArc 

TTCAAGTTTGCACXSAAGCCATCm 1 1 

SEQ ID 3092

MAKTIJCTLYQCTECGGTSP 

WSGEESAQQVALRSQRLELHAEGVNLLAEIRMEAIQAA^ 

HSNYRMIRAIKIJRFGAANELGW^ 

GEPAAIMVILAMI^SFRNRPMPEmVFGEIGLSGmP™ 

SEQ ID 3093 

TTGGTTTGTGGCGGATTGAAACAAGMGGCATACCGG 
TCGTCCGTTTCGGCAATGATTGGAAATCA<X^ 

SEQ ID 3094 

LVCGGLRQEGIPATDKICGKVACmmRPSSmR^ 

SEQ ID 3095 

TTGCCTGTGATATGGCAAAACGCATAGGCCCGTCAT^ 
GGGGAATOAGGGGCGTGTGCATT 

SEQ ID 3096

LPVIWQNAYARHPCEGGNPESSVSAMIGNHGNPTAWIPDFG(MGTOI 
SEQ ID 3097 

TCTGAACAAGCCX^GAAGGTCATGCCGAGCAAGACCM 

AGTTTGGOGGAGCOGTTGAOGATGCCGTTGT^^ 

AGTATTTGTTTTCWCAmCGTAM 

AACAGGCGATGTCAAGCTGTGCGACACCATTGC^^ 

TAGAGCATGGGTTCGATGGCGATGTAGCCGATGATGACGGACGGM 

(JPTCTTCACCATGGCCGTCTC 

GGCAAMTAGGCAACGCCGCTGCCCGGCAGGGTGCTGTA!^^ 
ATAGTCAGCCAAGTAATCXHEATGT^^ 
(X3T6GGTCATGACGTG(3AACATC^^ 
ACCCAAGAAGCCCATAAACAGGGCGGTMTCGCGCCGATTACX^ 
ATCGTCG<XKX!GTGAATCAATGCAGAAATCGGAGTCX^ 
TXACGGTMTCAAAGACCATTCCACGCCGO^ 
GCCGATACCGAGCAAAAAGCCGAAGTCGCCGAC^^ 
ACCGOTCCCAACCGAAGAAGAG^ 
TATAACCGATGGTATAGATATGCACCA^ 
GGTATAGACCTTTTCGTCAAACTTO 
CCGATTTTGTTGCCGAACAAACCCGCAAIPC^^ 
SEQ ID 3098 

SEQAPEGHAE&DQYAEHHSGG 

mCQAVRHHCgaVELFFHDVHGRMRVGVDEm 

GKIGNAAARQGAVFHCFDNRIPWEAGEERRnjQRQM 
GIBVFVKimAIEBSPQHIW



AAGG 
CAACCGTT 

SEQ ID 3102

MICCPSTQNKKQSCmHim 

M^T^GGGCGCGOTATCGGTATAGGCGGTGCGTACWTGCCGTCCOT 




AAAACCGTAGGCGAA 
KTVGE 



QELKKAGYPNVANHGGYEDLIiKKQiK 



TCCCQC ^^ 

AAAATOXXXXGO^^ 
MCGM^m^GACACCGACCCGCl^f 

gaaaaagcx»»ca^^ 

(^GCGATArreCCGTTCTGGTACGCACGCACAACGAGGC^^ 

(XX7KOTAAGOCGGAACAAG3GGCGCAGOT 
jCGCCGACGASGAAATC(XS3GAACGCCTGCQCCTGCTTrATGT " 
(XTTGATCGAAGGOTGCCGCAAGAC^^

®£ACGGATTTCACCTT^ 

ACACCAGCTTTACCGCCTTMGCCXsC^^ 

GGCTTCAGACGGCATATCGATACACGACTTTC^ 

(XXXJACAAGCTGAAAAAAT^^ 

GGTGCCX5COnCCCGAAATGGGC^ 

GACGCTCGACTTOACACCGTCAAC(^ 

WGCAG6CGATG6ACGAAGCCGTC606CACCA6CACTACT^ 

GCTACCTGTTTTOCGtt^TTGGA 

SEQ ID 3112

LEQKSVERVLVVTFTKAATAELKT^^ 

FDVT&TEEDGDRLIiVPAQDF^ 

AGQLPOiDKDRIiaUiIiK^ 

IDEFQDTDPLQYRIFQKIFIARNRPItFL^^ 

EKANKDVLRRRAAITC^ 

QNESEIUWAESARTALDIV^KYG 

CPPAWDAQDTGPSDWQILHQGANRAEUiAKAQ^ 

GTDFTPTEDAPPPMYRGNADQAAEFAANSIPERGPRPVRHK 

ADKLKKyGFEEIWLPAVAEMAEACRKTPLTGAYGLSDIPPGC^ 

BGAHDEAVAHQHYYIjQALIYAVAAARYF 

SEQ ID 3113 

ATGCCGTCTGAAACCCITI'TTCAGAC 

SEQ ID 3114

MPSETIiFQTAFSRFTVQPPSYTLCPHPSEHSra 

SEQ ID 3115 

ATGAACA(XKXXX^TCCGCCATCATTCCCX3^^ 

CGTTATCCGCCTTGAJAGCOCGCTTTCXJGGA!^ 

CGAAATGGGaUCGGGOTWGCC^ 

ATATGCGTCGCGTCCGAAGMCACGGTTTGCGCC^ 

CCGMGGGCTGCCCGATGCAGGCGGCAGCTATGTCC^^ 

AACCBACGACmTTCAGOT^^ 

AGCGGCGMCACGGTTTGATGTTTACCG(^TACTGCTO 

QQQCQGCCGmG 

SEQ ID 3116 

MNTPQSAIIPDHAQAGIFIEADFAANRLNDIKAACRASLEA^ 

ICVASEEHGLRLYQDRGIJXjFVIXtf 

SGEHGimAYCRTIiHNIEAQIiI^^ 

SEQ ID 3117 

GTGAAATTGCGGO^C^CAMGAATACTTTTTO 
ACGAAACCGAAAAGCGGC^TCAGGTATG^ 

SEQ ID 3118 

VKIiRQTQRIIiFLYGYSKSEWERYAEAAGSGNKGGGKGGGRHETEKRHQVC^^ 

SEQ ID 3119 

GTCTTGA0GGGCAACATATCATCTGCGCG6AAAGTO 

SEQ ID 3120 

VLTGNISSARKVTVSANATEKDSVFGDAVHIVIAT 

SEQ ID 3121 

CCX^mCAACTOSTCC^ 

ATTTtJTCCGGCAGTA^CGCCCAAATGTTCC^ 

CCATC«X«<&T^^ 

SEQ ID 3122

PFQLVQIGNINRIiSVAVHQ^ 
SEQ ID 3123 

ATCGAGACTCTCATGGACAOO^ 

TTCCTCCTGCCCTCGACTT^ 

CTACGTTGTCCMCGTACTTGTAATACX^^ 

CAAGGTAAGGAAIATGCCGAACTGGAATQGCXJGOT 

TOCCMCTGGTTCTACGGCGGCTTTATTTT^^ 

TATGGTTCMTGGTGGTACGGGCATMCGCC^^ 

TCCGTCGTTCACTTCTGGGCGTTGATO 

TCGCACCCTCTTGGGGCGGTATGATTAACGGCA^ 

TACCTTK3AAG(KXX}GATGATGTCGATT 

TCCGTCTATTACATGATTCC^ 

TTGCAGGTGTGATGCAGGGTTOATGTGGA 

TTTATTGTATCTGAGTGGCATGTGC^^ 

SEQ ID 3124 

METIlffiTQTYNYKVVRQFAIMT^^ 
QGKEYAELEWPIDILITLVWVAYAIW 
SVVHFWALIFT^AGFHHI^^
SVHMIPRIJGKEQMHSTKLVE^^ 
SEQ ID 3125 

TCGAGCGCATCAACAATAAAGGCACGGTTAC^^ 

GGCGGAAAAACTGGGCGTAAAAGTCGAGTTTAM^ 

(KCCAGGCGACATTT6ACAMTCC6MCCTT 

SEQ ID 3126 



VARPSTAINBLKAD6TLKKLGEQFPGKDISVQ 

SEQ ID 3127

ATGAGCATATCAGCGCGTGTTO 
TCGTTGATTGCCGTGCTKMTTOGC^^ 
TTTTCTTCAGATAGTCCAAAACXX3CCAA 
TTTGGCTTTTTCGCCGTAGTTGCra 

GGTTCGGATTTGTCAAATGTCGCCTQG^ • 

ACTCGACTOTAC(XmG 

AACCGTGCCTTTATTGTTGATQC6CTCGAT 

SEQ ID 3128 

MSISAKVSVINCmEAKKnJ^ 
FGFFAVVAGQGIAGFHAFDVGYGF^ 
NRAFIVDALD 
SEQ ID 3129 

CTTTTACGGGCTGCCGTCXX3TCGGCAOT 
CCGAAAGG(KIAGTGGGAAGCAGGTTTCT 
GCTTOITCAAAAACACCTC^^ 
WATTGGTGTTTCTGTAAAGTCCTGTTTI^^ 

SEQ ID 3130 

MTETRADMLISAFWPMVKAGFTVSLPLAI 
PKGQWEAGFSlQfiWftTFRRIVAP^ 

SEQ ID 3131 

ATGGATATTGCGGATTTTAATCATGACAACTCOT 

MTCGCCGCG(X^TGGTTTCGGAAGCGTATGCGTO 
AGCTGAACC^CAGCGGCGTACOGCGAACGACGGAAATA^ 
TAATCATGCCGATAACGAAAGAAGCGATCGCCAAAGGCAAAGA 
AGAGGCAAGGAAATTATTGAACAC 

SEQ ID 3132 

MDIADPITONSIjFGDITVETFFQTRLIW 
NRRAYGFGSVCADIERKADIXX&NRID^ 
RGKKI1BI 

SEQ ID 3133

ATGATOAAAATCCGCAATATC(^TAAGACCTTTGGOGAAAACAC^ 
CAACATTTCTGCGCTGCCTAAACGC 
ACT(XXKCG(^(nCCGGAATGGTATTCCAACMTACMC^^ 
GAAGAQGCTTTGAAACTGCTGGAAAAAGTCGGCTTGGGC^ 

AATCAAGTT(^C(£TGGMGTTGCCACGAA^ 
TTAAGCCAAATCCMTCTGCCAAGATT 

SEQ ID 3134 

MIKIIttttHKTFGENTIIjRGIDLTO 

EEALKLIiEKVGIiGDKTO^ 
LSQ1QSAKL 
SEQ ID 3135 

ATCGCAAGC^TCA(TCGCGACATCTTC^ 

MGGTATCGCCCGCATCGCGATCGGACGWACG^ 

CACTCCTATCCTOAOTCGCAGCC^ 

GCAGGCGMGCCATTCAAGAACTTTTAGCTATTGTT^ 

GACACGTCAAACTOVAACGCCCGATAAACATCGCX^ 

AGTGGACGGCAATTKX!CTAATCACCACCCT^ 

GACCOrrTCGGCGTOGW 
TAATGGAAAAAACCGGCCACAGCTTCATCAAAT^^
(^GCGGACACGTITrcTTO 
CCGCmO^TTTCXy^m^CTCM 
rcGACGGCCTCCGCGTTGAATTTCCC^ 
(XX&TTCAAAGCCGTCATCGAAAGCAATCO^ 
SEQ ID 3136

MASITRDIFKAYDIRGIVGKTLTDDMYH^ 
AGEAIQEUAIVEKDGFVAADKQGSW 
DRI£VVTKIX^IYPDRQI^^ 
PQSISTPEIOTSLPBGSNGHQVIEELAAKAETO 

SEQ ID 3137

ATGACAGMTTACCGTGGATAGCCGMGCGAGAAGGCACATC^^ 

CGGCAMGTCTTGCTACTTTGM 

GTCGGGTTrG&CGAAACTCGAAGCCCCOGCATAOGGCTG^ 

GQCGGCAATCAGGGCAATATGGTATCCAIIK^ 

GTTACXIGGTTGTCGGACGTTGCCGCCACGGC^^ 

SEQ ID 3138

fflELPmEAREHIGLraPGARMPrn 

GGNQGNMVSII PFDPADIDGYFWPSKHGGKAVPSSPAEGRYRLSDVAATAKQGAGBA 

SEQ ID 3139

TGCTTTGCCCTCCCC^ 

TTAACTTGGTCGGCTGGATCCATACXJTTTC^^ 

CGGTATAAATACGGCTGCCCAAATCGCGGATATTGTTCT 

CAACAACGTACCGACCACACCGGCAAC^^ 

TAAACCACCACCAGCGI^CGCCCAAAAACTCX^^ 

SEQ ID 3140 

CFAIiPFRIiEAV*NFAFFGRFDIiRFD^^ 
QQRTDHTGNRRVPPETGAHFLDVDIQ 

SEQ ID 3141

ATGCCTTTAAAATGCAGCTTAACCTACATCAATAAA^ 

SEQ ID 3142

MPLKCSLTYINKVPQKGKIGNKSl^RVVIYYNNMnK 

SEQ ID 3143 

ATGCGTAGTATTGGTTCC^^ 

TlTGTCmmTTCAm^ 

OTCCaGCAATACCTt^^ 

CX}6ATTCAAAACGAA0CGTCCAAC6GCCT^^ 

TCCTAAACFTCCGTTC^^ 

CAAACmACG<mCCACGACGATG^ 

SEQ ID 3144 

MRSIGCSVGLCCTDVNHRPK^ 
PIQNEASSGLPTOKYTIAMARTSDPHSAGAQPFIira 

SEQ ID 3145 

ATGMTCCCCCTGATGCATCX^TGTTTTTCCCOTACAT^ 

TOK^GAACCGACGGTTTOCAAACC^^ 

AMCGTCGAGCTGCTCAGTGOXAGAAGCCG^ 

AATAAAGGTATCGCGCTGCK3A3OT 

ACATCAAAAAGGC(^TGGCTGATTTTTCC^ 

GCGCCTffK^OXMGCAATATG^ 

ATGCTGAGCCACCTCGACCGGGAAAAAGAACACAAAA^ 

(X5AACGC(nK5GAAATCAAAa 

(3CGGC(»CATCAax:TGA6a(X3(^^ 
CTTCATTATTTTTCTGACC^ 
GGCATCtKm^TCTOCGCCTTa 
TATGCGTAGTATTGGTTGCTCTGTGGGCTO 

SEQ ID 3146 

MNPPDASCFSPYISIPSHKQNRQIVYNIPTLHRITNIIiEAVAEPTO 
NKGIALLIFVAAIAfFmVHI^ 
HLSHLDREKEHKTYVFLIi^ 
GISNPDTVIALSMVAWVFGV^^ 
GIAASCAFTILPVATPPNAIWGTGLI^ 
SEQ ID 3147 

GGCGCK^TCTATOGCTGCTOTCAAAGC^^ 
CKniPDQRSPCPLCRAFCLPPGU^ISPKQDBSP

GfXxmCftTAOTXXmCGCGGCGO^TKC^^ ' 

TCTGCCTGCCGCCCGGAC 
SACRPD 



IQMOCTMYYRIiUfilGHPGDRHLWGTO 
AGCGCGCCGCAAAC(XXTTCAAAATCMGAAGAA 

GMACGCTGCCGCACCGG ^^^^^ 



p * nrvPOTPT &TTOC^TTCATTGAATTmA^ 



GDFGM^QWOraVLTHTFlIGGSAESMLCW 
SEQ ID 3161

lOT^GcrirrccmcGCCTC^ 

TCTGCAAACGCTTAAAATCATTGA^^ 

SEQ ID 3163
GCGGTACGGGGCGTCAGCGGCTTTGCGT^^ 

ATAKHXSTTTCC^^ 
ATTCGATGTTGATCGCCATTCCC^^ 
CGTTPICTCTO^^ 
TCCCTCAA 
SEQ ID 3164 

RFLPGKIHCVFMKKVFERYAHATO 

AGGTCATCGGGCGTAGCGGCAACGAACTGAAGGOG^^ 

OTCGAAGGCCCGTTCAAAACOTAOGTGGA 

AmCCCCCGTCTraACCACC^^ 

SEQ ID 3166 

WVKKVEKNILVmGADKMFELVDKVEOT 

ISPVFHHLSATLVEAFVKEADRRYA 

TCCC^CAGGTCAA(yU^CGGCGMCTAT(^^ 
TGACAACCTGATTCAMCCCTTTTGMCAAAAAW 
GCATGGTTCTACTTTATGCGTATGCAGGCGGG^^ 
TCGCCGGCTGCGACGMGCCAAAGMGAAGTG^ 

TAAACCAATTATTGGTTGAAATGGACGGTr^^ 

O^GTCGTCGTCCCCCTG^ 

TTTTXXXJGCGCGGATTTG^ 

CG6AAC6CCGCA6TATGG^ATGCACGAAGACGAAAAACGTGC^ 

TATGCCGCGCGGACGTGCGCTGGGTCTGAC^^ 

GACATCTTCGTCGGACGCATCTCCACCGG^ 

CGGAAAACGAAGGCmtfK^^ 

TCCCTACAAAATCCTO^TGAAAACCG^^ 

CCGCXmGGATTACAGOmACCTGCGCG^ 

AGTCCGGGCAGCAGCCTGAAAACAAG®?? 

AWFYPMRKQAGGGGKGGAFSFGKSRARLLDKDANKTO 

OffEQAKKHAPCTIFIDEI^^ 

FSGADLAl^VNEAALFAGRRNKVKV^ 

DIFVGRISTCASNDFERATQMAREMTO 

PPKDYSHNLRENADAAEDNAPHAPTREKTEAPAPAOT 

SEQ ID 3169 

ATGGCTGTACGTTCCAAATCCTCA^ 
ACGAAAAAGACMGATAATCAAACCCGGCACGGTACT^^ 
CATOCTGCCTATGGAAGCCATAGGGGGCGTGTCCTT^ 
GATATGGCGCCCMTAT(mXK^AAACGCOT 
MGTCTTTCAGGGTG(^GGCTATCAGGAATAC^ 
CAAAAATAAACGC 
SEQ ID 3170 

MVIBKSSKAWLHEH^ 

(j-IXnGAGGCGGTIXaAOGCGCAA 

SEQ ID 3172 

Ml^FYSNSRKMADTK^ 

SEQ ID 3173 

GTGGTTGTTTTGCTATTGTTCITTC^ 
GTTGCGGCXSGCTTCCGTATGAffP 
SEQ ID 3174

TOnjJJFQVPARCPSMHFGVPKPFPVKTDFB'fRSMPSJSCGGPRKI 

• ^ TTGCATAAGCft^^ 

AMTGCCGCGOCGAAGGACAl^TGCCGCCCASTCAAAQAACAA 

SEQ ID 3176 

TGATTfATCCGGTGTTCCTATTGGAH^^ 

COTATTCCGATGTOJCACTCTITCCCGT^ 

TWX5AACTGGGCATTATGACGGATGTCQCGCTCGATCOT 

GGCATATTCCGCCAAATATGCTTCTGCATTCTACGGCCCT^^ 

GATTOVMCTATrACGCCATTGAGGC(XKAAAGATGOTjAAGOGT 
S ffi < KLi©FF^ASI^^ 

EAUffiVAICIQBGAMVMVKPGIJ^UWm 
SEQ ID 3179

Sggcggc^ 

ACCAA 

SEQ ID 3180 

LAAASGEVSAFPPimSGGimTJFCLREGGVIQHYQAHQ 

SEQ ID 3181 

ATOTTMCATACACOTT^^ 
SEQ ID 3182 

MFNIHPSGNSYQ^RKRGNNHERNRRPffii j 
SEQ ID 3183 

ctSkxjgtcggctacgacttcggcggctgga 

CncraGOeGOC^ 
ATC^rcCTATATCGG^^ 
GTOXATCAGAAATGCCCCCCAAACCTGCCTA 
GmCCGCTACCACTATTG^ 

SEQ ID 3184 

VSVGYDFGGWRIAADYARYRKW^^ 
VPSEMPPKPAYHEWRSSRRLGFGAKAGVGID^ 
SEQ ID 3185 

TCAAGAAAAOTAT^^ 
TGC(X3TTACCCGGTTTTTCAGTATCCG^ 
TTTGGGGACAGCCTGTATAGAGGAACAGAA^ 
SEQ ID 3186 

FGDSLYRGTESEMFLYA 

S |^^A^(»CA(»TCCC^^ 
AIAAAAITCCCGACTTATCCATATOTACA^ 
AAACAGTATGTTTGCGCTGCACG^ 
GGTTTTGCGTTTTCCAGCGGTATGGCGG 
ATGTTTATCAAAAAT(mGGTGAAOT 
GOTTTACGCrK^AGACAT^ 
GAOT^TTCCATTCX^ 
GCGCGATTO^GGTCCGACGGACTGCTGGCTGGT^^ 

TQCXATTGAAAAAGTCT^ 
(^GCGGCAMCAGCGTGATTAAAAACA^ 
TGAAAATOGAAATGGGCATCAGGGTCXXa^^ 
SEQ ID 3188 

GPAPSSGMAGIDAVWRTFLRPGDriVAVADlYGGATO 

DFAFHSATKYLCGHSDV^ 

EAANSVIKimVKtfPSSIXKSVESLVim 

SEQ ID 3189 

TOCAGGCAGGCGGCCGACAGC^^ 
AATCCAGCAGAGAT?TGCGCAACCTGCCGATCAATCA 
CTITKX&TTCCCGCGCCGGCAAAATCAOT

AAGACG(K3GCATACX&C^ 

(BCTTCCGATGCCGAAG^^ 

GAAMCCCGAAATTCGTGGAAGATATGGTGCG^ 

OTATGCCTATATCGOTACCCG 

SEQ ID 3190

LQAGGRQRFFRRHLPMQYTHIMYLNIKIKQEMAI^ 

LLDSRAGHSVSFPPFRKKTAPVSGI^ 

ENPKFVBIS4VRDVATSLIADKRI^ 

SEQ ID 3191 

ATGGTTGTGGATAGACTCGAAATICTCGCOT^ 
GCCITITOX^MCGTATTTTT^ 
TGA(XOTMCGT(XmC<Xm^ 
TTCACCCGTGAGGCTGACATCATAATCCAG 

SEQ ID 3192

MWDRLEILAIiDDETLDAFVGN^^ 
FTREADIIIQ 

SEQ ID 3193

ATGACAGTATTAAGCAAAGAra^ 
GTTTGTCGCCCAGTTCG(?rcGGTO 
TCATTTGGTGGTGTTTTTGG^ 
CAGGCGTTTCAAGCTGACGACATCAAGATT^^ 

TCGATTCCTGCCCGGTGGAAGGTTTC^ . 
GGTTCAGGAAATCGTCACGAAAGCGCGCAGGCCCTTGC^^ 

SEQ ID 3194

MMiSKEQVIiSAPKNRKSCRHYDAAl^ 
QAFQADDIKILDDSRALF^^ 

SEQ ID 3195

ATGAATTCCAAAATTTCCTCmCCATA^ 

CGGTCMCGGCAAMCCGCCGTACTCGGTCAGAMGT^^ 

CGOTATCHCAGCGCACMGCGGAAAAA^ 

CTTCmXXmKSGCAGGCTCGATATC^ 

GCGTGCGCGGCAAATTGGACGAAAAAGGACTTGCOT 

T0GOOTATCTCGCTCCCGGCGAATCGTTT 
SEQ ID 3196 

MNSKISSDHTEDTIRI£KRMAQI£LCS^^ 
IAPAGMJJIDSVGLIiVL^^ 
WKYLAPGESF 
SEQ ID 3197 
ATGAACAGCCCTTTTCACAACATCGCTATCGTAAC^^ 
MGTCGGCGTAAGGGAATGCTGCATCTATACCC^^ 
TCTCTCKKXKCCCGCGAAA 
TTAGAAGGAAAATACCTTGCCGAAGAGCGCATCC^^ 
GACAGATGATTGAATTKaAAGTCTTCGTC^ 
MT(ATGCAGGCAGGATTACACGCCTTCA(XOT 
GACGCGCGCGTTCATTTOACGGACM 
AAACCCTGCGCCAAAAACTGCACTGGQGCGAGC^ 

SEQ ID 3198 

MSPFHNIGIVTRPHTPDIQ])TAOT^ 
ItEGKHiASERILIEAALIREGKTASRALAU^^ 
DARVHFDGQSFIDVQNLDRII IRRYMPLRIIiHPTDYQYTKTIiRQKLHWGEQLV 

SEQ ID 3199 

ATCTCCGCCCTCCTCC^ 

GACGCATCACAGMGACGGTTTGCPCTCGGCGGGA^ 

CAGGCTCGAAGGCGACCTCATCCTCGGCATC^ 

GCCACGGACATCGGACACGGCATCAAACAAATCGGCAGGAAC^ 

ACGAAATAAGCAGACTGCGCGA(X3GCGT^^ 

SEQ ID 3200 

KSALLPIINRLII^PDSRSELTSFAGKTLTO^GLKLAGW 
ATDIGHGIKQIGRNIAEQWSREPESA^ 

SEQ ID 3201 

0X3ACGA<»TCAACCTGC^ 
CTCCTTTTTGGCGCATC^ 
TGCGACGAAACCGCCGCACCCGATCCGCAAA^ 
ATTttGCX^OGOaTC^^ 
>T(JSXGCGCCGGMTT!CICMXCGACGM:



CAkCCCCGGCGGCAGGCtt^^ 
AQCGTCGGCAAMTCATCCGTACCGTTTGCGKf 

SVGKIIRTVCDHGLSI. 

Ca^GCA^ATGCCGGTTC^ 
RMEGHPVDEIIRSAQAABVNIiEMFBERIJBftBV 



ATTCTTCGTTAAA 



GTITCACCGCGATGCftGCCGTATGCGGGGGCTTCGAGOT 

(YTGTTGGCATACCTGAATGMGCCGTGTTGCCCTCT^ 

TCCTACCmAGCCTCCTTOGCCOT 
SEQ ID 3210 



CCTCCGACTO^^ 
ATOXXXXXTCAG^^ 

CACMCAMTraC^ATTl^^ 



HIKFTVFARFQVElfiAEPNWLPTSFSL 
SEQ ID 3213

AT6CTOT 

MftJTACCTTTMGGGCMTATQQCGGOOC^^ 
GGGGATTTTCGGCATGATTTTGATGTGGGCGGC^ 
CCOXEGCAAT^^ 
TTTACGGCAAATTCGCTATGCCCGOT 

ceeorraBMSG^^ 

QCTTnCQCTraGQCQGOQC^ 
GGCMGCATGTACAACGATGATCCGGCAGTTT^ 
TACAAAGTCACCAAGGTGCQ^TGTT^ 
^TOCCTCX^TCACCATCGCAa 

SEQ ID 3214 

MLLAQVAQVGIGFVDTVMAGGAGKEDLAAVAI^ 

PAAMVIffiAIiHAYASSI^M 

VlftlAPFGHJYVAAQQVGISLSGILTO 

YKVTKVFMFIHAAAFWGCGIiLPGY^ 

S Ara!^ramTTTCT^ 

SEQ ID 3216 

MCTIISHHFPLPFFI^AFIiMmCCS^ 

ACTTGAOGGCATTGAGCCXjTCGGAC^^ 

SEQ ID 3218 

VHRNKGftHnVHm^SDGGYGVYKV^^ 

SEQ ID 3219 

TTGTTAATCCGCTAIACCXXKT^ 

SEQ ID 3220 

LIJRYTOCK1KJEGQRTAVRFAVIjPFKLK3NAA I 
SEQ ID 3221
ATC^rareCCTTCT^^ 
AGMTCCGAACAGCCCTOTC^ 

SEQ ID 3222 

mOIiFYPVNTOCPIRNQTASIiPQRITHiSK^ 

SEQ ID 3223 

ATCTTCCGGAAGCTE^^ 
AAACGGCGATGCTTCCTTCCGATTCAG^ 

SEQ ID 3224 

MFRKLPCUffiNLPSVIiSCKHKRIiPD^ 
SEQ ID 3225

ATGCAGACATGGCAGCTTCOGGAACATATCGC^ 
AGOTCCGCTCATGGAGTACGCACATTCCCTC^^ 
CACGCCGCAGCTTCGCGCGTATCGATGCCa 
CCCTTGCAGGCAGGGGCGGAAATGTACGGTTTT^^ 
GGCATATCGGCATATTCCGCGCCTTGTC^T^^ 
TfKK5AAGCTGGACGGTATGTGGGCAAAA(^ 
mGACmTTGCAGGCGGTGTGCGACGCAT^ 
TCttCGACGCGGTCGC<^^ 
CGAACGGCAGCmSCCGTCTT^^ 
TCGGMGAGCTTGCAGGCCGTCTGAAAAAC^CG^ 
SEQ ID 3226

PLQAGAOTGFADIRGDIEUDLMLKSmAKf^^ 

U)ELQAVCDAFPDNBIHIDIiSBLRVEN^^^ 

SEELAGRLKKTDGVWQVVKR 

SEQ ID 3227

GTGTCCGATACCGTAGTCMTCACGACACACTGCCC^^ 

GGCAAACGCCCXIATAAAGCTGCG 

SEQ ID 3228 

VSDTVVNHDTIiPLJTO 
SEQ ID 3229

ATOTTATGGOTAAAAATGTTCT 

ATGCGG(mTACCTTG(?rCGTCG« 

CCTGTflXXmAATCGAC^^ 

GAAG(M i CGCG(XjGCAAAGGCAAAATCGGCAO^^ 

GTGAAAAACTGGATGCCX3TCCTTGCCTATO 
ACCTCGTCCAACTGra^^
a&OWTTGTTCGACG^ 
CCAAATCAACGO^TPCCXm 
TGCGCHTOSCMGC^^ 
AACGGATTGAAGAAffTCTGCGGCGCGCCCXj^^ 

SEQ ID 3230 

EASRGKGKIGTTGRGIGPAYEDKVARRMRAADI^^ 

TSSNGAGAASAGAGVGPQMOTVIiGIVK^^ 

CGSDAVETCKPIYETMPGWRES^^ 

SEQ ID 3231 

CCGTCGTOmTttCTG^ 
CACTGTCGAAGCCCAAGCGCGGCMTGGMTCTGAA^ 
AGCACCGGTTTGCTCGACC&TATGACGCGC^^ 
CCTTTGTCGTGTOCCTGTCGCGCATTATTG^^ 
TGCCAGCCT6ATTGTCATGTGGTTCAGCCGCX^(^ 
GTCmTTTGCCXmGAAATC&^ 
SEQ ID 3232 
VKRII^LATNIAVLWINIVLAV^ 
STGLLDHMTRDEVEAVLAHEM^^ 
VDLPE™A>mGDTRDSLLSTO 

SEQ ID 3233 

TTGCAAGCAAATGTTTTGCATACCTTO 
SEQ ID 3234 

LQAKVLHTFAEPDIHKAATFWIMPQNFKTSH?rKK^ 
SEQ ID 3235 

GTGATOOTTTAAATGTGTTTATTGGAATC^ 
CTTTGCCGAGCO^TATTCACM 

GCTGCTGATGTCCAAAmATCGCCAMCMTCGGTCG 
CTGAAAACGCCAGMGTCGCCATCTACCACIXXXXC^ 
ACGMGTGGAAGCCGTGTTGGCGCACGAAATGGCGCACGTTO 
CMCCTGATTGOCCGAMCAACGACGGCAGCCAGW^^ 
CAAaX^TACCGCGCa3AC(X«GGCGC^^ 
SEQ ID 3236 

VILLNWIGIKQMVYDSTRI/3AIVQKPTAQTMCK 

AADVQIYRQTIGRCGSHRHAAH^^ 

QPIXPKQRRQPVPGNIJPSQHGIPNPVRI^ 

SEQ ID 3237 
ATGAAAGCATOAOTraAtXKXX^^ 
CAGGCACGCCCITGGGTTTCGAAGCGAAAAAAATCATC^^ 
OTXnTCGACGGTTTCCCGCGTACATTGGCGC^ 
GGCCGC(X5OTGCATTTGGCTTCCGGCCGTAOT 
AAACCGTGAAAAAACGTCTTGCCGTTTACCA^ 
AGAAGCTGTGAAAGCCGAAGTATTGGGCGCATTGGGCAAA 

SEQ ID 3238 
HKALLIX3AP(ftGKGTQAQFITAAra 
GWIVHLASGRTYHVTYNPPKTEGKDDTO 

SEQ ID 3239 

ATGWGAAAATAAAAGAAAAATCAGATGCTTGTATTO 
SEQ ID 3240 

MFENKRKIRCLYSRLRR^ 
SEQ ID 3241 

ATGAAAATOCCCTACC^^ 

AAGCGGTTTTTOTGTMC^CTGm 

AGAACOXsAAATGGTCAAGAAACGCCCTGTCGTO 

CACAAMTGAGTGAAAACCCCTTACCGGACAAGCCGCA 

GITGCATCCCAATAATCAGTGAAGAGGAT^^ 
SEQ ID 3242 

MKMPYRSGQQPMPHPAGWjRTLKGSGGIPI^^ 

HKMSENPI»PDKPHIQ(3U&CD^ 
SEQ ID 3243 

AGCTCCCGCCAI3TTCM 

GCACCGTTTTGTTTGGCGACA(KX3GTrTC 

GGCGCACGATAGAATCAmTGAT^ 

TTCCGCATCCMCGan^^ 

TGTACCACXXXKTGGAAAGATTGCAGAQ^OTC^ 
CATTTCGGCMCGCTC^^

TAAATGGCGGOHWCAGG^^ 

GGCCTCmACTCGTCG^ 

(KnCAGGAGCGGCTGTTCTOGCGCAAGT^^ 

TreTCTTGTTGGCGCAAACGGGTTGCAAAGTT^ 

GGT7GCC<nCTTTGGCM 

ATACAGGCCTTCGTAGGCAAAACGGTCGCGGTCGGACAGCCAGC^ 

C(£AAA(HX3TC(nBTGCGG^^ 

GCTCGGmCCACC(KTOGCC(^ 

GATACAGCGGCTCATTTOTCGGCX5GAAA(XAAAG^ 

TGGCATTCGCXXjCCCTGGTCGCAGGT^ 

CCGmCCGGCG^^ 

ACCGAGCTTGTGCXmCTTCAATT 

SEQ ID 3244 

SSRQFNNAHQRTQRGIFGMQRQMHHVFRQTGIGFI^^ 

FRIQRVFQNRSGIWFKTFQVKQVAQYAQH^^ 

HFGNALARVHTQYVRTDAVGGLRQNAQH^ 

GQERLFFAQVRTDYQYGIVVSQRFNTl^ 

IQAF^GKTVAVGQPAFVIXSFVFQRQHAAIXjMVFGLYNQV^ 

DTAAHFUjGNQRTHVIiADDCT 

TELVRGFNYRRAIjIjHRYLFAVDFDL*H 

SEQ ID 3245 

TTGATAAACAGCGTGTTGCMCCGAAAAATAATCATO 

SEQ ID 3246 

IJNSVLQPKNNHSASFPRRRESISEFGQSLFKSDEliSPIKGFPPARB 

SEQ ID 3247 

ATGACGGCTGATGTACCGGTTCAAATTTATCC^^ 

SEQ ID 3248 

OTADVPVQIYPKQFVGGLSPRRSDIHARPTV 

SEQ ID 3249 

TTGAACCGGTACATCAGCftSTCATO 
ACGCCGAA 

SEQ ID 3250 

LNRHSRHSRAGOTLIKLSSSDIiJSrcPNS^DSRLRGNDAE 

SEQ ID 3251

TTGAATTTACGTTGTTATAGCGGATTAACAAAAAT^ 

TGAGCCGGGGCGG€6CM(^CGTACCGGTTTAAAGTT^ 

(XAAACTCCGCAACMCGTACCCCC^^ 

CTGTTTACCGCTACGGGGCGCAGTTTGGCGGAAAGCCT^ 

(XGATATGGGCGTGTGGATGGTCGATATGCACGCGro 

GACCAGCATGGAGCAMGCGATTTGGCGGAAATCGGTTTGAA^ 

CMGAAGCCKGCC(XrPGCGCCGCGAATT^ 

CCTTGGCGGCGGGTTCGACTTATCTGGTGATGGGCC^ 

SEQ ID 3252 

UMCYSGLTKIRTRRRAADSTDSTEPVRPV^^ 

LFTATGRSLAESLIHQGFKLFLDLKYHDIPHTVAQA 

QEAAPLRREIiGRDFVLVTPGIRLDVAGNNDDQRRI^ 

SEQ ID 3253 

ATGCCCGCCMGTTCCMCMGAMCOTCAAATCC^^ 
AAGCAOCCGTGCCGGTGGC^AAAATCGGACGAAT^ 

CTTATCCGCCTTGATTTTGMGAA^ 

GTCTGTCGCATATCTCCGATATGATC^ 

CGCCGAATTGAAAGAAGTGGTTGGCAGTTGGAAAAACGAAGGCGA 

ATXjACCCTGTTCAGCGAAGCXXjAACCCATTTA^ 

GCACCATGCCCGAAGCCATGTACCTTGCCMTACTO 

SEQ ID 3254 

MPAKPQQETLKSRFAQAKVLVVGDVMLDRW 

LIRLDFEEHPNREVLEQIKRKYREILPEYIMIPOT 

ffl!LFSEGEPHQPTRAQEV¥DVSGA(X^^ 

SEQ ID 3255 

ATGACCATCCMTGCCGTCTGAAACCCTCAAMCAM 
TTACCGACATCGTOXXXjI^ 
TTTACCTTATCAAAACATCGAAGCCGTTTTC^ 
TGTCAGGACGAACGCATCCCCTTCCm 

TACTTCAACGTTTACGGACAACACGAA^ 
S52SS5525S55S5S^

CGAAGCCGGATi 

SEQ ID 3258 

VQIfiMiLHFGHHRNPYIKLNKSKSPDIFRRFFYGHSH 

TCGCCGACAJAGCOGMST 



(XXXX^TimmXimACG(X!G(mG^ 
GCCGAAK 
SEQ ID 3260 



AEISETVARIERLRREIDEVIASrET 
OXS^a^TTTTTTGAATTT^ 

SEQ ID 3262 

IJNNFU^SEKHASDG 

SEQ ID 3264 

HGSRARATOTBKPSTOVMGRGMRP^^ 



ACCGGCTGCCAA' 
ATOAAGAM 
AA 

SEQ ID 3266 
SEQ ID 3267 



SEQ ID 3268 



*«"**" r * . . _ - — - — - _ _ AMAnmMAMAi % 

TJOTGACAMTTCACCGAGCTGGAAGCTi 



gBNYFRNIGSKlWIATHH^ 
SEQ ID 3271

TTGACAATCAAATCGGGGGGGATAGCTGATGGCTATCA^ 

ACGAACATMTTACGTO3GCGTGGATMTCTOT . 

AAATATTCGTCCGTATITGAAAAAAAT^^ 

TTGGCCGATGATAMTTTTTT6CTTO 

AAAAAATCtm^CCATCCTAGGCA^ 

OTCCCCAAGGCTOCC 

SEQ ID 3272

LTIKSGGIAIX^Q(31IjKNVVWKTLG 
LADDKFFAFNMKmC&KmGS^ 

SEQ ID 3273

TAACTCATTTATTATATTTTTAAATGAOT 
CTACTGCATCGCAA 

SEQ ID 3274 

*LIYnFK*LK*IiFHS!HiIWia^^ 
SEQ ID 3275

ATGGCATTTGGTTCGATGAATO 
CGCATTCCATCCCTTTGGAACTGCCGACCGOT 
TTCTGCMGCAAAGTGGAAATCGGGGMGTGGAMGCCGTOT 
AAAGCTTTAGAAGCXXSCCCGTCAGGCAGGAATCACCA^ 

SEQ ID 3276

KALEAARQAGITKIGFVTEmQ 

SEQ ID 3277 

ATGAACCTCGAAACCAAACCCATCGC^^ 

CCGATTTGC AG AATCAGGGGT ACGAATACCGCAAGGATTTGAW 

GGCXjCGGTTTTTGACGGAATATTO 

TATCTGCTGGACMGAAAAACCTTGCCCGC^ 

TGCAGATTGAATTGAAAAAGCGCGGTGTGGCGGTGCGCGA 

GATTTCCAACGGCACGGACACGCGCTATTTCGCCAAC^^ 

TTTACOGCCACGTTCCTGCAAAAAA 

TTTTGTGGAAAATCAACAGCTCGGCOCAGOCG^ 

6GCGAC6GAATCG(XGTTTATCGACAAG6TTTrc^^ 

ACGGCGGGCTTGAMCGCAATTTGGAAAAAGACGGCMCAAMTCAT^ 

TATTTATTTTCGACGMTGCCACCGCTCGCAATO 

TTTGGGCGCGGAAACCAC(X£(3X^ 

CAGTTCAAAGCCGTGGMGCGGAACAGGACGAGMGGMCTC 

GGC^GAAAACGCACCGCTTGAATGC^^ 

GCACCCXjTTGAAAACAGCCACCATTTTTT^^ 

TTGCAGGCTtmTCAACGATOCAACG^^ 

T6CTGATTGTGGTCGGCATGTTT1TGM 

TTACGATGCTACCAAAACCTTCGGCAACATTGTC^^ 

TACGAAGMTACATGAACGGCTATACCGACAGCCAG^ 

MGACAAAAMGATTTTGCCAAACTC^^ 

TGCGATGMGGCGTTTCAGGAAAAATACTA(OT 

CGAGACTGGCTG(XX!CGCCAAAAAGCAGG 

TGCAACTG(?mTCGAGCAra 

CATCAACGATACGGATTTGGATAAAGTGCCCGACGTTO 

AACGAAACCGOCGCCAAACGCTA^^ 

CGAAGAMCAAAGCGTOTXEAAAAAATCGCGGCGTT^^ 

SEQ ID 3278 

MNLETKPIAETPNFIVLDQYQQEQSGSYQSENRLEAELIA^ 

YUjDKKNLARNHVQLINQFEQTGTHANRYDVTO 

FTATFLQKSVIiLSVLLHYSWDAMMMMRHQIAAA 

TAGLKRNIiEKDGmiVTTIQR^ 

QFKAVEAEQDEKEIjSAAENRQAIiI^ 

LQAAIM>¥NACFKINF<OTS^ 

YEEYMNGYTDSQTGEARRGYI^ 

RDWLRRQKAGEQRBQSKIDHDDVVFEVDU^ 

NETAAKRYI/KjSLKRGYAS^^ 

SEQ ID 3279 

TTGTCXjAGGAGATTTMTATGCTTGCCC^^ 

TCGMGAAGAT(K!CGCCGTGCXX»AACGTCTTA^ 

AGACACGGTCGAAACCCGGCCCACGCTCXK^ 

AC^GAAACCGACAGCXIACGCCGTTTATTTT^^ 

CAGACGGCATBGACGACGATMTGAGAAC^ 

CAAACACGAMTGGAAOSGCTOTGCAM 

GTCAACGACGACATTCXXXjACGCGCTC^^ 






CAM£AGCCTCATTGCGTGT^^ 
CCTCCGCA 



DFRHVILIMTTI&GM&IjS^^ 

CGGGCAAAAGC^XSAAGGACATCCGCTGCMTO 
SEQ ID 3282



RAKAEGHPLQCIVH5I 

SEQ ID 3284

UIIiimEKFl»KIVKMFNMGFOFimBGGE^ 

SEQ ID 3288 

MPSE)BGGATCPVPAMiPFTaAPPPrGDIAHFFH(^ 

S 5Ss 2 OTTmKFV* I sPHLv ? im*imPi^™ 
SEQ ID 3292
SEQ ID 3293

ATGCTOTCAACC^^ 

TCAGCGTGWCTTGGGCGAAATCGAACAA^ 

CCTGCGCGACaceGK^ 



MTACGTTTACGGCAACAC^ 

CGCCTGCCGCX»TO(^TATGGA^^ 

ACCACCGTTTCGGGCGGGCTGATCGGACACCT^^ 

AOTraXX3MGAAC0^CATCCCC^^ 

CAGTTACAGCGCGCGCAMCT(X3GAATGCAGACCAC^^ 

TTGCTGGTAACCGAATIX^TGGGACAGGGTG^ 

SEQ ID 3294 

MLFNHTASELLDLCR^ 
EWDLDTEAAVGLMQCEQAALDTDSiaENOT 
TTVSGGLIGHLVGTLSGGALYRQSSFLI^ 
LLVTELMGQC^ITGDYS^^ 

SEQ ID 3295 

TTGCAGGGCGTGCGGTGCAAAAAGCCGTCTGAAOT^ 

AAATGAAAAAGCAGATGMCCWTITGCAGGATTTG^ 

AAAAAWACGTCCAACGGCGCGCTCAAACGTCAGGC^^ 

CACAAOGCCTTTTTGCAACGCGTGGAACAGGC^^ 

TCCGCAACACXIAAAAMGAACAGGAACAGGACAMCCGC^ 

SEQ ID 3296 

LQGVRCKKPSERGGILLKYMGKNSRIKMP^ 
HNAFLQRVEQARVRLLADEGALTQFM^ 
SEQ ID 3297 
mTCCAACCCGGCCTTCAAAAGG^^ 
CAACCACACCGAAGGC6CGGATTTGGTCTTTGTCAA 
CAAAACXX3CGGCGGTCGTTTTACGGMCAACATOT 
TTGAAACAGGGCGCGGAAGTTTTTCGGC^ 

TATCGCGCATCAGACGT(£GATAMTTG(^ 
GTCGTT 
SEQ ID 3298 
LSNPAFKRPVSFAESSGLTANQYFPPIITRLPF^ 
LKQGAEVFRRFVLFLFFFGVADERPQ^ 
W 

SEQ ID 3299 

TTGGAAGATAAAATGTTCTTCCGTAAAACX&^ 
GCAAACACGTTGACAAAGACCAAATCCGCGC^^ 
AGATTCXSGCGAAGCTGACGGGOCTTTTGAAGGCC^ 
AGCCAGAATTTCA6TACCGGAGGTCTTTGCCTGMCTW 
C(XXK^TCCGTATCCGCCAAAGGCAAATACTACXKX^ 
CAAATCCMGCTGTTTGGAAATATCTTATAT^^ 
SEQ ID 3300 

IiEDKMLFHKTFAAVLAAT^^ 
SQNFSTGGLCLRYDTGRPDD 

SEQ ID 3301 

ATGTCAGTCAAAATCCAAACCCGCCCCGTC^ 
AGTTGGACGACAAACTOXnTCa^^ 

GCCGAAATCGCC(KXXKmAGGra 
ATTTGCCAGCCGATACCGTACCCGACT^^ 
TGCCGAACTGCGCCGCCGCMTTATTTTT^ 
CGCATCCTCGTGTCGCMGGTTTGAAACGGATC 
TTtXOTTGGGCCCGCGCATCAAOG^^ 
CAATATCGMCGCCGCGAAATOAGCAGTCTA^ 
GTGGTCGGCATTGTTGCCAGCCGCCTCAAAGj^^ 
GCGATGCTTTGGACTTGGTGTCCAAAC^ 
TGAAGMGCCXjTGCGTGAGATGGTGTGCGMGACGATTT^^ 
TGGGGGCAGGGCTTT(XX£CGCCGAGOT 
AAGCGATGTTCTGGC6TTGCAGCGAA6ACAITO 
GGOG 
SEQ ID 3302 

M^QTRPVNTDVFNHLLAAGADPIim 
AEIAAAQGVDLLITVDNGlTtfUGVA^ 
RILVSQGLKRMRSGKMRPGIRAIiFEVARRDWR^ 
VVGlVRSRLKDBFYBPIIVFJfflMJHGBJRBSGRSIBW



WGQGFAP1 
SEQ ID 3303

GTGirCAGACTGCATCW3GCGGTACGA' t ! ^ m ,^,„. 
SEQ ID 3304



TTCAAGACTTCGGCAMCGGCGTGCCCAAO 



(XBOnTIGACGTATTCGTC^ . 
SEQ ID 3306 , „„„,„ mTmTnmnji ,„ ffV n»inTttOTP»tOTAAGGAl^ 



GTACCGTi 
AGTGCGO 



S6 ===== 



AGGCATTTACGGCAAGGCTGCGGCAAGG 

PKLYRVIFRSRKAKVKFQCOTFEEVIPQIEKTGGYQITPiOTADDRTGLSRAVr 



GMGGTTGTGGGMCGGMTCGGAA 
SEQ ID 3313

ATGCCGTCTGAAACGCCGTCCACCACTC^ 
ACGTGCGCCTTTGCGtnTTTCM^ 

SEQ ID 3314 

KPSBTPSTOQVRGPHCRFRRARAGNHRRHGNAPVQ^ 

SEQ ID 3315 

ATGCCCAAAATACATTTTTTCAT^ 
AGAATAATACA 

SEQ ID 3316 

HPKIHFFINFSLLLEPCPPGHPFS^ 

SEQ ID 3317 

ATGAAAAAATGTATTTTGGGC^TTTTGACCGC^ 

(X3GGCG6CMTACCGTCAAAATCGAOTTT^^ 

GAAAGTGCCOTK&CGTC^ 

SEQ ID 3318 

MQfclLGILTACAAMPAFADRISD^^ 

SEQ ID 3319

GOOGTTTCGAGCaTlTrACCTT^ 

CGTCGAAACCCAAACCGAAGATGCCGGGCATC^GGAAAAAAAT^ 

GCCCTGCTCGA(X^C(ACATCGTATTCGGCCT^ 

TPOGGCCTGCC(nCGAAGCCGGTGAGAAACTGGGOT 

TGTAACCAAGCTGATTGAAAAAGGCCTGATTGAM 

ATGAAAATGTTCOT3A(XCGCATCG^^ 

AOGGCGTGGCAGGACTGTATTTCC^TACC^^ 

SEQ ID 3320 

fflflTVHLHLEETDNPTLQRU^ 

ALLDHDIVFGLGPAGTGKTYLAVAAAV^ 

MMFLTRIGFGAXAVITOT^ 

SEQ ID 3321

TTGTK^TCTCGACCACTTGCTTGTC 
GCGCGGGAAT(^CGGCATTTCAGTATTT^ 
SEQ ID 3322

LLDLDHLLVCRHSRAGGNPSSMVSNVLIjNSET^ 

SEQ ID 3323 

TTGGCACMTGCCCGCAACTGCGCCTGAACATCGG^^ 
AACCCGAAGCMTCTTGGATTTGATGAGGAACAGCGATO 
ACATOGOSKXKEKX^ 
(^GCTCAGAACCGACTGCCTCGCAAC^^ 

SEQ ID 3324 

LAQCPQIiRIOTGGSGQEEQRIjKQQAADLGIAHAVlTLGAIfl 
QLRTIX^TFGBDAVIGRIJAWRQAARNTARKRPKNRIjK 

SEQ ID 3325 

ATCmiCGAACCGTTGCACGTCCra 

CCATCCTCGCCCEC^^ 

TTTCTTCCCCGCTTTCCGTOT 

SEQ ID 3326 

MSEPimVIPI^RSEQDVDGIPPQNQAgriiQRK 

SEQ ID 3327 

GTGCGCGCCX3GTTTGAAAGCCTTCAAACOT 
GCATTCCCTACCTT^ 

caTTXtxxmamxrrec^ 

SEQ ID 3328 

mGIiKAFmiRMGLPDLIHAHCKNYAGIIAQI^ 

SEQ ID 3329 ^ . 1TO gcgtoI' TTTTOGCGGC^^ 
AATCCKXX3(^AT(X&GGTGG^ 
AAAATCGAAAAATGGTAGCTTTAAAACCGAATTGGTATC^^ 
GAG(XKIAGGGATTTTGATGCTTTAAACCGTOT 
GCAAGGATGCCXTITPTGCGTTTCAAGGCGC^^ 
TGGGGCTGTTCGGGCAAAAATGCTGCGCGCGA^ 
CAGTTC 

SEQ ID 3330 

IJWFQVRIRICCVLFWRLPCWR^ 
EGRDFDAI^IADVQTDASLOTDFS^ 
MCCTTACCCCGTTC6&GCCG6AATGGGAAGGCTACGTTACCCTO



)WTPFTOEWBGYVTIiEFSNTTPLPAKnAGEGV&QVIiFFESDEVCETSYra 



SEQ ID 3334

IiQIiRQFIj 



ATTTGCCGACAGGCTCMGGTCGTACATATTCTAG 



fnra'ITITGAAGCCGGCtKTO 

KVVKSSGFSKLDNAARKAAQNOT^ 

SEQ ID 3338 
SEQ ID 

TATGAAATGCGAACAC®3CATGAT 

SEQ ID 3340 



YEMRPFHD 



TaGCACCATGTTOMGAGCTG^^ 
CGCTTCAAGMGAAAWX^

SEQ ID 3344 

VFSSTERHLTQKDPIMKPTIAIJGRPNVGK^ 



RFKKKTKVSVKKQPSK 

SEQ ID 3345 

ATGGCAGCCCATCTCGAAGAACiUOAGAGTO© 

TTTACCAAAACCGTGCGGCTO 

ACAAAOTACCCOCATTOATTT^ 

AAAGACAGCCTGATTCAGGCGTOK^^ 

AATGAMCTCGATTCGCTGAAA 
SEQ ID 3346 

CGATTGTCGAGCAAA(X6GTTTGTTTGTC^^ 
AGGTACKGCCTCTOTra^ 
CAATTTCATCAGGTCGGTATCGAGGCTTTGGGT^^ 
AAATCAACAGCTTGGG^^ 

GACGGTITGGGTATTCAATATATTGAAMTTCGCG(^ 
GCGGA(XHTACGACSGTTTGATTGAAGAACTCG^ 
(XXSXKXKXnTUTGTCTATGC^ 
CTCAAGGCGCAAAT^^ 

AAACCGTTTCCGCCACCGATTTGACAGA^ ! 

SEQ ID 3348 

MNDLIiFVGQKDFKLTAVFWQ^ 



LKAQffiOUOTSGARFALIVAQDEIJUE 

SEQ ID 3349 

^GGATGGAGATGGTTTO^^ 
TGGGA 
SEQ ID 3350 
VRMEMVCOTVKMPSETVFRRHLTO 

SEQ ID 3351

TTGCCGCTTCTGTCTGTCAGCCGTTTCGGGCTG^ 

TTACCGTCAGGGAGGTGGCGCGCGGCTATACGGCGCGCTACTGG^ 

CGTACCGCTGCTTACTTTGATCTT^ 

CCGCTGTCGMTCre^ 

CGATTTTGTOGCGCTCAAC^^ 

GATOTCCTGCKSC^ 

SEQ ID 3352 

LPIASVSRFGLHT^ 
PLS HIAMAVI«G^^ 

SEQ ID 3353 

TT6AATTATATTTATAAAAAAGGTCCAAATAA(3GTGC^^ 

SEQ ID 3354 

IjHYIYKKGANKVQIKLVFPNGLKNKSEKSS 

CCGAAAGCCICCCGCAACiATG^^ 

TTCGimCCGlTreCT^ 
ATTTGTTCCAAC^^ 
SEQ ID 3356 

^ * ■ ..._»„.™»™« ,<^marnfuv «n TTiTuiunnnnm t nnnnnT m "AVBHKRYPIH)HRBHI AHP F RLF HR I Vift E IQ 



SEQ ID 3357 

ATCMTTTACCTTTTCGGGCMTGGTGTCTGATO 

TTATTTTGGCAACAGGACGCAATCATACC(^TATGTCA^^ 

GCmATAGCMTAGTTTGCCTGMGAAOT 
AftTGGCACAAAAAKPOC^TCCGT^^

SEQ ID 3360 

HPSHiCFRHHFYPPANHIiHPHKP^ 

GAG 

^^^^^^^^^ 

SEQ ID 3366 

MPmKFTOIiYIWnpLSII^IiUJaAI^QO^ 

SEQ ID 3368 

IJFDElKMOllVHmEALASKIjQyFNSKMIAA 

TTGATTGTTTTGTA 

SEQ ID 3370

I^OiRHAKASVVm^ 

SEQ ID 3372 

MKDLFLFSCSVCLI^YPIiIiLIAmGRTINKIK 

SEQ ID 3374 

LQIQQFBYCHCSVQKEStiUCCVWSFSIAF 

SEQ ID 3375
TTGGACAGTATTGATGTATGTTCTTGTGA^^ 



GAOTTCCGAATMiCC 

TCTTTATTTGATTGACAAGGCAG^ 

SEQ ID 3382 

UJHYSUWHFDFIPSINUriVFKSRlYA 
SEQ ID 3383

TTGATGGGCKKmATATAGTTC^ 
GTTTTOTGAGTTTGATCCTG& 
SEQ ID 3384 

SEQ ID 3385 

ATGAAACGTATCmTTGCCCG^ 
TTAOTATCTGAA?A(£GAAAACAGCCGOT 

AMCGGATGTCCGACATATCCGCCGGCATO^ 
TMTCAAAAAAAGGGajCOTGCCCCTTm 
SEQ ID 3386 

MKRIFLPALPAILPIiSAyADLM 
KRHSDISAGISHJYLKDGKNPAL^^ 

SEQ ID 3387 

ATGAAAAAACGCAATAAJn^TCTTCCGCTGTTGJ^ 

CTTTTTTGGGAGCGGACGATTGGCGGAGGCGG^ 

TGAMCCGGTTGTGAGAGCGAGCCGCGTATTC^^ 

TTGCX3GGAAACGCTGCTCACGCGTCTGC6OT^^ 

ACAATCCTGCCACACG03(X3GOT 

TTTGGTMGGGGTGA6AAATT1TPTACGA(XK^ 

A(XXX3CAAGAAACACCAGTTA06CGTGCATAT^^ 

SEQ ID 3388 

MKKHNNPIiPIiiaR3VIU>S^ PFEEKIIjPVDfflLIWDKPHFLPVIPGQRF 

LRETLLTRLRLRPELQHIiWEDITPIHRlJ}KOT 

TGKOQLRVHMIffiliGMPIJJffiALYPV^ 

SEQ ID 3389 

TCGGTCAACCT0CXXX3AATACGATGTCCTGOT 
TTCAGGQK^AGGGTTTGTTTGCGCCGTCTC 
TGAMTGGTGAATCAGGTCTTCCATACCTGOT 
GATGATGCGTACGGATTGGCX3CATTTCTO 

CGGTCAGGAGGGTTTCGAGTGTGTCX^TATTTT^ 

AMTTTGCTGCTCTCOTATTTGGGC^ 

TCGCGGTCGCGGAAGGCGTAGAGGATGGCGGTCATCGCGO^ 

GGGCGCGGATAQGCACATCGATACCGGCAASTTTT^ 

TTT(XOTTCC(X£AGTOT 

SEQ ID 3390 

SVNLPEYDVLRTDMjNDV^ 
DDAYGLAHFFHTAEVTVVAVAVDADGDVEIHFV^ 
KFAALVFGHKAG*VAVNMGTEVGGVHTCAGHGFVQVH 
FGPRQPFGAPVQEADMRIGTDDLFAVQLQlffiTQ 
SEQ ID 3391 MM&M^^ 

ACACAGAAAAAGATGCACGCTATATCGCCGACAAAA 

GCAGTTTACGCTTTATGCCGACGCGGCMGCGGGCGGCGGCXT^ 

AI^TGTCGAMCAGGGCGITKX^ 

CGGAT 

SEQ ID 3392 

MRAVIQKTVGAKVDVVSEACTET^^ 
IHVETGRPRraMQVSLCN^ 

SEQ ID 3393 

ATGCTGAAACC(X5AAACCATACACA^ 

AAGGCGGGACGMCACCMCAAAGTCATCCAAACCGC^ 

CGGACGCG(£GAAACG(^GACTGCCTCX^ 

GCACAAGTGAGCATACCCGATOATTGCTGC^ 

AAGACGAAGTCGTCGAAATQ3AAAAAG(XCTGAA^ 

CATCCTCCGOTCGCACCCOT 

SEQ ID 3394 

MLKPETIHIPGPAGILETIHIPSEQVPARGVAVINHPNPIiQG 
AQVSIPDIiLLLIGAAVCHYTORPEPSAVPNVAKTIffl^ 
SEQ ID 3395 

TCAACCTCBACOX^^ 
TGCAOGCTGGACAGACAOTACCTGCCCGAC^ 
ATTATCGAAGCAGACGGTGCGGAATACATCAAAGTCTTC 
GAGACTGCC^CAACO^CTCTCT^ 
AOTCCTGCCGAAAGCCACGGCM^
SEQ ID 3396 

MRHPYRRLRPAKSGFPEra 

IIEADGAEHKVFRHlfPDIILVDGPDGBQIIDTLVEEPF^^ 
FHRMLAGLKASNPNHGKHFHL 

SEQ ID 3397 

ATCCAAAACAGCCCB^^ 
AAGMGGCGMTTATGC^GGCAACTTCACAC^^ 

SEQ ID 3398 

HQNSPTVLHNRAVKSGAGTKSRTRDPLIT 

SEQ ID 3399 

AKamCTGTOMCA^^ 

TCGGGGATTOTIGGGMT^^ 

CGAOTGCCGTTTGGTGCATATCAGCAG^ 

ACTG^TTTCTGCAAATGCG^ 

TTAACGATGCCAAGGCGCATGATGAAGCCGGAGCTGCC^^ 

GGGC6CGGATTGCX3ACGGTCAG6WTGGT^ 

CGGGCGTATGTTGCXX3AAGTCAAAG(£AAto^ 

SEQ ID 3400 

IDTVmQKMmGEKimTAYESSFAAUffi^ 

TEFI^MRGIPVCAHIGLTPQSWAFGGYKVQOTGGKAQM 

RAYVAEVKAKTFPAASJIFAD 

SEQ ID 3401 

ATGTrCCGCAGCAGGGAAGGTTTTGGCl'TO 

ATAOCGAGCATATCGTGCATCACCAAAACCTGACXX^ 

OOITCAGTACGAOG(XIA^^ 

(^GACCGATGTGGGCA^^ 

GCGGCAAACGCCTGCTCCTTACTCTGCTGATATO 

CCG<»GCGTCGAOTGTCGtt 

GATTTTCTCGCC6GCCG(XTTCAOT 

SEQ ID 3402 



QTDVGTDRNSAHLQKFSRFRHPHAAFEFD^^ 
DFLAGRimQCTHSNHQILLPRPFQTAPSV 
SEQ ID 3403 

TTGCmTTGAACMGGTCTGC^^ 
AGGATTACCGTAATGCAMTCATACATACCATO 

SEQ ID 3404 

LQIBQGUjPMPSESRFRRHmPCAARIIGAFCR^ 

SEQ ID 3405 

TTGGCCACOJPAGTTCTCC^ 
ATCGGGAAGTCTTTGCGGAACGGATGTCCGACAAAGC^ 
CC(mrrGTTC?rAMTATCGGCT 
GOGCTTOOCCIGCCMGCTTOGTTOT 
TCGGGCAWCACTCGACGGTMTCTCGCCC^ 
SEQ ID 3406 

AFALPOTFVSAVII^GVNQKIiEMQI 

SEQ ID 3407 

TTGGT(^TACGCTCGTGGCmGATCAT^ 

TTTOAAAAACAGGGCATTGAAGGAAG^ 

AACGimTCCGAAAGCGCACTGa^G 

GGGCTGTCTTTTGMGGGCTGATGAAGO^ 

GC3GACMTOCCX^CAATCa3^^ 

TTTGAGGCAGGAAAAAGCCCOCTTGAAAATCGATGC^ 

SEQ ID 3408 

LVDTLVAQIl^ADRH^ 

SEQ ID 3409 

GTCGCATTTGTGCCGACttTC^ 

AGGGCXAGMTTCGJ^ 

GCG?TACWVCGTCX3AACCGCCCCATT^^ 

GCgKHTTOGCAAAAAAGATTACCA 

TCTCCAGCCGCAACCGGTACTTGAGCG^ 

gcggcaggcttoxsaccat™ 
gcccgtctggggacgacgcgtct<^^ 
SEQ ID 3410

VAFVPTtm^EGHLALVREA^^ 

ACTGKKDYQQLATOGLTEDI^IEIVPVD^^ 

ARLGTTRLITNVEVGLP 

SEQ ID 3411 

ATGTATGATTTCXOTACGGTAATC^ . 

SEQ ID 3412

MYDLHYGNPYPSGFIiPDKTRRLYAPRTAKQiaV 

SEQ ID 3413 

TTGTCCGGCATCGCATACAGGCAAACCGACGCAAOT 

SEQ ID 3414 

LSGUYRQTD&ISKHSPAPGRVLQIYKPFGL 

SEQ ID 3415

GTGAAACGGGCa^AACATAATCATCCm 

SEQ ID 3416

VKRAfiNinLYPTSVRTPRRHIEYIjDKASLYHKPHGL 

SEQ ID 3417

TMGGAAACATCGGCtfmCAO^ 
AGGGCAGGCGGGAACGGCTCTGGCAACCTATATGCTGA 
ATGAmATCAGAMTGGCGGCAGATCGAGCCTATAC^ 
TGCTGOXmTCGGACGATGTGCAAA^ 

CCAOT 

SEQ ID 3418

MIMLPARFTILSVIAAALLAGQAYMGAADVEL 
MIYQKWRQIEPIPGEAQKRAGVMNVUlEGGNQHIiDGI^^ 

SEQ ID 3419

wGccccccamm^ 

T(MCTGGTTTCCX3^^ 
AGGTGCGTCCGTTATCGACCX^ 

TGCCWAAACTTCCCGMCAGCAGGGGCGGTATTTT 

CAT(XXXMAOTKKGCGGCGGGM 

CTCAAACTTACGCCXXjATAATGCACAMTTAT^ 

ACGATACCGCCGTTMCGACAGCATAGGCTGGGOT 

TTTGGGCGAAGTGTTCnKKjGC^TTCGC^^ 

GCCTKXXXX^GCCTTCCCGAAAACXXX^GGAAA 

SEQ ID 3420

LPPTLMTLRLTARKYPEILIXOT 
WQWIMVSAPEYLFDKGVlAAAAAAELra 
UaTPDNAQIMNNLGYSI^ 
ALPEPSRKPEK 

SEQ ID 3421 

<?TG(mCT(X»T(XXXm^ 

SEQ ID 3422 

VRTACRLPECPGFPANLMPSERMGLHYFRGFREGSGKAI P 

SEQ ID 3423 

GGCT(n!TTGGTGC^ 
CGAGAGTTTGGACATTGTGTGCCATTTCGAOT 
OXTOATCAAMTCGG<^CCCGM 
TTGAACGGATAAACGCAGATTTGCAAGAGTFGGAGMTT^ 
C(^TGCGamiTC6CGTOX: 
SEQ ID 3424 
LFRAAPGVKVAGIDRLTRNILMKLY^ 
RLIKIGLPEFVTPEAVKYFTOKKEKSIGSFSANIiNKTAQYLE^ 

SEQ ID 3425 

ATGAAAGCAAAAATCCTGACTTCCGTTGCGCTG^ 

CCGCATTCOTGCGGAAAACAGCCGTGCCG^^ 

GTCGGCAGAGTTTAMGATGCGCTTGCC^ 

GCATA(X£ACTGCATAT(XX£AAM^ 

CCGACAAGGAAGAAAATCXXaAAAAAAGO^ 

ATATGTGCC(X3GAAAGATTTG^ 

TATGTCAACGACAGCCGCGAGGTGAMC^ 

ACATCAAACCTGCAAAA 
S ^'j!JScSGSIJAC1w™^

ATOHIAKTQPVSEQEVMVY^^ 

YVNIOEWKVPSFDm^IAGiniQAKRimVGAUmHIKPM 
SEQ ID 3427

^CGGCACGCCGCACCC^^ 

AT^tyiAAAjXTGTGGCAGCCGT^ 

CGT03AAACCATCMTATCAACACCCCTTTCGGCACT 

GACTTGAGCAGGCAGCTCGTCGGTTTCAAACTGCCMrc 

ACGGTTGGACMTCBGCaGAACCGCCGACAGTGGGGGGCMff 

MCOXmACGCTGTGCGGCGCGCACGAG& 
SEQ ID 3428 



IGMPSETETPERCAABTR 

ISUIUUlWUVTIIWtr 11U1U/WWMIU)(< wri — —— ■ m" * 

SEQ ID 3429 



j3»HGCAGGATACCGTATAirTGAAAC0GAGGGA^^ 

iTOjQOSt^CAAOCTTGCCGGCGTGGAAK 

CAGTGTGGTTTGACGCIAGTGGCAGCTCAITCATTTCGGTGCGGCTn^ 

fl^TATTCOGAAACAGTGGTATGTTATTGft^ 

ijs(XTGCAACCGFIW!AAACGATATGCA^ 

TGTATATTrGCAGCATTTCAAGCTAGGAATAGCG^ 

^ttilADGRQAreAMQi^ 
CIFAAPQAEHSAYNIYRQVSGLYEAYIiAEGLSItHPIjLSV 

ATGTTGaGGTATAATOOSGG^ 

OraGGCGCOACGT^^ 
TTCKTATTPCGGCTATGCGCGCCMGACOKXIGTCCG^ 

Scaagcxmaac^^ 

CBHHTCICafflfiMOCG^^ 
S UPACEGMP^PA"" ~»»«n»niT^jnietnni!Tinei.m^ 



IPLSEAAKHCDRIRfiVT 
IAGIiIiAETVREISNEESVSYLFNEEVHTGSMLIiP 

TIAACGGCTTGAGMlTPrCAGCGTTMCAATGTGCAGGG^ 
GGAAATCGCGGACGATAACGTCTTTGGTnOG(XX3TTCAG^ 
AGOTCTTCACCTramTAtt 

. ««' w ^^ATCACTTCTTCATTGAAAAGATA 



RLIiWWCOTOTI^ATQAAARTSAIiCFTNRGLNF ISH\^SKU3KIAVIGRDQU a 'ASG I *(P®^Q'H^^ 

S ^l^ATGMMTOttGCC^^ 
TIGC(OTAGATCACAAMCCX3M 
CCCGTOXSKraOSAAGTCCAACA^ 
(XXXKTCGfXSTATCTCTGCTGAACACGGCTGT^ 
ACATC»AACT«XW5AAGGTCTAGAJA« 

GGRVSIiOTAVEVI^ANIPAPIiDIiIXaEmGDI^ 

GCAGGCTGAAACACTCACACAATTTAGMTATO^TGACAGCACATA 
ACCGAt^ACAOXmSXCCAC<3C^^ 

GAATGTTTGTACGTCKCGGG&TACGffrCAGCACCGACA^ 

mTrTO?fGCAACAMTCAACAM(3A^ 

CTCTCKAAOMTG^^ 
TAAACGGOTGAAAGCO^^
ATCGGACMCAGCAA(%TGCT 
GCAGGCTTCCTCAAAGAAGCCTACATCACTC^^ 
AAATCAAMTCAGGCAAAACGGACATAra 

SEQ ID 3438 

VniQCGIItaiLKSQNHCPYAQDGMCARKIItf 
Pi^TQOTAYLVFKNMKS^ 
I^EAIMDFPEYYPIiFSIXSFKFENIEQNNRm 
AGFimnTLIOTEAKMABQIIjETIQPIPAPVKK^ 

SEQ ID 3439 

ATCACCGCACGCATMGCCGTCCTCCTTA(^^ 

TCfyfl'TTTCAAGC 

ATAACGCTTCAJX^GCCC^^ 

lAAAACTG&CGCAGTGCGCTTl^^ 

CCAMCCTTCmCAATO^ 

CA&TCOSTTOOGOCTG^^ 
CGTTOX^AMTOmmV^ 

CTTTATAAAAACCGCTGATATTGTCGTAAACGGCTOT 
AACGGTTTTGAAGGAC^^ 
ACCACGGTGTTGACCACTI^TTTTCCAGQ 
TCT(»CCOTAACGGT^^ 

TCAGGCAGACTTCATTCAGTCCGCCGTATGGAM ! 

SEQ ID 3440 

JfTARISRPPYLNVQCRLK^ 

ITliHDPFKRQPFEQBIiLMflRIJiF^^ 

LHDLRHQRIHQSRVDFGCFGCRKTCQHCSNNCGSGLFLFHGirc^ 

RCRKFAFFQVLNCZJKRKNT^ 

NGFEGRFLVVARLFRFG^FGKRIFKLCRPVEFQA 

SGRU1SVRRMEILVRGSRPDAERVNHAA ! 

SEQ ID 3441

CTCGTCAC(XXXXjAITOTCG^ 

TCGGCACCGCGCACAACG6AATM6AATACSTGATAATA6CCG 

CCATTTTATTGGTCAGCCTACC^ 

GGCGCAGCAGGCCMTCCGAAAGTAACCGGCCACMTGAACCGGTACKX^ 
GCGCACCTTTTTTCCAT 
SEQ ID 3442 

GAAGQSESNRPQ*TOTMVQHRIRAGGDETFFQNmSHSSAPm 

SEQ ID 3443 

ATCW^CGCCCX?^ 

ACATCCTTTTGAAACGCGMGATTTGCAGCCGGTT^^ 

GGGCAATCACGCACAAGGCGTGGCGTO^^ 

GTSGTTOTGCXaQGGTCTTTCR^ 

C(X3TGGGGATGGAMTraEAGC^ 

TAAAGTTATCGGCGTACAGACCAAC<& 

(XJAAACGGAACCTTCCGCCTCTGCAA^ 

(£0(£<3CGTTGGCGTTG^ 

6GAAC66AGCGMTT6GGC6AGGGCAA(X5AAGGTATT1^^ 

TAC(XX^ACGGCGGCGATGAAAAA(£G^ 

CCAATMTGAGATTGCCAAAATCCATATCX^ 

GGTTTCTTGGAAAGTCTGGGATACAGCTATC^ 

SEQ ID 3444 

MNTPLPYSDYLIRIl/rASV^ 
TOKGVSYNDAYDYAMELAEQEGLTYIAPFDDP 

GNGrFRLCKEwamTvi^ 

YRYGGDEKAHIFVGbQAAGPQDIAV^^ 
GFLESLGYSYHEETQHAAYKLFIiA 

SEQ ID 3445 

AT6AGTATTACCATTCAAAACTTAAACAAAAAACX3OT 

(X^AAACCACACTTTTACGCAT^^ 
gaACACTACGCCCTCT^^ 
CTCAAACTCGTACAACTCTOCXy^^ 
ACGMCOTTCGGCGCGCTGGATGCCAAAGTGCG^ 
^TCCCCGCCGAiC^
TG GCAGC(»'IWXX»3CGCW^ 

^AVSWSGHIRGEWnPIiVEIFYHEYHFTQftPSIiSGVLaLL 



CAGTGGOOTrGGGCA^ 
SEQ ID 3450 



QWAIjGRRSGAKG > 
CGATGCGTACfXSX^CCCTOCM'ITrTTMGCiACGCCCCCGACAK 

CTWX^TGCTGCAAACOC^^ 
GATGTCAA&GGCGTATTGAGCX3KACCCGGCTT 

W^KGDSAYEQHAVO^LU^^ 
YEGWVKSGKAIGK^^ 
UKCCKMPDDIAGFTO 
DVRGVLSVm 

CAAATCCGTTAACM 



QIR*Q 
SEQ ID 3455

ATGCCGGCTGTC6ATTTGATCC(3C6AA(3G^ 

GOTAOOOCTCflO^^ 

SEQ ID 3456 

mVDLIEERIiQTLDPLVLEIGDES^ 

SEQ ID 3458 

MPPEKRKKWKVWCKDffVIKRIiAVFQAGAHP 

SEQ ID 3459 

ATCCAGATTTTATCOTT^ 
ATGAAGGCGTGGCGGCACK£TTGCG^ 



SEQ ID 3460 

MQILSFQADIAERMLKSKSESVNE^ 

ATCXK^CACMCTCT^ 
CCGTACGGACAMTTGTGCGTTra 

SEQ ID 3462 

MPItfSAGASVAWSAIFSRF^ 

S TOAC^^ 
CCGTACGGACAACGGCTATTGGATTGCGTGGCATGAAGGCX?^^ 

TGCGCGAim<mCGCGGCGAGTTI^ 
AAAGGCATGGCAGGCTTGCCGCAAGGGGTGC^^ 

KQlftGLPQGVQGFAVP ■ . j 

^ TTCAaLaW^ 
AOTMTOGGAATTCTTTATAGAQCA 

SEQ ID 3466 

UIKNYIGIIIRQBQL^ 

SEQ ID 3467 

ATGACMTCATTTTMGCATTTTMGCCT^^ 

^j«rCCACGGT6CTTGC6GCAGTTCATACG^ 
ATTTCTGATATTATCATO 
TATTTTTTAAACATCATACCCGAACTGAT^ 
AACAGGCGATGGCAACACTTTTT 

YFLNIIPKLIGIAIJ^AWYVIjIJGLP 

SEQ ID 3469 

GTCTTCAACAACGC(K3^^ 
TCGAGAAATTCCMGAAGCGTTCMTGCCT^^ 

GACGAAGTCGCCAAACGTTTTGCC6CA 
DEVAKEFAA 

TGHA 

SEQ ID 3472 

MVTPKSVNRI^ISYFSEFQKPIAGKYRQNAQKNPINTI^ 

AGATTCTGAAAAAMTOTCGGGCGGCGTGTT^ 

CCGCXATCC&TGCCX^TGGGGCA^ 

TTTGGTTTCCACTTCCAACGC^^ 

S ^ffl^MFHYGKHHQTYITN^ 
c

SEQ ID 3476 

MRGQCVRLIRQU 



OCICMMOGACGQGiCWTlCMCW^ 
SEQ ID 3478 ^.nvoT.nmrcrmiTOI^MFlKI^^ 



TOAATTTAATI wv ^v^.^"^Y"Viii»- uu uuu^ -™ 

ctaTAGCxxrpccmA 




TTCAGAI 



TGAA 



NALPDAVAIHYA1 



CTGATTCCCGTTGCTGMTCCATAGATATTAMTATCCGGGTTTT^ 
GCAGCTGITCCAAARTAGCAAMOCGGCTMGAAAMATCTACCTTC 

GGTCAATCOTATGCGGTCGWa^ 



CVRTOVSGCPEDEDAGKEERFRYB 
CCCAAMTCAAAAAATCGGCAATACCCGCCCAC^

GAAAACAGGCAAGACTAMTGAAA^ 

AATCAAACTAGTATTCGGTTTAGTGTTCAC^ 

Rl^SGIGTPVPSRIHQHQ^ 
VVmPINNIJTISMVNRM^VFVFNQ 
AIRTHMraWEHQPPmXffiQGNEDTGVVD^ 
NQTSIEFSVHHlHPPCNAftlLKQQ 

W3AATGATTTAATCGAGGTTAGGA 

MTTAAA(X!TOXrCrTGG 

TACAACAAAAAA 

SEQ ID 3488 

HCEFKDFRHmPCFBEYD^ 
YRKK 

SEQ ID 3489 

TraCTCTGGAATATTTTA 

TCCCGGCGTGTACAGCGMGTOTGATC^ 

SEQ ID 3490 

LTVEmiJATIXSEDVHEMO!^ 

SEQ ID 3491 

ATGCGCAAACAGMaCTTK^^ 

OTACAATACCX^CAGAGGATCA(?ItXX!AACGM 

CAGTAAGGTTACGTTTAGCGAAAACTGTGAAAAAGGTCTGTCT 

GTTGAGGCGGTGAAGCGTTCTTGOCCTGCAA^ 

ATATTATCGMTATTTAGGCGTGAAGAACGGACAAAATGTTT 

GCAA 1 

SEQ ID 3492 

MRKQmTCIPTSI^RGSMflVLMVMIVW 
VEAVKRSCPAKSGKNSTDLCIDN^^ 

SEQ ID 3493 

ATGATGAGCMTAAAATGGMCAAAAAGGGTTTACATTGATTGAG^ 

AGGCTATAAGATGAATCOGAAAATCGOCAA^^ 
•FKJICGGTATGGATGAACAGCGTGGGCGACX^ 

AA 

SEQ ID 3494 

lsvwmsvgkykcrdatsaqa^ 

SEQ ID 3495 

GTGAAGCTTTCTCTAATCGTAAAAAATAGG 
TGCCTGACGAGCCGACCXXTATCAATACGGTAAACAA^ 
SEQ ID 3496 



SEQ ID 3497 

ATCAAAGATTOATTGTCGTCXXX^Ttt 

AAACCGTTATCGAAGCCGCGCOGGGCTOCGTC^ 

TTCTATGTTGlTEACCGTATrGATAG® 

SEQ ID 3498 

MKDtLSSPSFUiFSCMIA^ 

SEQ ID 3499 

ATGATGGGGACGACAATAAATOT 
GCGACAAACAGGTTGACAAATCAAGGTM 

SEQ ID 3500 

HtfGTTINLSFPFrPKISGQQAGQRAG(^PRAV^^ 

CGGCAGTAG03CCCATGTTCG(mC^ 

SEQ ID 3502 

MAFSEIQKRKAAGLPAFWFYMGACT 

GCATGGGCGAAOTTCTTTTATGCCGC^ 
SEQ ID 3504 
TCATftMGGT&GGAAGCA^
GMGGMACTTCAGO&TGCAAAACAG^^ 
ATCGAACTCAMTCGATA^^ 
CGGTCGAAWCMKPTTTCIftJTITCM 

SEQ ID 3506 

VFTISILSVTQPY*SSNQKVKAGCQR^ 
DGYFSDAKQGDDT*RK!W9iASAEIRTYTQAI^^ 

MTAGtffCTGAATTTT^^ 

^A^IS 
GVQAWEAKGKIVTI IAGGKENYSTEPYYDB 

SEQ ID 3509 4 M ^„»m^^mvM^^ 

TTGCTTTCTCAT^ 

T(XjTTCAG(XXXOTTCCTGTA 

TATGTCGAAGCCXjACAGTATTGTTCTG^ 

TGGGGAAAATTGACAOTATGTGGCm^ 

TGATGCTTTPTTTCTTTAITATTC^ 

WatlOT^FLIFKGIAl^^ 

GTTCTGCAGOTTGGATTTGCGCGCC^^ 

(XX3TTTGCCGTCMGCCGTTCGAGOT 
GTACGGGTTCTCAC 
SEQ ID 3512 
VYNLQSLKPAV^^ 

PFAVRPFERIA^mVPWQAGFKRVEEFVGSSRGEGGFGSTGSH 

SEQ ID 3513 

ATGCCGCCTGAACOTOTra 
ACX3ACOK3CAC(^TAACCATTTG^ 

SEQ ID 3514

MPPEPSFWfflSISLVmTAEAAUTAASDKPIiIOT 

ACC(X5ACACCGAAAGTCATTACGGATGCGCTGACX^ 

TTAIPGACGGCTTCACGGTCAA^ 

ATCGTCAGCCCGAATCCCTTTTATCAGATTTA^^ 

MGAGGTTTGGGAACGCACCAAACTGGTGTTCGTC^^ 

TATTGCCTCGGATGMTGCTATTCCGA 

TCCAAGCGGTCCAATGTTCCGGGCCTGCGTTO 

GTGCCAGTATTGCCGCCTGGAAT^ 

TGOTCGTTTTACATCTGGTTGAAAGTO 

CAGGGGAATOCflXaGGAAGGTTATGT^^ 

IVSPNOTQIYEC&TLLGGGE 

SKRSliVPGIjRSGFTAGDAELLKNPLLYRTYHGSAMS 

QGNPGEGYVRIALVADVATCVKAAETIVSLYR 

TAiTGTOCAACGGOGATCTTTAC^^ 

GACCTGATACAGCOTACCGTCreGAMTGGGAAC^^ 

CAGGaGmTAaOTTGTCAA^ 

AATOTCA(?ITTCTATGCCAAACGCGCGCGCGG 

GCGGCTTCC^TCAAAGCGAATGGGTGT^ 
SEQ ID 3518

MFFVLSPAKNLNEKDrc^ 

DLIQPYRliBETSFAlGiRGK^^ 

AASNESEWVFMRSEQIK 

SEQ ID 3519

TTGGCAAAATGTATAMCTTCAmTOTTOT 

SEQ ID 3520 

IAKCINFIIIPTLQETCSVAERP^TVLKTVEG 

SEQ ID 3521 

GTGGGTTCGAATCXXXXXXSTTTCX^ 

SEQ ID 3522 

VGSNPAVSAAKQNRPDSGRFFPVQVVLITI . . 

SEQ ID 3523 

ATGCAAATATGCAttGATGTTTTTG 

CTATGCCTGATGACGAAATTTACAGACTGTAC^^ 
SEQ ID 3524 

MQIOTVFBKRIflnfVIHnG^ 
SEQ ID 3525 

ATGTGTAAACATTOTAmGC^ 

SEQ ID 3526 

MCKHLY* P*EITAIiLKIiQAGKIErS*ARKRRITVQTAVEIifl 

SEQ ID 3527 

CTGTGCXrTTCATAATTCT^ 
TOTTCCCGmTGTCmAMTGAACCOT 

SEQ ID 3528 

VWVHNSCPPKRTGSSKNEJOIGDG^ ' 

SEQ ID 3529 

TTGTTAATGAGTGCTTTGTCTTCTTAl^^ 

TGOTAATGCATCGAATTTTGGTATTTCCT . *; ■ 

SEQ ID 3530

HliSAISSYGADSTIflE^^ 

SEQ ID 3531

TITrTTCAGTTTTTCTAATTCAAGTO 

SEQ ID 3532 

FFQFF*FKFMFQLFTOFGEV*nGKFQ*ISFFFSVK*m 
SEQ ID 3533 

ATGGATGCTTTAAMTTATTGACGAACCGCCGATCT^^ 
ACATGCGCCCOTCCGTmACCC^ 

C^CTGGC6GCAAC6GCTCAGG(^TTC^ 
TGGTOmCGCCGAC^ 

SEQ ID 3534 

MDALKLLTNRRSSKKIiKHPAPDAAEIiEQII^ 
QLAATAQGFDNVWITGMWVNSPIiLREAFGOT 

SEQ ID 3535 

TTGCAATCTTGCGGaTCTTOT 

SEQ ID 3536 

LQSCGmiliHPFMVMYMVNFGQTC 

SEQ ID 3537 

CGCTGGCTGAGGCXX^3AATGGCa 

GTCGGCTGCCTG^^ 
TCGCGCAAATCGCTTCGGGTGTGTTTGAACWj 
TGC6GGCA6GG1H3CATATCCATCCTGTAAC^^ 
CGTATCCCCGCCGMCGGAAAAAGGAGCGGCGGGGGCATGA^^ 
GTGTGMTTTTCAAGGGCATATCMTM 
GAAG(HCTATAATGATGTG<XX3GAA 
MTGATGTTTTGaGGCTTGCCGACAACGGAAAA 
SEQ ID 3538 

LSDGGGLALWVYPTCRR^^ 
VGCLRVMmGIVOTIiRKTKNSIiKHVFAFAV^ 

RIPAERKKERRGHDVPLSRAMRWVLDQARGI^VIW ' 
NDVLRIiAENGK 

SEQ ID 3539 

ATGMCATCACX^AACCGGGCGCAAGGATAGACCGCCCGACCAT^^ 
GGAAACAGCTTTCTTATCTiO^
(X^AAGCTCCGAGCGGT^^ 

CCCGCGCCT(aTCCAACAC^^ 

CWMJQO^CCPOC^ 

TCAATCAGGCGCGGCAATTCGGAC^^ 

OUACGCAAACACX^TCTTCAGACTGTTT^^ 

ATTGCCGATAGCCGGAAAAACCCACCGTTCAAAATO 

TCCGCCC(XACCTim 

QITQCCTTCXSaOCm 

SEQ ID 3540 

MNITKPGARIDRPTIRSLIAYATCRIKPISNSTUiRMEK^ 
PKIMWSRLRPRPQRMGKQL^ 

IADSRK^PFKIPHNLSGILFRPPLRPTHKPI 
NUCRQRHDDFCHYGKPPFVQNLPYTIjP 

TTATTC^TAACGCTGATGGCGACAGCTTTGCC^ 
ATATAAAAAAATCGGGAAGAAGAAA 

SEQ ID 3542 

MESSDVQGRMETLAFATAPKK(»I^ 

SEQ ID 3543 

(XSTCAAAAAMXXJGTCSGC^^ 
ATTC 

SEQ ID 3544 

MPIMDAIAAC^AVKKRKPGQIGI^^ 

TTOTACGGCTTGGCATGCGGCAATTGCGT^^ 
GAAGCCCGTACCGTTCACGGCA 

SEQ ID 3546 

LYVGRMCFADRFFWjRLLNT^ 

S ^E^CGG^ 
GCTGCTCGCCCAi^CCAAATCTGCACT^^ 
QiGCGCAAACTCAGCCGTCAAACCGGCGCAT^ 
GCCGATGTGCTGCTGAAGGCAMCCCAAATGTTTC^ 
C^CAACCCAAACOSCAAACGGCGAA^ 
AAAACCTGAAGCAAAAGTTGCCGC^^ 
CTACCCXKn i GCAAATACTGCCGa^ 

SEQ ID 3548 

UmmVSSAQAVRDPVLVFRIGAGAQ 

VPAANTAASETAAESAPPRSRRFQIRHADRRHR 
CAAAAGTTCOCMTTCCATTAATATGGTTT^^ 

^ ATOGATMM&GAAA^^ 
TAAATGCCGAGGATATCGCXTTATC^ 
CAMTOTAO^ACGCXSGGAAMCAATGATC 
MAACCT(X;CG«X^GGAAAGCCCT^ 
CCATTATGGACGGATACAAAAAGAAAGAATATCCGGATATGG(XX^ 
GGMTCXK^TAAGCiAGTGMGTTACACTATGCCGTGAAC^ 

SEQ ID 3552 

MDKEKVbDKIKKCI^^ 
KTCRAGKPSHRTYLM}RFCGGWIASAWETVKKF 

SEQ ID 3553 

TTG(XXXCAAA(?rCCGCCGATAC7GC(^ 

CGAAGCCT(mAGOT , TG^ 
CGAAAGCAAAGCGAAAGACGAAT(^GCAAGGGTGCGAAAT 
SEQ ID 3554

LPPKSAWAANTRAYLTAFQEAGS 

SEQ ID 3555

ATGTTGTCTGGCGCAAAGGTATO 

CAGGAAGCAGGCAGC7K5AACGGTATATTMGCO^^ 

GCGGAAACAAAAAAAGCCCC^^ 

SEQ ID 3556 

ffi^AKVSCTEALTCGIMMRLVAAKVRKyCGEHP^ 

SEQ ID 3557

TCTCAAflXX£AAICATAAAGCTCT^ 

SEQ ID 3558

LTARSCRVDSFAARLAASICFAVGFVATTO^ 

SEQ ID 3559 

ATGAGCTATTTGGAAGATGTAAAAAACGCAra^ 
MGCGGATACCTCTTTGGAATTTCT 
SEQ ID 3560

SEQ ID 3561 

jVTGACCGCOTXmCTCTACC^^ 

AAGCGCAATCCGTTATCGCTTATAWAAAAAC^ 

AGACTACTTGGCGCGCAATATGftKHa^^ 

CCXX3CC6AATTTA!TGCGTGAGGCXXn^ 

TOGA 

SEQ ID 3562

MAIALYRCAADVQAAUTC^ 
PAEFMRRAVKTEPEKTA^^ 

SEQ ID 3563

TT(mCTGCTTCG(&C<X^ 

TCACCATCGGCAGCGACGACCTATTCATCAAC^ j * ' 

SEQ ID 3564 

I£LIiRTATEKGNKKMANID i 

SEQ ID 3565 

(EECTCTCCAGTTTGGAIATTO 
AAATGAAGGAAATGCCGTCTGA&&AI 

SEQ ID 3566 

VLSSLMCMF^VCRPDQGAAVERAGTO 

SEQ ID 3567 

CCCGATmmCCGCACCGACAGACCTAGATTCCCGC^^ 

CO^CTCAMOmGCO^^ 

SEQ ID 3568 

PDKFPHRQT*IPA*AGMTVRVFLTIH^ 

SEQ ID 3569 

TTGMOTAAGTGT^ 

AGCGGCAraCMGATTTTAGTGCAGTGGGAAATCGA 
MGCCAACTGGCAACAGAOCTCAAAAGCTGGCGCGG^^ 
GAAAGCAGC6ACGGCAAMCXACCTATGCCAA(^TT^ 
(m^SAOTACGGCCTOTGA^ 
GGAAGG&XX£CGGAACACCra 
SEQ ID 3570 



ESSKKTTYAMSAISNKMKSYTPKHPDffiVFA 

SEQ ID 3571 

TTGCTTTaTCTTTttC^ 
Tj^^TGGCOTCTGCCTGTTTT^^ 

SEQ ID 3572 

LLSSFTLKINDIFGI^ 

SEQ ID 3573

SEQ ID 3574 
VAPOjF?(2ffPCPGGFRGPAGYLISSGlCPyGSVSPPPL 

SEQ ID 3575 

AT&U3CTTCCATCCCGA^ 

SEQ ID 3576 

MSFHPETAYMX5GBTEPYGPSPEEIKYPAKPRKPPKHGG 
OTTCCGCCTCCCTTGGCGAATCCGAAG^

SEQ ID 3578 

LWTAI^I^YKRFRDGSSPFTUmFm 

GCCGGAGGGTAAAGAA 
SEQ ID 3580 

' i,RLSSKymYWRRAMPSDSPREmVRQREHPM 
SEQ ID 3581 

ATTGCGGCAATGCCTATGGCGTTGGCTCAAAA^ 
ACGGCGATGCGGAAGTG(XX3GAGGG 

SEQ ID 3582 

MKTRN1AFKFAVIAAVIAAGYAFGF 

SEQ ID 3583 

MTCCGCGCTMTCGGAATOmAAA^ 
TAAAGATACCGAAGCCGCGATGAACTACGCCATAGAC^ 

SEQ ID 3584 

VKYYGTMYGSPDWGMERYYARH^ 

S aS^ 

TGCGGCAAAATAArcMX3CA!KXXX3^ 

SEQ ID 3586 

fflVKVFI/TIHRAVKrVFSTPSGK^^ 
SEQ ID 3587

CAAGTGTGTCGTTTACATCGTTTTTCGGAGCGG 

GAOBATTCTTC^^ 

CCGTC^CAGimCT^ 

?TTCGTCGGTCGGCX3TGTCGATG(XAGAA^ 

AAGCCGGAGTTTCGGACACTGQ^GTTTG^ 

TATOKSTTTCTCrAC^ 

TTGATCCTGTTCAC&TTCGG^ 

GACGGATGGCGGCGGCAATCTGTTTGJ^^ 

SEQ ID 3588 

MPFCAAARLSKSKSMRLBGVSVSTSNVCFAD^ 



YEFLT 
SEQ ID 3589 



GCG6CXMTAGCGGCAGCGGCTACGGCAACGGCAACGGCT 
CAACGGCTACAGCAACGGCAGCGGCAACGGC 

SEQ ID 3590 

msyldqp™ctgngngs™ 

SEQ ID 3591 

ATGGAAGCAMTAAATTTGAAGTGAAAAGTTTGTCAGACC^ 

ACGAACCGCMCATGA^(£AAC^ 



SEQ ID 3592 

MEANKFEVKSLSDUK^ 
SQICGELDSVWLEAIEIIPVTGGAAESIRTALBVAQS 

SEQ ID 3593 

ATGAA(X3AATOAATCAGCA(M 

SEQ ID 3594 

MNELISRINRFGARAKDEQSLU^^ 

SEQ ID 3595 

ATCATTCAOTAAATCTCCACT 
GCCGCCTGTQCCTGTCTGTTTCAOTTT^^ 
ATCATATGACGGCAGTGATACAGAAGAAAACGTG^ 
CTCTCCGAGAGCCGTCCTTT^^ 
SEQ ID 3596

MIHIflVHYC3)EIIRHAEroKySLIGI 
LSESRPLSPSWIiPYHS 

SEQ ID 3597 

ATGAAGAACTOTACTCCAACTMCGGCACATGGTT^^ 

GOTGCGGATAMCGGGCAGATTAAATGCGTCAAACC^ 

GGCAGCAAGMTTGCCGATAAAATAGCOTCAAAG^ 

TATCTGTCCCGCAKX&A^ 

ATATGGAGCAGGCA 

SEQ ID 3598 

MKNCTPTIKnWKMGATITOVAAm 
YLSRIENSKQSLQOKTVQ^^ 

SEQ ID 3599 

ATCGAGCTGACCGTCtt^ 
TTGACGGCTPATACAAAAACAGATACTXAGGGAATATG^ 
ATTTGACAGCGMGMGCC(^GGATACAGAAT^ 
GAATTTGACTATCAGGC^GAACACCC^ 

SEQ ID 3600 

MEMVHFKAEQDLDRtf 
EFDYQAEHPITKRIIKDYEELYSN 

SEQ ID 3601 

ATCXXnTTTAmT<XaraCCAAT(OT 

AGGAGGTTGCCGATCATTITGGGGTGAAACAGCCAAGCG 

AAGCCATTTCGGCATTGAAACATTCAGAGTCTT 

GAGOTTGTGGATTATGTAACCGTCGCTGCCG^ 

GCGACCTMT6TTCGTTGATACCGCCGTCGAAGCOT 

GATCATCAGCGACAACAGCTCATACCGAACCGAMC^^ 

SEQ ID 3602 

MPFIIDYQCLCAINSEEKMFSGEQLGQAISi^ 
EVVDYVTVAAAWAREHjGGNLNKIQVITARGDSMEP^ 

SEQ ID 3603 

TTGCGCATACATGCCGTAACG6CAACCTTATACGGCTTACCOT 
TGCTTCATCACCTTAGGGAACCGTTCCCTTTG^ 
SEQ ID 3604 

SEQ ID 3605 

TTGAAAAACCTGM(mAAGTTGCCGCGCCC^ 

ACCTCTACCCGCTGCAAATACTOXX^ 

TCTTGCGGAAACAGCCAG?^^ 

TTGCTGCWGCCGtXXXU 

ACGACGCGGCAGAW5AC(XXIATAAAAAT^^ 

AGAGTTATCGGCAAAACAAACGTTCGATGTCGAAACCGA^ 

GATGAAGAAACCCAAAAACGCGCGGATOCXGAT 

SEQ ID 3606 

LKNIiKQKIjPRPKQKRKN^ 
LLLAGOGTALIALLLLLLRIA 



SEQ ID 3607 

ATGGAAACATTAGACGAACGAATAAAGAATTTAQGAAAATC^^ 
AAACGCTATGTGAmTATTGAAt^Tnm 

SEQ ID 3608 

METLDERIKNLGKSLEDRI^^ 

SEQ ID 3609 

ATGAGTATCCAAAMGCAGTTGATTATTTTGGTAAT^^ 
GGTGCGTGCAGAFTCAAAAATTAACCGGA 

SEQ ID 3610 

MSIQKAVDYFGNESRLARAIGVKQOT 

SEQ ID 3611 

ATGAAGAAGCAGGACAGAAACCGCCTGTCGAAGAAAGAC^^ 

AC«5OTK^TT^^ 
SEQ ID 3612 

MKKQDRNRLSKKDRRLIKKAMLKAAMOT 

SEQ ID 3613 

ATGOCGTCCGAOTATCCTTCATACTCGCCAA 
(ECAAAAACCCGAAGAATCX&AGCCAAAT 

SEQ ID 3614 

MPSDLSFILAKELIRSGSIRLSGST^^ 






SEQ ID 3615 

TTGTTTGCATTGCGTTTtnTTTTG^ 
TCXMGGCGGTCAACCGTCTTTCCG^ 

SEQ ID 3616 

imHiFMHNRIiSPIiPEPL^ 

SEQ ID 3617 

GAATTGAACCA 
SEQ ID 3618 

VTQLAARITDLEGRGTOAKPmVGNCKNPV^ 

SEQ ID 3619 

ATCA<mGGAACAAAGAATACG(m^ 
ATGATGAAGCCGTC 

SEQ ID 3620 

MSREQHRGARLPWAHHQAE^ 

SEQ ID 3621 

TTGATGATGMGCCGTCTGAAAGTTTGAGAGCGGC^ 

GGAACGATAAGACGaGTATGMTCAGGCATTTACCG^ 

ATTGATTGAGACTGAAAAACGMTTGAACACCGCAm 

ATCAACAKX(XX3MOTCAAAATCMCATO 

mraAGAACAGAAG^^ 



CCGCTTCAGGAMTCGCCGATTTGTACAACG^ 
CGGtXKsCGCCAAACGGCAAGGT^GGTTCGGGGA^ 
TGCGGTCGGCTTCGATTGGATTTTCAAGGOGGGC^ 
SEQ ID 3622 

LMMKPSK&RAAGRPIAY^ 

PRRRK3HGV 



PI£EIADLYNEVLGGRI£SVQV^^ 

SEQ ID 3623 

ATGCMTGGAAATTTGAAGATCCTCAAAATACTGCTOT 

TAGATGGCMTGAAATTCATGMGGAGATGCTGTGATAGT^ 

TACTATATCGGAAAAATGGAATTTTGAT 

SEQ ID 3624 

MQWKFEDPQmVITTRRII£KK^ 

SEQ ID 3625 

ATGAACCGMTCGAGGAAACGKM(XXOT 

CTGAAAAGTTTTACCAGGCGCAACACAGGATTA'ETTTC^^ 

GGAAAACGCGG(?IK5GCCTGGCTTACTTGAT^ 

GCTTCGGCX&CGATTGAAAAAATCGCGG^ 

AGACATTCGGCCAGAGCGTTG^ 

TOAAACCTGATTCTGATTGCGQCGCGTC^ 

GCGGTAGAGCTGGCAAGGCGCGGCATGGCGGCTGhGTGCAATATC^ 

MGAGTGGMGTITGACGTGAACTGCGACCTGCTCAAC^ 

G(£AAGGG(AG(X3AGGGACGAGCTG(X!(^ 

AAGCAGGCAGACAMCGCCCGMCATGGCAGAC^^ 

GCATTGOXiMCT^^ 

ACATGATGAATATGACXXTTACAGTGTC 

SEQ ID 3626 

MNRIEETEAVQSLASVGAB^LGGILIEPTAIARCA^ 

ASAAIEmVSKIX3GT7AEKLSKAADELAAAGm 

AVELmGKAAECNIPMQNIi^^ 

KQADKRPNHADIRGS(M^ 

SEQ ID 3627 

ATGCGTGAAACCTGOTCTATO 

GGOTAAAACGCTFTGGGAATATCCCAGOT 

SEQ ID 3628 

MRETCFYCNHADFKTMOTPVRGFAKCAKAR^ 

SEQ ID 3629 

TTGCGTTTTMCGCGTTTTTTAAATCTC^ 

OT^CTCCCGCCCCC^ 

ACCGGCGTGCCGGTGTTGGTTT 

SEQ ID 3630 

LRFNAFFKSRVGWDI PKAFYPPSR^^ 

SEQ ID 3631 

AOTCA&CTTCAAOCTCGGCA^ 
GTTCAAGMACTCAAACACCGCC^

GAGGCATT 
SEQ ID 3632 

KJEDNFAWAAACVEDIiLEKMPPEA^ 

SEQ ID 3633 

SEQ ID 3634 

IiFYFLSVGFMGRGRI PAASVI PAQAG IRTLEQQQYf&I I 

SEQ ID 3635 

ATGGATACCOTGTTAAGCATCATCACC^^ 
AAGACGACTTCGACGGCTTCGGATAT 

SEQ ID 3636 

METLLSI ITALSFAGMTIAVWI^VEAADAVLRRKREGKG™ 

SEQ ID 3637

TTGC^GGTGCGCATTTCTTCCGCC^ 

SEQ ID 3638 

LQVRISSAILIFPFVFMCLISEAVEVVFAFAF 

SEQ ID 3639 

ATGGCGGMGMATGCGCACCTGCAAGGCCTGCGGCGG 
GCAACAAGGCA(^GGO^«X3CXX^^ 
(MCCGttCTGT^^ 
SEQ ID 3640 

MAEEMRTmCGGTKPLERGPNAVPRKEGGVYmSCKTCRmVRQm 1 

SEQ ID 3641 

TTGGTGGAAATCCC^AATAAGCGGTTCAAMCGTCrc 

CCGra^TCATTCCC^ 
SEQ ID 3642 

LVEIPNKRFKTCRCRGSHIQt^ 

SEQ ID 3643 

GTGGAGGCAACCGGCAGGGGAATGCGCGTGATA(X3(^^ 
ATMGGAMCCGTCCGCCGTATCGCGCMGGGGCGGGCGCGATO 
GArcGATTTGGACAACGCCCTGAAGGTTGCGCTGG^ 
GGTTTGGCGGTGGAGGTGGGGGAATTGGATGAAAAG 

SEQ ID 3644 

VDATGRG^VIRLILPYPVSAMYWRIWRNRAVRSAEAM 
GLAVBVGELDEK 

SEQ ID 3645 

TTGCGGCAAACAATACM(XTATATCCGCCACTTT^^ 
CCGCCATATTCGGCGGCAATGCXjCCGCACCTGCCOT 

ATCAGGCXOGGCCX&CCGCCACAAAGCOT^ 

ATATCGGTTTGCCJGATACGGGGTAAGGCAGGAT 
GG(^GGCGGCGTGTGCGGCGCGGATGTA!KX!GTG^ 
GCACGATTTQ 

SEQ ID 3646 

LRQTIQPISATFCRCDFSASVIiljFIQFPHLHRQTAACDRIiAAIF© 
PDRPVAPNPPI SVCRYGVRQDKAYHAHSPAGCVHR^ 

SEQ ID 3647 

ATGAAMGTAAMCCGAAGCGGAAAMTCAC^TCTGCAAAAAGTG^ 
(H y GCAGGCGCGGGCTGCGGTAGMTTTGGMTC^ 

SEQ ID 3648 

MKSKTEAE&SHLQKVADIGCIVCra^ 

SEQ ID 3649 

ATGCAACAAMCCAMGTCCGCATTACACCTTO 

SEQ ID 3650 

M^l^SPHYTFREGRLNPPPRTIBQAVLYIKTPR 

SEQ ID 3651 

SEQ ID 3652 

LADCAEGEIQTAFAEGVMRTLVLLHEKGLIKL 

SEQ ID 3653 

ATGAATCAACAJ^TTTGAATTTATT^ 
CTTGCCCGCATAAGCCTCAAATCGGCT^ 
CGGTfXmGTCGCAGGCAGOT^ 
CAGGCGGCAGGGAATGATTTTGATTTGCTGTTTGAGAAGOT 
SEQ ID 3654

MNQQEFEFfflTOLARAFERRTO 
QAAGNDFDLLFEKV 

SEQ ID 3655 

AT<X3GAAGCACATCA^ 
TTTTTTGTGTTATTGATAGCAGGCTC^^ 

SEQ ID 3656

YFVFCVIDSRLSYE

SEQ ID 3657 

ATC&CTAMGCTATTAACAAGTT^ 

CXXSCTCTTTTAAAAAGTGGCGAAA^^ 

TTATTTAGGGGGCGGGAAATCCX3TAGCAMTGAAAMCA6AAGM 

CTGGATAAAATAAACAGCGTGGAATGTTTTCAAAGC^^ 

TGGTACTTTCCGATAAGAAG 

SEQ ID 3658 

MSKAINKWIEREKEmiiLFAQREKFYW 
LDXINSVECFQSSIRISQSGKNKPIFF^^ 

SEQ ID 3659 

ATGMCAATTCAGTTCAAATCTCTAATATTTCTATO 

SEQ ID 3660 

MNNSVQISNISIHQTEIGIlFSLKDL^ 

SEQ ID 3661 

GTGTACX^CCTGAGACCTTCAACCCGTACAA 

ACAACAACCTGTTCTCAAACAACCATATCGGCACIAGAA 

AACAGTTGCC6MGT(X!AAACA(X}GGMGAGTTGCAC^ 

TOMCATTGCTTCAACC^^ 

GCX^GGAGGAAACGGTAGATTGGAGCAOTACCT^^ 

CCTTGCCGAAATOATCTO^TCGOT 

GGTATT

SEQ ID 3662 

VYEPETFNPYNFVEIVIOTPEPBSVAQTAE^ 

SALLQPAEAPAVEEHAAEITLETPDSNTSEADALPDFLKDSEE^^ 

GI 

SEQ ID 3663 

OSCAACGGTTGTCTGCCTTGGGCGGGCGCGGGTGGC^^ 

a^CCGATTOXGCCGCCGTCATTCOT 

TGACG<MTCTAGGTTCG^^ 

SEQ ID 3664 

RNTCIiFWAGAGGGRAVreY^ 

SEQ ID 3665 

TTGTTATGTAOTGCTTAAAMTTCATATCCTTCGG 

GCATTTGTATTCCCTACTGCATCAAACCGCC^ 

GCGGCTT 

SEQ ID 3666 

LIXTOLKIHILRACENPRKG^ 

SEQ ID 3667 

ATC&GCGGCACGAAACGCAMTTAG^ 
TGCCGCCMTGAA^CTATTTACCGTTGGTTGG 
TAGTGCACAAGCAGAGAGCGCGGCGGTTTCAAAGGCXj 
TCGGGCGATGGGAGCATGAGGGCGGCTGTACGGOT 

SEQ ID 3668 

MSGTKHKLGRPTDYTKDMAM 
SGTCSMRAAVRLDAEEYRKIAEDVLRSV 

SEQ ID 3669 

ATGGCACTA(X3GCMTT(XaA^ 

CC^TCACCATGOmATCT^ 

CGCGTCGGCGAT6GGG(X3CXn , GCCTGATTGCGAGOT 

ATTOTXTGATTTG(^ 

$£AGG(&TT(XKKXXK^ 

GAA6ACX3GCACGGCGTTGTGGCCTG 

GTACGTOCAAGGCT(&CAACCT^^ 

GCTTG<£GraCGGAAGACGGG^ 

ATATCCATCCCGCMGACC(XXXKXIACX3^ 

GACCGTTCGCGGCACAGGTCMCATCGCXAATGT^ 

AGACTtnTPGGGCCG^^ 
SEQ ID 3670

MAtt^FDDVETSVIRSLSSA^ 

IFPDLALAGESGHHWRTTAGGVMYATGAGGT1TGFGA 

EDCTALWEKHDIETLR^ 

ISIPQDPGQAGKSQTLYI/TRQIAGPSVSAGPESGDKVTRAGPFAAQTO 

SEQ ID 3671 

ATGAGTAAAAAGACACCTTTATCXjCMGGCTO 
CGCAGCAGGCAGAGGG(K^^ 
TGTATTGCGTTTGCTrGATCGAGGCGCGTMA 
GATGAAGCCGTTKXsTTCTTCCGG 
TGGGCGGCGGCTTGTACGCCCTCGAAGTGATGGACGGGGC^^ 
GGCGGCGGTCGATTACACGGCTGACGAGTTGA^ 
CGGCA(XnTOCGCGCTGGAATACTACACGG^ 
TCTCGGGCGAAACGGCGCAGC^^ 
CXmrTGCTTTGCOTT 
AAAGCCCTGATTGACGACGTGCTTGCCOT^ 
ACGCAGGCATOTGACCGCCGACGAMTC^ 
AGGGGCTGAAAAGTTGGGAMGTCGGAMGC(XX^TGAGCGM 
GAGGGTGCGGCCOTCGATTGGCAGGCCG^ 
CGATAATQGGC(nSTT^^ 
CGCCGCCGAAATGGTCGGCATGAAGCGGGCGGGCGG^ 
CGAAACGGCGACAGTGTGCAGGAATTGGCAGGCCGCC^^ 
TGATAGGCTGGGAAGGMCGGGGCTGGTTGCCGGCMGCAGTGG^ 
TTTCnm^OSGTGCGTTGACG^ 
SEQ ID 3672 

MSKKTPLSQGFIARVAAGVRYAFTOJA^ , 

BEAVAFFRSPDKEHTWM^^ 

RQVHAIiBmAGSVPDAL^ 

KALIBDVLARYMDMAAYEFV^ 

EGAAVDWQAGDIiAAELSRAAGVVAN^ 

RNGDSVQELAG^

SEQ ID 3673

TOOTITGCTCCAAAAAAA^ 

TOGGA 
SEQ ID 3674 

LVCSKKKPPREGAAKRTRLTRro^SGISPAKTGTTAHRQFG 

SEQ ID 3675 

ATt&CGAAGTTATACGCGCAMTW 

CAATGAAGGCGGCGATTCCCGATT^^^ 

GGCGCATATCGTTGACCCTGTTGCX3GT^ 

AAGCTGACGGAAATCAGCOTGTCGACCG^^ 

ACAAACmCCGAAGAGGMGATGAAMTCCT 

ATAA(KXX3G<mCGG^^ 

GTAAAGAATGACGGCACGCTGGATGACGTGGOGAC^^ 

SEQ ID 3676 

ffi^YAQIAKTEAQDKTVKW^ 
KLTEISLVDRPANPDAVFTCPKADKPKD 
IiKWLVEDASYDNIDEAVTAQIKESAAGIiAESLKALAASE 
VKNDGTLTOVATLIKAKQTGRL 

SEQ ID 3677 

TTGAACCGGCACACXXTO^CAGCAGGCOGTCGG 

SEQ ID 3678 

LNRHRMQQAVGNKKPSETPKGFRRHHRRRYQV^ 

SEQ ID 3679 

ATGMTAGAAACGGAGGGGTAGTGAAGTCTGACCAATCTGGAGM 
CAAAAAGTAAAGGAGGTTCAAATAGTTATAAAAATGCACAGGTm 

SEQ ID 3680 

MNRNGGVVKSDQSGBVLVR^^ 
SEQ ID 3681

TTGTCTGAGTTGG^ 

SEQ ID 3682 

LCELVHVHLSLLIIKKPP7S6VTDGLFVLP 

SEQ ID 3683 

ATGMCGTGAACCMCTCAaC^ 

lACrcGGTAlTGACCXXXOT 
CGTCtmACmCGOXXXSGTAIK^GC^

GCGGAmOXXGCGCG^^ 

GCGTTACACCTACACCGA(XXXXX3TO 



CGGTTACCACTGCCGCCW^GTGGAAATTT^ 

AATCGCCCMCCTCaTTCG^ 

CGCCTGTCCCCCX3ATA(XX3OTACG^^ 

ACATCCGCG(XK£TGTtfnTC 

OCOTACCnTCAGGOXZ^^ 

CAAGmTGTTCATGCCTGCOTCGGT 

SEQ ID 3684 

HNVNQLTQOTIELMKSAQAGGGPPDK^ 

ADYAARGFEDVRAIAVAQTIiQATMVAEEM^^ 

AAASVSGIEAGKmAVIPAWGAVAYAWVWGAAGSEK^ 

RLSPDTVYVHARDLAALTC^ 

QGMFMPAFQGTKVG 

SEQ ID 3685 

AroCOSlCTGMAAraC^^ 

C«^GCCG(K^(K^CMTCCGAACCTG^ 

SEQ ID 3686 

HPSENQTAPSFGDFBffTVKIiKAPEGFTDVSPGSQ 

SEQ ID 3687

ATGGCCGCCCCGGTAITCGCTTGAGGAGTTCA^ 

GGCGCAGCCTOTGGCCGCC^^ 
CGGCGGCTGGGAGTTOACAACTGGCTGCTC^ 
GCGGTGATGATTATCGCGGTGCAGCGCCTGAACG 
CGTCGGCGTTTGCGATACTCAATGA(n!^^ 

SEQ ID 3688 

MAAWSLEEFKQRIGVEHDRRIH)PFLSVIDGVSAAAEAnGR 

AVMIIAVQRVNEIEGKGVRSKTLAGETVAF^^ 

SEQ ID 3689 

GTGTTGGGGGTMGGACGGGGGACCTC5CGACGCTCOT 

GTITCGCGGGCAGTCTCAACGTTAAGGCTTCG 

GGAGCGGTO5TTCCIKXXaTOGGCAT^ 

SEQ ID 3690 



SEQ ID 3691 

TTGCAGGTCGTCXECGAACTTC 
(KXTTTOXmCGCCTGCC^ 
TGCCGGmCATTCACGCO^^ 
CCGAIT 

SEQ ID 3692 

LQVVRELRRQIAQCRSBEI^ 
PI 

SEQ ID 3693 

TTCCCGCATMGCGGGAATCCAGACCTGT^ 

?ETCGGAAACGATAAACGCCTCAAATGTTra 

SEQ ID 3694 

FPHKRESRPVGAETYRIKRFL*IIjRSGFPLS^ 

SEQ ID 3695 

ATGAATCGTGAGGCGGTTTATTCCGCGCTGTGGGCGA^ 
TATTTATGGCGQG(X£GATATGCAGGCGTO 
GAATCCGCraTK&CGCG^ 
ACCX&CGAGGGGACGCTTGGCGAACAGGCGGOT 

SEQ ID 3696 

IJHRBAVYSAUlAKIiD&LDGFTTO 
TDEGTLGEQAVCIIPIVICAA 

SEQ ID 3697 

^TGCAGTTGACGWTGGC^GCGGOGAAGTGTO 

TGTCGGCGGMTTGAAAGAGTTCTACXKnK^AAAAOT 

OTGTTTTTCGGTAC^ 

AAGTTCGCTGAGGACGCGGGCGTGATGGGCXiAGGACGGCA 
(X^TAAAGGCAAAACCGTTTATCCGAGOTTACC^ 
QftCQCIAGTTCAAAGGCAAAAAAGCCCTGT^^ 
ACCGACGAGGCGGGCTWAAAATCGGTACGCTGTG^ 
SEQ ID 3698

MQLTFGSGEVFAQMITDAYOT^ 

KPAmGVMGQDCTAnKVASSPKQGQYTVSESGraFN 

TDEAGFRIGTLHIQB 

SEQ ID 3699 

GTGGATTCAGGAGTAATGTTTATA!^^ 
SEQ ID 3700 
VDSG^ICSI^RRLFLFNQSQCmNI^ 

SEQ ID 3701

CGGCGGATGCCC^CAAATCPCTACCG 
AGTGTTTGCCGCXIGTGATGGACGTT^^ 
SEQ ID 3702 
MVRIKGVTVELNGADYVIPPIAI/^ 

SEQ ID 3703 

GTGATTGCCCACGTTTGCGOT^ 

TGGCAGCGTCGTATATGOK^TCMGCCGTC^ 

GCTCAAAGCGAMGGAATCATC 

SEQ ID 3704 

viahvcastgwtodyvai^ 

SEQ ID 3705 

TTCXXX3CGTGTTGAAAACCAAM 
MTOCGCCCATC(^CG(^CAA(X?ra 
TGCGGGCGTTGMGAATCCOKXAGTTTTT^^ 
ACGGCCGCTOCC^TTTCTra 
GCAGGGCGTCAAATTCGTCTTCGGACAGriCG^^ 
CAAGATATGCACX3GGCGGATGTTCGCGCCAATAOT 

SEQ ID 3706 

lrrvenqnalvgngviamqrraadldggpfi^ 

tmcrflrprtpyrrflppplqa(^enr™
SEQ ID 3707 
ATGGGCAATGCGGTTTTCCCCGAGTTCCCCGGC^^ 
CCTACCCGCAATGGCGGTTTTCGCTGTCGTTCGA^ 

AAGGAACGGCCCGCCGTCAAGGTCX5GCGGCACGGCGTTGGOT 
GGACXSGGtXKX^TTTATTTCXIGCGT^ 

SEQ ID 3708 

MGKAVFPEFPGIiKWGRKKTAVVreTGTQKS^^ 
KERPAVKVGGPALAYGRDYTVTDKGVL 

SEQ ID 3709 

TT(^TATGGGCAATGGCTTraAGra 
GGTCGCCTGAAAAT 
SEQ ID 3710 

lAT^NGFVAVCRFDQGMVEGTGVGFLK^ 

SEQ ID 3711 

ATGAAACTGCTCCGCCGAAAAGACTATCCCG^ 
ACGACTTGGGCGACGCCTACX^CACATCXKXX^ 
CACAACCGTGATAGCCTTCCTAGAAGACCCC^ 
SEQ ID 3712 

MKLLRRKDYPEGRFTLAFVGY^ 

SEQ ID 3713 

GTGGCATTGGGTACTTCTGCTMTGCA 
CGGTGtXTATTGGTACTGAAAGTAAGGCACAAAATAAA 

SEQ ID 3714 

VALGTSAKASNFGISIX^SAASCTKGIAVGTSSQAmSAVM 

SEQ ID 3715 

AGAACTCTTTTCGGATTTGTTCOT 
SEQ ID 3716 
RTLFGFVIi* pigkn*fylhlictffiniiq 

SEQ ID 3717 

ATGMGGOGGCGACAAAAGMCTSATTGGOCTCCTGC^^ 

TGCCCGTCGTTTCGGACGGTCAGGOT 

CGATTACAGGCTTGAGGGCCTGCMTGGGCGGAGGC^ 

AATATTTTTTCAGGGCGCGTGTCX3GACOT 

GCATGAGGACGCTCTATGACGACGGCTGCAAGGTC^^ 

CXjGCTGGTTTTCGCAGGGCXjTGATTAAGTT^^ 

aGGCGGGC<aTG0GTTCAA(3(^ 

TGGTG 
SEQ ID 3718

NIFSGOTSDVSGOtfSVKV^ * 
QAGDAFKVYP(X3JKRRDTC^ 

SEQ ID 3719

AT<XXXnOX3AAACG<^^ 

GTGTCGGACJG&CATCGGCCAAGCCmC^^ 

SEQ ID 3720 

MPSETDLRARIVEEARSWI^ 
VSDDIGQAEIjIGRGVRPFTPSP 

SEQ ID 3721 

ATGGAAAfXXX5AA0CCTMTCCTCC^^ 

ACTCGGGCAGTGOOTTCCTrAT^^ 
SEQ ID 3722 

SEQ ID 3723 

TTGGCGTTTTOGGGAATT^ 

SEQ ID 3724 

LAFSGNCRRQSNBPPEKFSGFIFTICFDISKIiYRP 
SEQ ID 3725 

ATTOATTTOGTAOGGCGATFTOT 
GCTTGCTTTGTGCGM^ 
T^CGCACCTGO^^ 
AGGTCATCGGGAAATCGGOT 

GTATTTTCGCAAGGCCGCCTGAAMTCATCC^^ 
mCGGGCGCGCAfiGACCCT(?^ 
AGACCAGGCGAATAK&AGCACT^ 
GTGCGCAACGAATATGAGTTTMGCTGGGTTGGAAATAC^^ 
l PC6AAGAAGAC6GAC^GGGGTITTAT0CGTC&^ 
G(X(X3GCMCGCCCATGTGCCGGTAATOTO 
TOGGCGGACGGCGACAGCTACACOCGCGTCGGC^^ 
AAATCTC«3CGG^^ 
CTACACATTGG^^ 
ATTGGCCGC^GGTTTGGGTCAAACTCGTTO 
AAAACCTGCGOTGACATCATCGTGGGCATACXS^^ 
GCGTGCGGTTGACGGCATCGTTGACMCAGCTACAOT 
ACGGGCMTTGGGCXX!AAATCGCGGCGCAAAA!KXK5C^ 
T1*^G(SjATTATCGTCTG^ 
GAMGGAAAGACCTATOOTGAGG«X5GCG^ 
AGCAGTCTGAACAAS6CTTTGCGC5ACAAAATC^^ 
AGACACACGCCCGMGCATOUV^^ 
GOTATCGTGAATGCCGACCGOTCGG(^^ 
AAGGCGGTTGTCGATAAGTTGAAOGCCGGGGATAT^^ 
0C6TTAC0GCC6ATAAAATCGG6GCAAACGCGGTSACTGOT 
GGGTAATTTTACGGTGTCTTCAGACGGCATCC^^ 
GACGTGTTGAGGGCGCACAGGCTGCGTTGGACG^ 
GCAACMCAGGffKfcAGK^ 
AGADCTCCXXXXXX3GQGCAAGGGGGGG06GG6ATTATA 
TTCTCCX^CCOTCCXaCGAATCGGTC 
GTTTCttXSTACGAC^ 
TAOTACGGCATATCGTTCC&CTAC 
CAGTATTCTCTGTT^^ 
COmAAAACGAGGmGGCCCTCCGTC 
SEQ ID 3726 
MQGKSSTVTSABER^^ 



VFSQGRLmPYGMSH^^ 

VRNEYEFKLGWXYCLLEPMDIVTI/^ 

SADGDSmVGAVNRK^ 

IGRTVWVKLVSYHVFGGGIQDIiAEVPAYSWIEGAPLGQI 
TC^QIMQNPQLQAI^ISI^^ 

KAV^ 






DVLRAHRLRWTEGNVWVU>I^ 

LSLASHESWSPFVSVSYL^ 

QYSLFQEEFIRAVPKSNIjLFFVHfflV^ 

SEQ ID 3727 

TTGgCGGCGGCGGTATCAGGTATTOCATAGGCMCCGA 
TTCATCAGGCCCX5OTCGCGGACGACG 
TCETATCCCAGTTGTGCGTCAGT^^ 
AACGTTTTGGTAGGAATCGAGGGATT^^ 

SEQ ID 3728 

LAAAVSGIRIGNRIVAAFLUIEI^ 
NVLVGIEGFFEIAHAKL 

SEQ ID 3729 

CATCGOTOCAATTTCGCAM 

TTGGCGTGACGGACGTTTTGTCCGATGCGGA^^ 

CGGGTTTTTO^CGGACAGGTAGTGCAATACT^ 

GCGTTTGCXSGGCGQjGGTGATGGCGACGTACAT^ 

GGGTTGMTTGC&AGCCOT 

AGGCGGGATCGTCTGAAATGTTTTCGG(X!AGGG^ 

TOGGTTCOTAGTGCKXX?KA 

TTGGCGGCTTTCGCGC^ 

CACG^GAGGGCGTTGTWTCGTCGGGATT^ 

GCGGAACACXXrmmCGATGACGCC^ 

TCTTCGAGGTCGGTAAAGGCGGAGTAGTAGCGGATTTTC 

TGTOCCGACGGAGCGGTAGTTTTGTTCGAGT^^ 

GTCGCCGACGGCAAATACCGCTGCGTTACXXXXXIGCCAT^ 

C^GCGCAGGATTTCGTTACTTTGCAGCAT^ 

TTGTGTGCGGATCGGGCGCGC(XMGACGGAAGCX5CG^ 

CAGGCGTTTGATGAGGGAAAGCTGGTCGCCGCT^ 

CACATAGCGCGGACGTOATO^ 

GCCATQCGATACGCGTGGTCAGCACGCGCGTTTTGC^^ 

ATTGGGGGCGGATTGGTCGGGAAACAT 

SEQ ID 3730 

HRFQFRKLGIQLFHAIJAEVDGQSGGFA^ 

AFAGAGIXSDVHEAAFFFEAAALGKAHFAGETPM 

LGSVVL(^AVFEDAIH)DFGQVHLSDLVAQGF^ 

AEQAIiPDDARLGVAAVEDGDFVQIPAFAFBGFGFIHDEP^ 

VADGKYRCVTARHEFEPGVLQFV^ 

QAFDEGKLVMVEDIiKG^QAC^ 

IGGGLVGKH 

SEQ ID 3731 

ATGATTAATCATATTGAAAAATATATAGGCACTA^^ 

OTMWTACTTTAGGGra 
AGAAACXIATCATTGAGAATAAAAAAATOTTCTO^ 
GATGATTTCCAAGTCTGTTATGGCGAAC^ 
TGCTGGATAATGAAGTTGGCAACCTTTCGGA 

SEQ ID 3732 

MDJHIEKHCTISHGSKSDSGSQYKI^^ 
DDPQVCTGBHYHIVFPIiLmYKQEABLVEKKGV^ 

SEQ ID 3733 

ATGATTGAAGATAAAAAGATTGTTGCCTTGAATTTTG^ 

CGGCGACGTTGAACGG(^T6TGTCGGAAAAT^^ 
(XXKIAAGGOTTGGAAGGTGCXXjOGGGC^ 

SEQ ID 3734 

MIEDKKIVAIOTAVEDGMTGa^^ 

SEQ ID 3735 

GTGACGGGTATGTTGAGGTTTTTGACG^ 
ACGCGGCGCrTGTTCGCGCTTGCGCCW^ 
GGCATTGCTGTTTGCGCGGGGCGTGAGGGCTO 
ACXSGC^ATGGTCGTTCCGCCGCTGTTG^^ 

SEQ ID 3736 

VTX3MFLTHIRPIjPAGPQAHIiPGTATRVLEAVI6W 
TGMVVPPLIiAPFCALAGNNALRFLFSAQKSRGliAD 

SEQ ID 3737

ATGGGGTTTTTGCAGTTCGGACTGCTGT^^ 

CGC(XX£MGGTGTTGCGCGGTTACT^^ 

SEQ ID 3738 

MGFLQPGLLFAAAGGVLGAWASLQERGRPVQAVLEAnSAIAAAAAAER^ 






ATGTCGTOCGT^^ 
CGOmMCAGCAfflCCGM^ 



amAACCOTW^CGOX^^ 



ACAT 




S ^^!MVRLPARIJ^^ 

SEQ ID 3748 

UJNKVGRIVG3CRSPFETLLLFRRTFYQWEQGVLSQSGL 

TICA^TCCCAAinCTTTTrcT^^ 
FKSQVFFWHCPACNKWQTFTPNKIEV 

GTTACCG(JHG1CGGACGTTGCCGCCSC6(XGAAACAGGGCGCGGGCGAGOT 
SEQ ID 3754

SEQ ID 3755

ACCAA 

SEQ ID 3756 

LAAASGEVSAFPPIPnSGGRGKFLPFCLREGGVIQNYQAHQ 

(mACCCCXSAACOSACGGTra 
GGCCCAC<mGCAC(^G^ 

CGAAGMCGCGATTGCX5GACAAAATAAGCAGCGGC^^ 

CCCTTCGGCGGGGGACGMGG(^CGGCTTTGC(^^ 

AAGCCCAAGATTCirayr^^ 

T 

AVSIKTGLAVHRFFNDFKRLHIAHPFFFI^^ 
PFGGGRRHGFAMQIXSRPSVAVNVRRVKRDDGYHIALI^ 

SEQ ID 3753 

SEQ ID 3760 

MPSBSLSBCEACPQ6AASVVVRIGTAFSPPPIIGAPCNPV 

GGGAAATGTTGAAACGTOTGCCGTC 

SEQ ID 3762 

MNAEGFCERQTACRPIXjRRNAGDRADIi^^ 

SEQ ID 3763 

ATGAGCGATTTO^^ 
CCGCAMGGOr^^ 

TAGGGCCGAAGTGOTAAAATCGGCTCAGGCTT^ 

CCACAAGCCGCTOTTGAAAGACTACGGCTACACGCAGCAGGAM 

GATGCftGTTATCGCAGGCTTTGGGCAGCGGOT 

GAAATCAAAAAGCTCGGCAGCGAAGGGAAATTGACGGCGGATGT^ 

CTTTGACGGTGTTCTCGAACAACTGGCAMGC^TG^ 

TCCGATCGTCGCAGGTTTO£CX3TO 

GGGCTGATTGCGAAATTCGGCGATGAAATCGA^ 

CCGTCAAATCGTOTTTGACGGGCTGACAGG^ 

TTATATCMCATATTCGCAACCGGCTGGATGCTGATTAA^ 

A^TAAACAAAGCGGTAGGCATGATTMCAGCATGATTGACTTTG^ 

ACGGCGGGCTTGGCGAGCGTATTCATGACAGCTTGACGA^ 

C<m<&«aCACGC^^ 

GGGOAAATCJUUfflCCCGW^^ 

O^C(mAMCGG(XGTGAAAAT^ 



(XATTGCAGGAAC(K3AT^ 

ACGAGQGCAAACGCGAAAACCMOGCCGCAAAGACGCGC 

CATGGE6GCGAGGG(XK^GAATTTCAGGCAGG(X 

CAGGAATCG^CATTTACAAGATGATTTTCGG^ 

T7m«>3GCTTCAGGCGCGGC^ 

(XOTCTCAAACGACO^ 



AACGGGACmAMTTPGTC 
flSQVRQVTSSETEYIAV^ 

eikkmsegkltadvif^ 

GLIAKFGDEIDVFGGGWSNLSDVIRAVV^n^ 



GEIKAHmH^^ 

AWERIAIAEQDPAY^ 

QESAITOIFGTKETLBTAAAA^ 

HAQTIREMADQQGGGSWIINSTGGDF IHKNDLAKLLKQMKRDFKFV 
SEQ ID 3765

ATGOGCTATTTTAATAGAAACGGAAGG^^ 
TCCMACCGATTTGGACGGACGCATCGACAGGATGC^^ 
CX^ATCGGTTGGGGAAAATATTTCXKTrTO 
AAACC0G6ATTGTCAQGCTTCAGACGGCA1TO 

SEQ ID 3766

MRYFNNOMPDNLILISVL^ 
KPGLSGFRRHFLFRRLPLPPRTLPATIS 

SEQ ID 3767

GTGCCAGAAAAAGACTTGGGATTT^ 

MATCGCTGAGTTTCAAACCGAGCAGGCOT 

ACACGACATTGATCAGGTCAAGTT^^ 

CAAGTATOCATGGTOT 

TCGAGTGCCTTGCCGA^ 

G^GTTGGGCGGTTTCAACCGCTTTTT^ 

ACGATCGACCAAACCOX^^ 

TTTraXICGCGCTGACGOTAM 

CGTCCAGGCTTTTATAAAATCCCGAAGGGAII^^ 

CA(^MTCCACMTTOTram 

TTCCGCACGCAGGCGGCTGTTTTC^ 

SEQ ID 3768 

WEKDLGFmVAATVHHAAI^PSDNR^^ 

QVCMVLLIIKSGIGFDGGREIALSV^^ 

TIDQTRALVVLRQFRQDARAtf^ 



SEQ ID 3769

GTOCTATCGACATAAAATTCAATACT 
CTTO^TGACAAATTCTCATCATT^ 

SEQ ID 3770

vsimiKmiiGViuK^raFOTiD^Em^ 

SEQ ID 3771 

GTGATTTCCGCCTTTCAGACGGCTGGGACGG^ 

AAAACGCCAAAAAGCGCGCCACCAT<X5ACATGATCATGGC^ 

mTACATCAGATACCCCGGA^^ 

GATTTCGAATACTCCA!KOT6TT 

GGAAGAAAAA6CCTCTGAAAACCAAA 

SEQ ID 3772

VZSAPQTAGTAMTDNUQIAWM^GVFVAAYGIMWTO 

DFEYSMLLRDWDNI^ 
SEQ ID 3773

TTGCACCCGAATTGCACCCGAATAAAT ATGGCAACCATO 

(X^GGCGTCGGCGGCGCGMTAGA(?rcGCAAATO 

GCGGGGCGCACGGGAAGAATCATACCra 

GflfiGIAICOOTftOCAQC^ 

A^QCX^GGCAAGGACAAGACGGCG^CCGA^ 

XTTTGCCATCGAAACOGCCATGCGGGCAGGC^^ 

CCATTATOTTGCGMXE^^ 

GCATACAAGGCCTGCATTTTCACGACACCCGCC^^ 

^KnTTACTACCGCCCCG^ 

SEQ ID 3774 

IBPlOTINMATITQRNGmVQIRMKGVSRSATFERA^ 
EVSPTSVGRELmSAVCEHAMKEHGLIiRBNPViaiSK^ 
PLSLRAAELIEQbf&TOIT^ 

SEQ ID 3775 

TTGGGACAATTCGGTGGCAaAAATATAAGCTGTC 

SEQ ID 3776 

U^FGGTKYKIiSAFQRAAFYLMyNTQ^KjT 

SEQ ID 3777 

TTGGCAGACGGGCGGAAGGGTTTTTTTGATGC^^ 
GATTCGGTCCGGAACAAATO^^ 

SEQ ID 3778 

I^XSRKGFFDAIIKINIFFFPVKTW 

SEQ ID 3779 

ATGCCOGCGCGTGTCGGGTTTGGCGGGGCC(H^ 
CCATTOCTTTTGGCAGACGG(K36^ 

SEQ ID 3780 

MPAKVGFtSGAVWttNGQKPACPS^^ 






SEQ ID 3781 



SEQ ID 3782 

LRGRAGGHiPVHPMGPAKHmiGHMRDCLQYrai 

SEQ ID 3783 

TTGCWSGTTaGOT^ 
TCTTATATATCGCffTTCCATCi^ 

SEQ ID 3784 

LQVQIFRQYCXQSMCPKVSGIAGPSAG 

SEQ ID 3785 

TTGTCTAAAAATCTGAACCTGCAAA^ 
GOGTCGGTATOGAOX&GGCGACXJAACTGGT^^ 

6ATATTTT6GTTCAAG6AGCT6AACCATT^^ 

GCTGCG(XXITQATTQGC(X3GGAAJOOT 

CATCGGCGCGG6C<^TCTGGTATTGG6TTTGGC^^ 

GGCAAAACTOACGTC&AG^ 

AAAACGTGCCGCGCGTGTTGCCTAAAAACACGGTT^^ 

(XGGAOTTTAACTG06G(MCGGTAT^ 

(XSTCAAGGGAXCGflGCATCAA 

SEQ ID 3786
LSKMIiNIjQTRHKVElfAPIiPYRPIPn 
DILVQGAEPimDyFACGKIJWPRATDra 
GKTimVIAPmmPILAAI^ 
RQGNEHQTQVA 

CCCTGCCC(XrrCAGGAA 
SEQ ID 3788 

MRKCRLKQIQTAFDAWKS^^ j 
SEQ ID 3789 

ATGCOTTTTTCAACC^ 

SEQ ID 3790 

MWSTQPTLFVSSHTNQKLHNYNHFFVQYQIT 




SEQ ID 3791 

TTGTCCATATCGGOTTCTTAAG^ 
m^CGCGmGGACAGCAGCCGCCCGAACAG 
AGGAAAAGGTAAC(KXATCCATATTAATGAC^^ 
ACAGOTCGCACTCTGAACCAAGATTCX^ 

SEQ ID 3792 

LSISAFURLFt 
TASHSEPRLR 
SEQ ID 3793 

MGCCGTCT^^ 
TAATGCTA 

SEQ ID 3794 

MPSEFVSDGISASSKFWSATQFCnilFCPVWRIiKUiHHISIini 

SEQ ID 3795 

TT6TTTATTCCTA(^TCGTATTCATCCT 
(XX3CXmxmCCCCCCCGCGA 

SEQ ID 3796 

LnPTSYSSYFPKCERFMKQSV^ 

SEQ ID 3797 

CCGGA 

SEQ ID 3798 

LPRIAVLFVLSAARMiVIJ 

SEQ ID 3799 

ATGAAGATAACACACTOmTTAAAGAAAGAACT 

CACTCTmACCOTAAMTCCGCJ^ 

ACGCGGTCGCCXXECGGCAGGMMGCGG^^ 

AAAATCAT(XXmCGCTATraTO 

ACATTAACGGCATTGGGAACTTPTCGAA^ 

CACACCGTOXXmOTAAAAATa^^ 

S MWraOIiKKEVQKEPLRSm 
KIMPDGIVYADSPGS^GKSDAGGFTRCRINRSKEFADRRNHI^ 
SEQ ID 3801

TPGTCXXnt5TCT6AAT5TATA6AACG(X^ 
TGTGGCTCGACCGGGCAaAAGAAGT^ 

TTTTXXX&CATTACGCTO 
ATTTC<KOTITOK^ACAAACTGCC^ 

ACCTGTTGTCGGCATTCGGCGACAACATAGGGA^ 

GTTTTGTGGOTACCGTCTTCGGCAATA 

TCCCCETGCCXEAACTGA^^ 

03(XmCGGTGGttGGCGGTTAT^^ 

GCXX^TGCTGATGCTGATMTGTGTTTC^^ 

(X3CTGGTGCG6ATAATGAGCCAGACGCA^ 

CCGGGTCGATAAGATGTTTCATCAGGAC^AGCC^ 

ATTAACGACGGCAAGCTOXXXATAK^^ 

ACATTTTGAAAAACTAOjAACGTTATTT^ 

SEQ ID 3802

I£LSEFIERRTSFNPMVILmFFV^ 

PSDITVGAPKRQQQALMITVFHWGVHA^ 

SIAWSAI SOKKCMCVLSELNIA^I^ 

VUWFTVFGNTAIWL^ 

ALLfflilMCFSIiWKGLSAIHOT 

MX3KLPHIKHQTTYKPYAYPFIXjRV^^ 

SEQ ID 3803 

ATGAGGGCGTTOGCCGTOCGCMCCATAAAGGAAGTTOAT^^ 

GCAAGGAAAAAACCGCCTACCGCMCGCTCMGCAfiGAAGCCGAAGCC^^ 

ATTTAOTACCGCCTATTTCGCCGTCTCCGC^ 

GCCTC(mX3GCCTGTOtf^ 

AAAACGTGCGCOGCTTCCCCGAAAAACCCGACTTG^ 

CGCC^CCTCOTCGCXAACC^^ 

CCGATTTATATGGCGTGTTCGATTATTTl^ 

CGATGAGCMGGCAAAAATCCCTTGGTATTTCAGATGGTTC 

CGCCAAATTGCTGGAAGGCTACGGACTGAQCGAAGC^^ 

GTCGATGAAGAATTGGTCGMGTGCCGCGCGGCGAAGTGGG 

GCTGGTTGAAAACGGGCGATTTCGTTACCATAGACGAGG^ 

AGAMTCCACAAACTCGATGCCGTCGAAGCCGC^ 

GAAATCCGCCGCCACCTGCGTACCGTGCTGGCA^ 

AGTTTGAAGGAAACAAA 

SEQ ID 3804 

MRAFAVM^SFMNRTttHF^^ 
ASAGLSKELAGLKAQTPVmiWTDKSRPAGEn? 
PIYMACSIILVKSVFPFSNVLKQALI^ 
VDEELVEmGEVGELTOGGSVMRGYI^ 
EIRRHLRTVLANFKI E^QIHFKIXSLPRNATGKVI^RVLK^ 

SEQ ID 3805 

ATCGTTGCGGAC^XSAACO 
GAGGTTCKXXXXX^OTTCCTATGTCTG^ 

SEQ ID 3806 

MVAKSERPHLKTICRIiKTE^ 

SEQ ID 3807 

TTGCGGATACIGGCGGGCATGQGCAAG^ 
TCGCt^TTTCGATGATTTCGCCTG^^ 
GGGTGAGGGACTGGTGTIIXXATGTCCATGCGTTO 
GAGGCTGCCCGCGCCXXnX^GGCGGAAAGC^^ 

SEQ ID 3808 

LRILAGMGKYQILDDKFDVDHAAARW 
EAARAGGGGKFVGQAAGEAAVHVACFGHGIVVKID 

SEQ ID 3809 

TTGAAACTACGTTATTTOTATTGCACAAAAAAATGATO 

SEQ ID 3810 

SEQ ID 3811 

ATGCCGTCCGAACACGCTGCCGCATTCTATC^^ 
AACTTATCTATACCGTCATCAAAAraTTATC^ 
fGTCGTATTGTTCGGCGC(OT^ 
CGCTTGAGCGGACAGAAATTGACTGCAOCGC^^ 
ACTGCCCCTGGCCA



====== 



jIjXjVHFSAQGTHKAAEMLADSLEELVRAAAIRQ 
SEQ ID 3815




GGTATCCAGCCATTTTTGTTCGGTATCGGtMTGTCeK ^ZVZZ^ww^ 
GCAOmnTCGACC^^^ 

GCGCTOmGCCGGTTraXCG^^ 

AGTCGMGGCGGTATGCGAGTCGCCGATTTGCMGATGTO 

GCGG 

SEQ ID 3816

iraniTiY^owVFGGPStVEPVFHAMAAHC^^



CCTACCACCGCMCAGCCCKTCGAACCGCTO 



OGTCGGCSASCGCTACCTCAASTTCGCTTCCGACGM^ 



GCCGCTACACCGATOCCGTCAACGTCAftCGGCAAA( 
nri^cmCTACGCAACCATCAAGTACACAGCa 




m-iiw.^^ ZZ^«^iqVWWRW^^ 

ACGGCTGGGGCGTGTT6 



SK2ES555& 

SEQ ID 3820 .uaaiBiiLmsP^^ 
AGVYGYTFQLFLDFSGYSDOTGMAMLIiGFRI^
WALCRLKYIAPI^^ 

SEQ ID 3821 

AT(XXETCT(^<X3^^ 
TGCAGGAMTGCCCGAGTATTGCGIAAJCC 

SEQ ID 3822 

MPSEGREGI^ASLGQVRIWWFKFRQTRK^ 

SEQ ID 3823 

GTGGGGATTGTGCTGACGC^GCGGTGTT^ 

CGATCGTGGCGGGGATTCCGCTCGGGCGTATGGT^^ 

GC6OTGCCGAGCQTGAATACCGGTTCTW 

AGCTACATCGAGCOTT^^ 

CCAAACATCXIGCGCGCATTTPTGGCGGGCXXXa^^ 

GGCGA?IH?rGGTCGTCA6CTrGGGGAl^^ 

CTGCTQGGACAT 

SEQ ID 3824 

VGIVLTHAVFWSITASLAVRIAFVGKGNQA^^ 
SHEPFVIQIGGFSARQVTV^ 

llgh 

SEQ ID 3825 

tt(matcggcgcatttatito^ 

cgtggatotco:gctgacttcgctgg(^ 

tttgcctcgtggcgtwgaaat^ 

SEQ ID 3826 

IAI(y^FOTAEHPIALLSDIGI£^ 

SEQ ID 3827 

GTGTTCGGACGGCATACGGAGTATGCGAITOX^CAAT^ 
CAAAACCCTGTTTGCAGGCGTATCTTTACAAOT 
SEQ ID 3828 

VFGMiTEYAIADNFTKYTAPFHI^ j 

SEQ ID 3829 

GGCGGTTGCCGCTTCCTCCCCGAACACTrrc^^ 
TGCCGGCGGTA(&GGCGGTAGGC(£ 

TTOGATGATTTCGOCTGCCAAGGCGGTT^ 

GACTGGTGTTCCX^TGTOCATGCGTTTTC^ 

COKXXXX^TGGCGGCGGAAAfiCCGGTTGGTCAGGCGGC^ 

GAGTTCTTTG(XX3CCGAGCTTGC06W 

GCGGAGAGTTGGTCAGAGAGGGATTCTAC(X3TCC^ 

CGCCGCAGGOTCGATGCCGTCTG^ 

CGCOUmOGAOTTTGGCT^ 

ATCAGCGTATCTGTCGGGTTGTTTO^ 

GTTCGAGCCTKK^^^ 

TTCTTCAGGTTCGCTCA^^ 

TGCCCAITCTCGGCAATTTTCTTCA6W 
CGCGCCGAGTTTGATGTTGTCCGC^ 
CGCACCAGCGATTTGATGTCGTT03(a 
CCAACGCGCCmATCACCGTCCO^^ 

TGCWXXX3(X&TTTGCTAT 
OTOGTTCAAATCGCTGATGCG6TTTA 
CCAGT(XK3CAAAGGCTTGGAAACCC^ 
GCGGTGTCGAGGCGGTGGTCGGACAT 

SEQ ID 3830 

GGCRPLPEHFPPQFPDIAVMGQQFVAYFGFCRVAQFGVVIiLPAVEA^^ 

FlfflFACQGGLVAFGSQHFGADAV^^ 

EFFAAKLAVADIXjELRGVAVPFGHIRPRARKGGGMX^EGGE^ 

FHPRALAHNLRQLCQGLRMFVEEGEIGGAEAIW 

FFRPAQDPVEQLIDFVPMNIEA\^ 

RAEFDVVRHAFFADFA^ 

IPPFAPAVl^RFGIDE^ID^^ 

PVGKGI^TLIRQRLVQHGQS^IiEFAVVELAGEEQGMAAGGV^ 

SEQ ID 3831 

ATGATAATTGACGAAAMTC^ 
CAATAGAAAGCCAACTCATTTCTTATCGGCAA 
TTTAGCTATTGAAACTA^ 
SEQ ID 3832

ffilDEKSTSDVIiKQYQELGFD^^ 
SEQ ID 3833 

SEQ ID 3834 

VLKINAPIAKATO^I£BAFISIPYMSHPAKIOIHVN 

SEQ ID 3835 

TTGTTAAT 

SEQ ID 3836 

UJHYTFLVCRTAKSKSGGSSRLPFGETLGTIi 
SEQ ID 3837

TTGAMMCGTTCXMATGTTGAAACATTGCOGCC^^ 
ATAACQCCCGTCAGATOXAGTAIXAACGAAAGGAAAAATOT^ 

GCGTTGAAA 

S SvpySpUariju^^ 

ALK 

SEQ ID 3839 

ATGCCAT 

SEQ ID 3840 

MPSEGAERASDGIGETQRLQTAPLPLYFNAAS 

^ rTCOTAAM^TGCGGGCMTAGTGCCT^^ 
TTCATATGATGATT 

SEQ ID 3842 

LVKIAGHSAFRQTKlfAEQKffiSRIJ^IiQRElVCRACPTSKIHMKI 

S Sg\ISccaaag^ 

CGATGGGCAGAAACATCATCACGCAGGCACTTTATTACCCGGATGCGGAAT^^ 
AAATGTGCGTCTGGAAGAAAAAGATATTTTGACCGTC^TGAATCATrc^ 
TCGATTTGC(XK3AACAACCTCACCGMTACGGCATOKXTA 
TOGAGAJUXOTGGAAGOA^ 

GAATCRCAATACTTATTATGTCGCACACGMTATATGGAAATATTCAA^ 
TTGCATGTGTCraTGTTTCTTGGATGffiGGAGCATACGCG^ 

TOiiAACGATACCCTGOTTCGGGCATACOT 

AWXC(raCOTTCTOGAA(^ 
GAOGAGCAAACAGGCATTCAWSAAACGGTTGCOGAAW 
TTGGCAGACAGGCAPTTTCTCGITrCGGCGGM 
SEQ ID 3844
S^SQTMHAIBAC^^ 

SIQlKMLTEYGIAYISYWnfPGWKRQEQIiSDIHQFAGRDAMEPIiEARTR^ 
LHVSFVSHHAEBTREHLAIAGGDYIAKE^ 

- . » iiimmilimTD0fflM»!«W.VT1^ 



LADRHFLVSAD 



SEQ ID 3845

ATCAACCAGACAmACCT^^ 
CGTCCGGGttGOTCAAAOXSAAATCA 



TmCCTGCCraAAAACTATCC<£ACCT 

AACAAAATAC03AAGa^«3CGCAACAOTACGCC^ 

GGCGGCATTGGGGCAGMOTTCTTraGGCAGGCTTGCCOT^^ 

CGGtXXXjACAGATTCTWTTCGiaTTTAAIGOGACCGA^ 

CGACCTGATGGCAGAATGCXX^TAGAACACGIWAOCTGCTGC^ 

AGCGCA 

S Sl!?JS5mQHPIKHHU 1 UlAYQI^ 

rAL^SHTRQRHAMOT^ 
Sk 

SEQ ID 3847

TATCATAGCAACCTTCGAnKTGTCTGAAACOT^ 

SEQ ID 3848 

YHSKIiRCRIiKPCPDIRTDACR*KHSGKCREFPFKPI!H 
SEQ ID 3849

TTAACGGGmAfiGCMCGTCCCAAAGGTGATACTA^ 
GCX3ACCC(^TCAT(X:CCTITr^^ 
SEQ ID 3850 

SEQ ID 3851 

GTCGAATGTCTCCTCTCTCCTGTCGG^ 

TCATGCGCTATTTTCAAAJUGAAC^ 

ATTGATTCGCCACCTCATTAGTCJ^ 

GATGGAAATGCCTTA(Xtt£T^^ 

CCTTATGTGGACACCAAGGTGCGACCTATAC(nT^ 

ATTAAAACTTAAAGGAGCAGGAGCTGGTGGAGGCA^ 

GTCTCAGATCTTCAOTGGTTATCGTTK 

GCGGCWP«KOTmGA 

(XaTACGGCTTCTAATATTG^ 

SEQ ID 3852 

VBCUiCPVGIXKSEGTVDAIWiSIjDLEEKCLQVTC 

DG^PAC^SI^ASVSi™^ 
VSDVDLVIVGEGRIIJRQSLAGKAPIGVA^ 

SEQ ID 3853 

ATGTITO&CAGGTTCTTOT 

CGGCGAGGATATGCACCGTGAMCGGCAAAGATATTG^ 
TGGGTGATTTTGAACaAAACATACAGGC^^ 

SEQ ID 3855 

T«reCGGATGCAGGCTGTCGGGCATTATTGGAAAAGAT^^ 

TTATGAAAACAAATTTCAAACAGAAAATTATCGAACAGGCACGCAGA^ 

mCAATGTTTOTaCAGATGCAGCAG^GAG 

GGCTATATTTTGTGCAGCCACGCGCAGCACGAGTO^ 

AGCAAACOTCAGCCACGAG 
SEQ ID 3856

UWQAVGHYWKRFPNmBYKHKOT 

GnLCSHAQHECNDHCHDHEEAE^^ 
SEQ ID 3857

AT^GAAGTGAAAAAAACCA^ 

TCAACGAGTTCGGCXSAAATCXXjCAira 

GCAGAAAATGCTGCGCGACGCCAAACCCGACCGCCTGAT^ 

GGC(^(X3TCTTTACX}GT(X3TCGATCCGCGCCAGTT^ 

GCACCCXXXSAACftGCTTGC^ 

TCAAAAATCACGCTAO^ro^ 

GATTTGCCCAAMTGACCGMGGACTCGTCX^3^^ 

SEQ ID 3858 

HSEVOTKmiSGF^ 



DLPKffl'BGLVRAKGVFQVLGlWVWL^^ 

MTCCAAAGCCTCCAAAACCGTTGCCAAl^^ 
GGAOTGGTGCAAAAGGCAACCGATMGGCCXST^^ 
' <ITCTTTATCMTCTCGCGGACAACGGT^^ 
(XX3(X^CGCGCGGCTTTTATCAAA^^ 

FFIWiMJNGSLDmGQYGYPVFG^ 

SEQ ID 3861 

GTGTTACTGCCCGACAACAACGCGACGGATTTO 

ACCCTGCCGAAMOMTCTA(^Arro 
TOGTGCCGACGGTGTTTTrcAAGOCGTT^^ 
GIXMAAAATOGTGTTGTCGTAAAAGCCOT^ 
AOGCGOTTGCOGCCATAGACGGCAGGGAAACCGCX^ 
SEQ ID 3862

VLLPD^TDFH&YGYVIiIKAAR^ 
VENKVVVKAFSGIAHEIGIK^FGFVQNQTD 

S ^TOlS^MCAa^GACA^ GACGGCGTCGCCCGTTOXX^AAAA^ 
SEQ ID 3864 

SEQ ID 3865 

ATCCCXH^TGAAACCTTGCCCC^ 

SEQ ID 3866 

KPSETIiPRYSDGCIiPIKKQRQMPPPPVQTSPRSFKTA 

SEQ ID 3867 

ATGAGCAGACCOSTAC^^ 

(^CCCGGCGCGCATGCCCTGCATTACGGCAGCGAAT^^ 

AAGTraGACATTTTGCACCTGCCGCGC^ 

TCTTGGTGGAAACCGMCACATCCGCTGC^^ 

GATTCCGTGCTGACT(?rcGCCAAA^ 

CCGTCATCTCGOXKm'ACTTOT 

CAAAPACGGCTG6CTGGTCGAAGTGTGC 

SEQ ID 3868 

MSRPVPAVFGSVFHSQMPVIAYREGK^^ 
PTIJGTDPVIGKAGSPSETALLYILASPVGDre 
DSVLTVAKDI/jfWSERNFT^ 
SEQ ID 3869 

ATGCTCmACCCMCCCAAC^^ 

CAGACX^TTACCGCmC^Ara^ACra 

CXsACATCGGACGCGAATTTTACGAAATGAACP^^ 

AATGCAG(£AGCCAAAATCTGCTGATT^ 

CGGACGGCTTTGTCTATCAGGACGACCGCAAATC 

(X^(&GCGCGT<XX}TGACCTTO 

TGTGATGMGTGTTTGGGGAAGGGMTTTTGTTGC 

AATCTTGCCMTOTATTGCAGCM^ 

GCCAAATAGTmTTCACAATGGTOATCCC^ 

GTTTTOCTGATGAI^ATGATTTOT 

CTCAAAATGTTGGTACATTA6CCGAT6CAGTAGCCGAATATTTO6CTA 
TAGCAAMTATTTACCTTCCCCAAGOT^ 
GTGATGCAGCTTAACGCCGMGGACAAAACGGTMCX^ 
TCACCAAAGCCCGCATAGAAAAAGCCGCCGCCAAAAT^^ 

SEQ ID 3870 

MLH^PNPTQPNP^PNPTQPNPSPDLSOT 

NAGSQIUiLIRGIMjEVLKHLKNAYANSVKMIYIDPPYTO 

OTEWGEGNFVAQMRKRTAKSDVPFGISQDY^ 

WPIMDFLNISNPVMRYFKDDIMKAOT 

VMQIflAEGQNGNRRKCVQIiPEK^ 

SEQ ID 3871 

TTGCCGCTTMCGATGAATTAAGCGAAGMCAGCTGCAMC 

CGTATCTGTGCGAAAAACGCCTGTATCTGCTTAACAAGW 
CGGCAGCAATATGGCAAGCGCCATGCAGCACGAACTCGAC^^ 

SEQ ID 3872 

LPLNDELSEEQLQTIjLTTWIiYIXaAALOT 

SEQ ID 3873 

ATGCGGGCXKjTTTCGGCGGTTTTGGQ 

TGCAAAGCCAAMCKSTATAGACGGCCGATOCCCGACC 

GCTGCACCGTTGGCTGGGC(7rGTTC^ 

SEQ ID 3874 

JIRAVSAVLGWIXaATPKYRTMJENPEIiIiFAAKQYA^ 

SEQ ID 3875 

GTGGT(H3AAAGC(XJGAAAAAGMT^ 

TTAATTCGTCGTCCATGAACGATACGGGCGACAAGGCATO 

GACCCGAGATAGCGCGAAAACGTGGGGCAATATCAAACGC^ 

GACGCGTTTMCGACKXKSCTGGTCA^ 

ACGAAAAGGACAAAAAC^GACGTTCAAACTO^ 

CGGCTTGGAGTTGAAAAC(^ 

GAACGCGCGOCACAGCCCAAAATCAA^ 

QCGCGGAA(XX3GCACG(£GTCT^^ 

CAGCGACGATAGAATCGAGCAG^^ 
JATCCCAACGTTTTCCAGCT

TACCGTACAAACTGMmTTTTGTCGATAG^ 

GAAACAAAAMTATTGCAAAAATACCCCGATATC^^ 

AMGCCGCCTATCCCGAAGCCTTCCCXW^ 

TGTGGGAGCTGATTCACX^TAAAGC(X3TTTTG^ 

ATGCACGGOXTAAACGM(K^ATATCAACMCGGGOT 

AAACTGGCACAAACGGCAAAAATCCAGATGCAGAC^^ 

GCTTCAACCGGTTITrcCTrcAT^ 

AAAAGCA(&TTTGG(£AGATTTCMGATA^ 

GCXOTMTCGTATTTACCAAAATACCX^^ 

TTGTGATTGAAGCCAAAGGGACTGACGGGGC^ 

SEQ ID 3876 

VVESAKKffiC(aOSNAPITIBQFVKAENKm 

DAFraWLVKGVRVFQEEMQGQffiAAVKLVSSM 

ERAPQPKIKFLTLFFIDDIAGYRSGNBLSGSI^^ 

NPNWQIC^SSGSTTSKLQEVGM^ 

KAAYPEAFPKGLDKGKVSNAKDEGKDTIIM^ 

KIAQTAKIQMQTLHQAFY^^ 

GVTVraiPRNSIHPVAGGGTYSPDFAYIVKT^ 

SEQ ID 3877

roTCGTCTO^^
TCJATCTCGACCAAAAACAGCCGGGCTTCC^^ 
GGACX^GGCAAMCGAOTTGTTCGCTGTG^ 
GCCACCATGTTTTPAAATATGTTTTACGGCGQ 
TMTCAAATACGCCGMTTGAMCAGTTCCCCATCATO 
CGGCCGTATCGAATCGATGTTCTCCGCCCTGCA^ 
TTGGCTTTTGACAGCGAAAAAATGCCCGA^^ 
AGCCGAAAACCTGCGGCC3CA 

SEQ ID 3878

MIEPGIKIMVCIiSGGKDSYAIiIiDIIiRRLQASAPID^ 
AWLNMFYGGKLKAMPPKLVSDNGHHIVIRPIiAVVK^ 
LAFDSEKMPERFSDGSBEDESEm 

SEQ ID 3879 

AACCTCGCCACGCTOTTTAAAAC^^ 

ATirTCAACGCCTTATGCTGATTGCCGGCT^ 

GGTCGTTCTTTCTGTCAAGAAACMCGACAGT^^ 

SEQ ID 3880 

NIATLF*NSISRTFLPCGIFKKTITLTO^ 

SEQ ID 3881 

ATGATGAAAACCTTACTG(XXTITTCGGC^^ 

ACGCGACTTACAACACXj(XICAAAGG<XX3CGTCATCG 

(&TGTITCTGTTGCGCGCCAAAGCCGT 

GMTGCGTCTCA(^CGGCATCTGTTCX^3TC^ 

AAATCCXjCAGCGGTTATKGTGGATATGCGCGGCTACC^ 

MTCATCGCGCGCGCCCAGTATCGGGGGC^ 

OTCGACAGCGTTCMGATNX&AAAACTO 

AAACCAAAAATGCAGAA 

SEQ ID 3882 

imTLtfFFGVARVSGEDRQ^ 

ECVSDGICSVIliHHRGIIi^ 

LDSVQDSENFTALAVIKFSAAQK^ 

SEQ ID 3883 

GTGTTATCGGATM^TGTPTACTCATTTITC 

GCACAACGCCCCOTATTCCGMGCGGTTT^^ 

CCGCTGGCTGGCGGAATGCGGCPCGGACAA(K3^^ 

AGAACGCCCGCCCGCGCCTGC^^ 

SEQ ID 3884 

VLSDKCVYSFFVHQNMEI^ 
RTPARACVEARLARPEWRVEIKITAVKRDAATA 

SEQ ID 3885 

ATGAGTAAACACACTTATCCGATAACACCTGCCGTAC^ 

TGTOKX^GACGAACAOTGGra 

TTTGGGTGCGAAACACATCGAACCXXKOGCGOT 

TCGGTGATGGATTTGGAAACCATCTATAT^ 

SEQ ID 3886 

MSKfflTPITPAVRVU^IBFEPmAYEEHGGT^ 
SVIffll^IYINGGKRGFIIGIRPIH)lJniJfi , KTIQAAV 
SEQ ID 3887



SEQ ID 3889 ,^ 1)WW ^ in(y _ WI ^^ 

CAGMGTTAAG(X^TTGTTCQGCATA 
IGVIVLGTLSGALFFKE^ 



HaCGTMTCAAAGOOTAAACGGC^ 
CTGCIGAAAAACAMTOJT^^ 



CGATATGT 

ESS 

ATACTITO3GCGTACTGAC00G(aTGG0GGCCTTG0GCGGC^^ 



CGGCGTGGXSATCAGAATTATARGGGGAACMTATOjMCIGGCW 



CAGCTAaaTCTWO^^ 

TTCTTCGCCTC 



s s======= 



ftC6GCT6GCTGGTC6AAGT6TGCTGATGCOT 
TACiAAG(X^TCGGTTAAAACAAG(X?KX^CAT 
CCTGGAAATACQCTTCAAM

GTO^TMGmTTTCGCCGCT 

CCCAAGAGITOXXXAMTCX^ 

CCTCWM3GCTCGC1^^ 

<KXXAATGCGGGCA(X£TG^^ 

AAACCGTCCJAGCCATCCCAATGTTTGCC 

SEQ ID 3894 

MPSVRRLPTSSTVWRKT^ 



($CGQAVCGHFVCGVERFAAI&^ 
SEQ ID 3895

acotgccoxxxxxx^^ 
tcgggcataagctgccacatca^ 

sgischisptqvfilippqnilprkadfiisg*ikvh • 

SEQ ID 3897

ATCGACTATAACCAAACTGTTTTAT^^ 

TATCOTATGGGCMGAGTTGGAATATATCGATAATGTO 
TTGATGAAGCG(^TT<^ 

SEQ ID 3898 

1QYNQTVLSHLQKFWKHHDIKG 
mGQELEYWM^FWLIiPITQTERi™ 

SEQ ID 3899 

ATGTTGAATMTGTCTTGCAAACX^CGGCGGCXKiA^ 

TX^TWXGCGTTTGGTATJ^ 

(^TAAATCAGTAACCCATCTGATTCTGCOT 

AACCGGCOjGCCGGCAGCCTGCCCCGTTIOT^ 

GCGATGAMTTGTC^^ 

^CAACCTGCTGGACATCGTGC^ 

CCCGIATACTGCCGCAACGGCCMTGCCTCGA^ 

C^CGTAmCGAACGCTATACCGAAGCC^ 

TGKXaTCACTm^ 

GCOK^CCTTACGCCATCGCGCCT^^ 

AAAACGAATATTCGCACGAC^ACTATTTCAGCACGGT . 

SEQ ID 3900 

MLNNVIOTAA 
* l^AGSLRRFVVLVVGOTRAA 
PEYCRHSBCLDHIIJ.^ 
AAPYAIAPS(^PMVMWFSKAFRQH(WIDFQCU^ 

SEQ ID 3901 

ATCCCGGAC(X^ATTGOT 
ACGGCATTC(WAACTTTTGC^raG 
GTOCCGGCAGCTAAAAATCCTG^ 
SEQ ID 3902 



SEQ ID 3903 

GGOTGATGGCTCTTACCTATGCGGGTGTAGAIAGA 

SEQ ID 3904 

GIJ4AVTYAGVDRLAAHFQQAITHSITGAPQ 

SEQ ID 3905 

ATGMGTTATGGTGTCAAAATCAGGCTrTCAAMCM 

SEQ ID 3906 

MKLWCQl^AFKTTF^NHEQAVYHAGAVGTL 

SEQ ID 3907 

ATGACCGATATTTTCACTCCATCCAMCGTTC^^ 
A^CGAT!AAGCGCTATGCA6GCAAACCGGATATCGTTT^^ 
CAATATGGATTTTTG^^ 
AAAGCTAm^AGGGAGCGATTAAATOGGTTG^ 

SEQ ID 3908 

ffi^IFTPSKRSFWMSmSKBTKPEVLVRK^ 
KMCRERLNRLVEEIKDAV 
SEQ ID 3909

SEQ ID 3910

IJCI£RQDVSAVADRVHISTNAN^ 

SEQ ID 3911

ATTTCCTACCCTTTGGTTCGGG^ 

GOTATCAACGGCGGCAATCAGAACACCAATGT^ 

T(MTAT6AAAAAC<nm^CAACACC(^ 

0CAATGTCTKTO3GGCGAATG^ 

TAAAAACACCAGAAAOTGACTGCGOTAa 

GGTTTTATGAGTGGAGAACTGGAACACACGGATGA^ 

SEQ ID 3912 

MKASQUHAVLLAAAFG^ 

WYEKRi^EVFSWQMADPLLPIAPKPGOTFAKVW 

GFMSGELEHTDMELSFlKWEimrc 

SEQ ID 3913 

GTGCAMCCTGCATTGCCTTGAG^ 
TCTGCOTGCCGCTCCGTCCM 
GGAAGTAGAGGAAMCGACGGCTATATTTCGATA^ 
GTGCGGCAGTTCGCCGCCCGCTACGCCAAACTGTC^ 
AGACGGCCTTGGAAGCCGAACXX^C^CCCGCG 
CATTCCCGO«5AAACCCGCCCCATCGT^^ 
GGCATCAA(£AATGCGTGTGGGAAATTGCGGGCA^ 
MGGGAATCACGGCGTACAGGTGCGttKJF^ 
TOOTATGCCGA(X5(^GCTCCTCCT^^ 
CCGCACTGGCGCATTAACTCACATCCCGG^ 
(X^TGAAAAKGGCCTGCTTGTmTTTC^ 
TTATTOAACGCCGTCCTGCT^ 
TGGMTATCAAAGGCGTATATCra^CTO 

SEQ ID 3914 

VQTCIALRDFCFYPFQTASLPKGPHPMTHRLCI^ 

VRQFAARYAXLPRHKPALIHLADA^ 

GINQCVWEIAGMCUffiRTLPAPTK^ 

HIWRINSHAGAKKTGYGGQSKTY^^ 

VnOKGVYPELRFHRTIAHSKAVYYRYRQNQ^^ 

SEQ ID 3915 

ATGGCQCATTTTPTCAATATPAATTAICCGACTTACAAAGGT 

CCCTGCGCGCCATTGAAMCTTTAAAATOT 

AGTMMCCCGAAATCG(£GCCGCCATTGAAAAGGC^^ 

ATGMCACCAACGAAGTCATCGCC^ 

ATCCCACGGGCTITOKXTTGCCGTOT 

TCGCACCCAGCTTCAAGATGCOT^ 

AATTTGGGCGCAACGGCAATCGGCACGGGCGTGAACACGCCC^^ 

MGCGACCTOGACTGCGGTGCATATGTGATOTAC^ 

GAAAGAAATCAAOTGCCCGMTTG(^GGCCGGTTCTO 

ATTACOTOXXX^^ 

GCGTCAAAGGCATPACGGTC^CCGCGAAATCT^ 

C^G0CCm0CGGCAAA(3G0GTC0G(^ 

SEQ ID 3916 

MAHFFNINYPTYKGKKMTVR^ 
MNTNEVIANLMjEALGYI^ 
NLGATMGTGWPKGYAELVVKKLSEVSGLPCKLTE^ 
ITFAAESGQIiQIfiVKEPVIAQCWETISLI^^ 

SEQ ID 3917 

ATGAAAAAAATCATCTTCG(XGCGCTCGCAGCGGCAGCC^^ 

(XAACGTCBGCGGTTTTTACGGTCT^ 

CGGCQiOTHaAAATCCGCCGA^ 

OWGGCAAAACCOTCCXXJTCA^ 

GCGTGGACTACCTCGTTAACGCCGGTATGAC^^ 

SEQ ID 3918

MKHIFAALAAAAVGTASATYKVDBYHANW 
RGKTAPVKHAEKFNCYQSBIAETEVCGGDPSTTm 

SEQ ID 3919 

TTGAAAAAATGOXXATCGGAAAGATI 

ATGAAGACGGCATTTCVGCTGATra 






AGGCTTATTGTTTTGCAGCTT^ 
GCAAACTTCGGTTTC(XX^ 
TTGAAGTTGMTTTGGTGGAAACGAAGCGGATC 
TOGTCATGTCGATTTOCC^ 

TMCA 

SEQ ID 3920

LKKCAIGKMPQQDSIYTCPDBIVHIVCBIiK!^^ 

ANFGFRHRALVAVEFFGFEFIX3GGF^^ 

VIFVHFVGGGGSAIXOCBRGkliDyFhDWIiCGPVKAF^ 

SEQ ID 3921 

ATGGGGGTTCATGITGCGGACTTGGTGCGGTT^^ 
TTGAGMCCTCCATMTGCGTTTGACGATTATAACGTO 

SEQ ID 3922 

MGVHVADLVRFSYQISBFYHKL^^ 

SEQ ID 3923 

CXjAAAAGTCTTCCAAATGACGGCT^ 

SEQ ID 3924 

RKVFQMrADCLFNSRIPnQiAV*TAKRAPRRH 

SEQ ID 3925 

ATGGGTTTOTGCMGGCAAAAAAATTCTGATTAC 

TCGJOAACTGGAAfiAGCGCGTCOmA^ 

TTGGGATGGCTTGGACGGTTTGGTGCATT^^ 

TACAGamJCCCGCATTCGO^^ 

TGG(MAAGCCAGCCTTGAGGCAGGCATCCXCTn 

GGCGAAATCACTIA3Hn i TGACGGC^ 

SEQ ID 3926 

MGFLQG^LIT(MSERSIAYGIAKACREQGAELAFTYVVD 

YSLPALAKAARPMMRGRHSAIVAIjSYW^VRMP 

GEITYVDGGYSINAIiSTEO j 

SEQ ID 3927

GTCTTATCTTCATATTOGA6G6TM 
TGCyUUyCCCACCGTPTACCXaGCAGGAAA^ 

SEQ ID 3928 

VISSYFEOTSANLVQTQKQSTOT^ 

SEQ ID 3929 

ATGAAGATAACACACTGCAAATTAAAGAAAGMCT^ 

AC6CGGTCGC(KXX3CGGCAGGAAAAGCG^ 
AAAATCATGCCGGACGGTATTGTTTATOX^ 
ACATTMCGGCATTGGGAACTTTTGGAATCAGGCAAAACGCK 
CACACCGTCCCGGCAGCTAAAAA^ 

SEQ ID 3930 

mraCKLKKEOTPLRSFVPmARSAAD^ 

HMPDGmADSPGSRGKSDAGG^^ 

SEQ ID 3931 

TTOTCGGCG(X2ftGftM^ 
CGGATATTTCGGGCGGCAATTATATT 

SEQ ID 3932 

LSAADHiALKINSGIKASFIiFMKT^ 

SEQ ID 3933 

TTGAGCGCGTTTTOTCTGCCXSC^ 

GCGACTTGCiXXjACCGCGTAAATTTCKK^ 

GGATGCCGTAGCCAAAAAAGACGCTTCCGTT 

SEQ ID 3934 

I^AFSSAADNTPQHSKWjQVHVRADAKRVKAAH^ 

SEQ ID 3935 

TTGGCAGGAATGTATGGCACGCAAGGCACGGC^^ 

SEQ ID 3936 

LAIMGTQGTASVQVGFAQIilKGASTAVHGIP 

SEQ ID 3937 

TTGTTGCCGAACEGGGCCAAA^^ 
CCTOAATCAOTGTGOJAAGC^ 

SEQ ID 3938 

UiPNRARPILRLPSSAA^ 
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GATGCGTTTGACAAIACC 



DAFDNT 



SEQ ID 3941 

GAWGCAMCGOGOTAITGJGTTCAA^ 



BmNYV^VQRGQFDRSFAVVGLPRTYVMAEIJJP 



TGGGTTCATAMK3ATTGTO 



ssssssssssss^ 



A' 

AGACAAM 



*ICCGGATWOTAC^ 



'ATQGTTTCCGGCCAGCCGCAT 
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SEG1D3948 

LFEVMKKPLISVAAVLLGVALGTPYYI^^ 

APETEKVLERFFGKQVFVSL^^ 

EGVDYNVKimmVTDLQI^ 

SGIiFTHDPVIOTKIFEFTI^ 

TVI SLKNHMiKIJK^TLQNEPDPDPDEGEMVSGQPH 

SEQ ID 3949 

AT6GAATTCX3GACG6(^TAGGGGA^ 

SEQ ID 3950 

MEFGRDRG7IEIQUOTIGKHGGKIKGCIU 

SEQ ID 3951 

A*reGCTTTAGTT6AATTGAAAGTGCCC^ 

(£GACAAA<XX3ACGAT<^^ 

GGCTGCCGCCCCTAAAGCCGAAGCXjGCTXKX^ 

CTATTCXX3CGGCGGTCCCGGCGGTTACTC 

TOCOTOCAAAG0CTTS1TO 

GGCGACGCGTACGMttGGCAGCCOT^ 

TX^TCGATTCCAGCXjGCGCACTGGCAC^ 

TGTGGTTGAAATGATGGACGGCXTGATGCAAGGCGCAGAC^^ 

GAGOCGAAAGAAGACGGCGTTTACGTTACOTTGAAGGTGCAAAT^ 

CAGAAAAA^GGTCTTGCXXOTACO^ 

A0CGRATTGTCT6(XtAAAGCATCA^ 

(XmaCXXXSACOTGGGCGAATC^^ 

SEQ ID 3952

MAlVELKVTOiaSHENVDIUVEVNVGDT 
VIXSGGPGGYSWAAADEGIiKVAIVERYKTIjGGVCIi^^ 

OTAYEQAAPTGEKXIVAFKNCil^ 
EPKHXJVYVTFEGANAPKEPQRYDAVLVAAGRAHO^ 

TELSAKASGRKITKANFPWAASGRAIANGCDNGFTK^ I 

SEQ ID 3953 

GTGMGAATGCAGTTTACGCCCATCAAATTGAAACAGAOT 

OTATATmTCTTTT^ 
TATTOTGAAGMCffC&TTATTOT 

SEQ ID 3954 

VKNAVYAHQIETOLYDGCnSTTTDKEIAKKPATS 

SEQ ID 3955 

GTGATQGTGTTAAATTCAAACTTTAAATTAATAAOT 

SEQ ID 3956 

VlMiNSNFKIJTyGKFIJYIEAIiVANKMSKIKDS 

SEQ ID 3957 

ATGACTATCGTAGAAATCAAAGTCCCCGA^ 

aX^KXXWGCCGCCCCGGCT^^ 
(XXXSTTCAAATCAAAGTTGGC^ 

TGCCGaCCCGCCGCCGTCXXTACATC^ 

GA3TK3GGCCAA(3TCAA^ 

TCGGCGGCGGTCTGGACTTtXn^^ 

(KHKmATTCCTCACOTACCOT 

ATCATCAAAGCCTCCGTTTC(XXKjCTGAAA6OT 

GCTTGGTTGTTCCCGTCATCAAAGACGT^^ 

OXCTXTITACCATTTCX^G 

GGCAAAGAGTTTGCCCCGCGCC^^TGTGCCC^^ 

GCCGCATTACCTTA 

SEQ ID 3958 

MSIVEIKTCDIGGHENVDIIAVEVK^^ 

AVBIKVGDmEDOTLITLErDKATMDVPCTM 

DI£QVKGSGLKGRIMGDDIKAFVKSVM^ 



GKEPAPRUiCPLSLSFDHRVIIX^^ 

SEQ ID 3959 

ATGTCCAOCCAATTACACGATGTTGACOT 
AATACTKXX^GACAAAGGCGTACGTATGCCCCACCX^ 
03WTTCGTGCGCTGGAACGCCGCCGCCA^ 
CACTTCTOSAAAQCCA^ 
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CMCGTATCGGCGACTO^ 



QLHFYKESVI^IIiQBGINEPGMffilMIAAA^ 



mVAMAMAEQGTOSKmQQMmGrKADSAPSWKR 



QACATCACTCCG 

iwiiiiwMfinfi^, M _ 

lC GCAGGCT^^ 



GTCFTTGC 1 1» CM^sS^^^ 



SEQ ID 3966

LFRFDLEtFVFAYYPSQTOIKGIiM'PSliPIKHST 

SEQ ID 3968 
SEQ ID 3970 

VGYMVHSLSAIFLLQKH^^ 
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SEQID3971 

TTGGAGGTTGCTGCAAA^ 

MGAAGCCCCTGCCGMGCCGCGTTTGCCAGCCACM 

AGTCGAAAA03TCGTGCm»GGi^ 

CTX3CTG(XXXrPGmGACXGCCA(^ 

ACCACATCOAACCmTTCC^^ 

CTATGATGCCATGTATGACGCTCACTGC^^ 

GCGGAAAGCGGCGAAGATCTGATTGCATA^ 

MGTACATACACCAAACGTCAAAACCATTGAG^^ 

GCTGTTGCGTGGCGACCATGAGTTTAACGACAT^ 

TC(X3CGGCCCCGTC(^TC^^ 

(mTGCCGOXJAGCCl^^ 

GCGCCmAATATACCCMOTATC^^ 

(^GMTAACGACGAAAAAGGCATCATTTGGACCAAA^^ 

CC^TTGCHBCG(Xa^^ 

OTGAAAGAAGGCMTGTCGAATACGCCGMCGCC^ 

SEQ ID 3972 

LEVGAKPAFFALQSAPDUiOT 

LLIU^KDRHERDFCMGPTCmiAD 

AESGEDVIAYSOTSDYAANIELAPTL^^ 

SIX3PVG?TGKVYADFATEKGADWVIGANE^^ 

QNNDEKGIIVmftMAPFEVVI^^ 

SEQ ID 3973

TTCGGGGAAAGTGMGTGGCTTCAGTATTTO 

CAGMOTGTGGTGTTTATGAAGC(X3TCAGGCAGC^ 

GGATTCAGAC(KOCGGGC(^GG(m^GATAT^^ 

CTQAAGGCAAAAGGGTTCAGC^TTOT 

GGGAGACTCGTTK&TGAmAC^^ 

TGCGGGGGACAATTTGGCATTGGAGCTGGACGGCTTGGT^^ 

SEQ ID 3974

LGESEVASVFLEGEAVEVGNirciGRN^ 

LKAKGFRHSACVSDFAAADRI^ jj 

SEQ ID 3975

ATGOTAAAATTCCAAAAGGMCGGCACTGGCATTCGAC^^ 
GCAACGATGAAAAATTCGCGGCAATCGCCM 
CAAGTTTGGAC^AGGCTGAACXjGCAGGTO 
AAATCGGTGCTCGATCAAGTGGCGGCGCAAG 

SEQ ID 3976 

MHKIPKGTALAFDFGEARIGVAQGDAEM 
KSVLDQVMQAILHGFIiBGGP^^ 
SEQ ID 3977

GAAGTC(^ 

GTCGAACAGGATGTGTTCGTCGGCGGGAACGGTCAGCCACGC^ 

SEQ ID 3978 

VQDGLRRHLVEHRFAFFAQDI£FRKQAFGTD^ 
EVECQCRSPWNFMHG^ 

SEQ ID 3979

A3^CCT(nCCAAC(^CT^^ 
AACCCOTCOGATTACGATGGACATGATT^^ 
CCAO^CTCCGATCGGCAACK^^ 
AGCATAGGCTATTCGAGCTGG&GCAAAGGGCAGC^^ 



SEQ ID 3980 

MHbSKHFLVAMPKffiDAITSQSVVnCH^ 
SIGYSSWSKGQLERELADNmTVPADEHILFDI 

SEQ ID 3981 

ATGAAAACCCTGCTCCim^ 

CCCttXXXXXXXmmGAta 

OKyOGATAC(XXWCGGCTACX^^ 

cramcGmxxriTO^ 

GCCTACGAATCXmTACCMCyU^ 
ACACAACCXKXmAACCGTCCCG^^ 






SEQ ID 3982 

SUMPAAALTKNNSIUCGERSAGLSVNGTGDYRN^^ 
AYESQYQEQYAIMGPYSVGKTVKASDRI^^ 

SEQ ID 3983 

GCAGCGGCGAAACCGATAAAATCGGAC(XX£^^ 

TTCGTATGCAAAAm 

AAA(XX3AAAGAAGTITrCAAGCGCGOT 

AAGGATCGCCGAAAAATTCCTTAAGC^^ 

(XXXJATAAAAAATCCCGACAAACGC 

SEQ ID 3984 

VRPSKQSKFCBIflBNG^ 
KHEVFKR(rTGFAFKSRQIVRYyDPKRKAPAYLW 

SEQ ID 3985 

GTGTCGGAACACAATATCCGTTGCAGCAITO 
GGCATAAC 

SEQ ID 3986 

VSEfflniRCSIQWPHNlRIiFEPIASIRAKKVQDWANGIPBRHH 

SEQ ID 3987 

GTGCAGCXITTCAGTATTATCCAACTTTTCCG^ 
ACCGTTCCGCCOAAGTGAAA^ 

GGCGGCAAMGAGTTCAAAAAAACCGCCAAAAACCGCAAAGAAGC^ 

AAGO^TG(X^GCCCGTGTCCGGATCCAM 

AAAAAGACACTGTTTCTGCA^ 

OSCCCTTGCC^ 

GAACGTATCCGCCAAGAGCGTATCCGTCAGGCGCGCGGC^ 

CGGAAGTCGCCGCTACGAAAGCGCAGATTTCCC^^ 

TTTGCGTTATACGCGTTATGTAAACGCCTCCMTCGGGMG 

AAAATTCAGGCAAACGTGCMTC(XIT6CTGAAAAAACA^^ 

AGAMGGGAACGAGCAGCAGCTGAACAAGCTCTTGAGCAAT^^ 

(XAAAAAGCCAGAAAAGAAGCGGCGCAGCAGAAGGCTGAAGCG^ 

GA(^TTTCAGCCGCATGCAGGGACGTTT^ 

CTG(MCGGTTGm(^TTCCGCC 

AACCCTTCGGGCTGGATACGT 

SEQ ID 3988 

VQPSVLSNFSEGRTMRYKPLLLM2&VFSTPAVAM©W 

K^GKFVSGSKEKNAKTQPE^Q^^ 



KIQAHVQSLLKKQGWTD^^ 

IX3FSRMQGRIJa(PVDGVPTGLFGQ!^ 

HPSGHIR 

SEQ ID 3989 

TTCCCGTTTGCOGTACCTGTGTTT^ 
GQCCQ 
SEQ ID 3990 
LPFAVPVFSGFWKSREKVK^ 

SEQ ID 3991 

GTCTCCGAC(£GGT(XX&CTCGC^^ 

CCGACGCCmaCXMCCGGMTTGACCMTACCGra 

CAGTACCATGCCGTCTTTGCCGTC^ 

TCCAACACCAGCCOTTGAGCCKsTTT^^ 

GTTCGATCAGGTGATGGCGGACGCTTTTCAC^ 

CCGaTTTTmCACCGCTO 

ACCACTTTGACAAAACCXjTOTCCT(& 

AACOaXXaCMACCC^ 

OT^AAOTTCTCTC^ 

CCGATCAA£X7TATCCAGCCCGAAGGGT^^ 

CGTATMCCCTTC(TO5CGGAAATTO« 

CCOBOGCAATGCTT^ 

AACGTCCCTGCATGCGGCTGAAACCGTCGGCACTO 

CTGCGCX^TTCTTTTCrGGCTO 

TTGTT^GCTCCTGCTCG^^ 

(X^G(^TTGCACGTTTGCC^ 

G(XX3TTTACATMC(XOTATAA 
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ATCTCOGOTTOT . 
(&CGGATACOT\^^ 

GACATCATTGGTGGCAGCGGCMCGGCGGCTTTCMT^ 
TTTTTATTTGCAGMACAGTGTCTTTTO 
ATCCGGACACGGGOTX5CCATGTGCCOT 
(XXX3GTTTOTTGMCTC^^ 
GCCTGTTTTTTCACTTCGGCGG^ 
AAAAGTTGGA 
SEQ ID 3992

QYHAVFAVASLGADDDRFRRQKRIUIHADRAGQKPAGVVA 

PHFFHRFADRHTAWWIIDFHNBIAAF^ 

DFVIPPVLIX&KTLHRQTYGHAA 

RITIAGGNFAQTGIDADWIAVID^^ 

IJmFFSGPFa^FGFSQPSFCFRIIiNAVFGFPFLQ 

GVHTRITQKAWI/ntFGVFQEQGNRIRPAVF 

MIGGSGNC&FQFVGIGFGIFLFKVFFVLlffl 

(&FFELFCRLFAACFFTFHCGIFAFFFFAFRRF(n^ 

SEQ ID 3993 

ttgggtgcmtcacxxxxetggc^ 

GTCAGATTMGGCAAACTACTATCATGACAAACX^^ 

CGAGATAMGGAGTCCACCAGCGGCGMTTK^ 

AAAAGCGGCt&TTTCa^^ 

AAAATGCCGACMGCCGATAGTCGT^ 

GACGGTCGAAAGCX3TCAATAC(XjCCGCAAAAGAGCTGGTAAAGG 

GGCGTGTC(^GGCX3TTTCTGCCGTCTGA^ 

ACCCTTTGGCGGGTATTCCTGCCGAGTTGAAAACGA^^ 

CATC£TCGGTACGCAGAGCTTCGGTAAAGG^ 

GCACAGGGGATTGTTKEGATGTCGAAGTAAM^ 

(XX}TTG(XX3TACCGCTTGAAAAAGATGCGGA 

SEQ ID 3994

I/3AISGVAVSLAVQGFAAEKDGRDNEVIjP^ 
KSGDFIVHDNVSTRGMVSEAVKRMRGKP^ 
GVSAAFIi^EAVWSTKGRDGKDGMVIi^^ 
AQGIVPDVEVKDKERTFESREADLV^ 

SEQ ID 3995 

ATGTCAAACATCGMCAACAAGTTMGAAMTT^ 
TGGAGTTOTTATGSOTTOjAAG^^ 

SEQ ID 3996 

MSinEQQVOTIAEQLGVNE^^ 

SEQ ID 3997 

AGGTGCAGTTCOTAOK^^ 

GGTTGGTCAGG^GGAAACGTCG0GGGGGTKXXX3A^ 

GGCGGCGGCGGGGGCGTTCAAAAGCGATGTGGAAGTGGTTAOGO^^ 

TTGTGGTAGCCGCCGCGTATCAGTGCGTCGATGGAGTAGOT 

AGTCCATTTCTTTGACGGCGGCGCG^^ 

GGGGATGAGGAGGAGCAGGGTTPTCATGGCAGGGOT 

SEQ ID 3998 

IiFLVWFVG(X3FRGNQQFPAVAVrc 

(MGGGVQKRCGSGYAGERVGFGGGGSVIG(^VSGGAVGVVVVAm 
GDEBEQGFHGRVCFVItfGFECKWIIiR^^ 
SEQ ID 3999 
ATGGACGGAAATCTGCOTrcr^^ 
TA 

SEQ ID 4000 

HMNLPSERLERLHAHOTEVGSAYADVDnRNPFS 

SEQ ID 4001 

TTGACGATGCCGTCTGAAAGGATTTCAGACGGCA^ 
CAAGGCGACGAAGCCGCAGACAGTACAAA 
SEQ ID 4002 

LTHPSERISIXSIIiCMGQVl^ 

SEQ ID 4003 

ATGGAAGTTATCCGATACCCAAATTCCOX^^ 

AAACCCTGCTCGGCGTMCCGGTTCGGGCAA^ 

CGAAATGC«XX5AGTTTTTCCCCG^^ 
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AACC^CACATCGAGCAGATGCG^^ 
AAATGGTGCKTK^ 

CGGCGACGTGATTGACGTGTACCCCGCCGAAAGCTCC^^ 
CGTCTCmaXTPA^ 

MCAACGCCCCGTCGAGCMCAACGCATCGM^^ 
AC^amACCGCCECCCA^ 

MTACGAAGAACaACACOK^ 

CAAC(&C£GTATTCAAAAAGG^^ 

ATC6ACACCGTCGAACGCGTWM 

TCCTC6AT60(XaCAAA6AAGGCTTCCTG^ 

CATGAAAGCCtXXXTKXiACGAAACCGMC^ 

TAraCGMGAAGACAGTGGCA^^ 



SEQ ID 4004

MEtflRYPNSPFKLHQPFPPAGEQPT^ 

NERIEQMRLSATKNLMI^OT 

RVGRYTVFPSSHYVTPRDTVLRAC^ 

lOQNLVDYGmPSARimPLKFHEFEKVMPQTV^ 

IDTVERVEIIRDLRl^LFDVLVGim 

SEQ ID 4005
GTGGCGCACACGCTATGTATOAGGCTGCGATTAOT 

SEQ ID 4006 

VAHTLCIOAAITISKSIiRPKQQIIiFNIANPIPQHSSFFKL 

SEQ ID 4007 

ATOXrKBTTTCCCC^^ 

C06AAAGCTGQG6CAATCTGTCA6AAGAATTCCGTT^^ 
ACCGAGTATGGTTGACGTGGAAMCAACGGCCACACX^TT^ 
CCGAGCGAAAACCAAATCAAA(X3CCGCACTT^^ 
GCCTGTCTIXX^TCTGGAACGm 
GACCACGCCGCX}GTGC^CAGAGGGCGTArcATGGTTGGTGT^ 
CAGCCTCTGMTGCACGTGTAGTTATTGAA 

SEQ ID 4008 

MPRFPRTLPRLTAVLLIACTAFSAAAHG^^

PSENQK(ffiTFEMEAHFVHIJ>ENKQPL^ 
QPLNARWIE 

SEQ ID 4009 

TTGMTCGGATTCAAATATOTTGTTGCGGOT 

SEQ ID 4010 

LNRIQ ISIiLRLLIFFRIKVAMPHGYPNKAGLIEK 

SEQ ID 4011 

ATGACATATTTAAAAGTMTAGCMTTAGTATAGTTTTATACATTTTAW^ 

AGCiATATCGATCTTACCCAAATAATTT^^ 

AMGMTGATAAAGACTATACTATTO^ 

SEQ ID 4012 

MTYLKVIAISIVLHLLIiQINIiKMM 

SEQ ID 4013 

TTGGGACTTGGTATTCAAACGGATCGGATMCA^ 
AAGTTTTGAMCAATGGTTTGMCXXXIAAAAACG^ 

SEQ ID 4014 

LC^IQTDRITDRRIKKCRIjK^ 

SEQ ID 4015 

AACATCGAACAACAAGTTAAGAAMTTATO 

TCOTATGGCTT^^ 
CmCACMCA(OTCTGCTG(^CC(XXm 
SEQ ID 4016 

NIEQQVmiAEQLGVNEADVKNESSFQD 

SEQ ID 4017

GTftCAGAAAGAACCGCTC^^
CACTCTmACCGTAAAATOXX^CGOT 
ACGOGGTCGCGQCQCG^ 
AAAATCATGCCGGACOTATTGOTATGCCGATAK 
AanAACm^TTGGGAACTTTTGG^ 
CACACCGCOXXXJCAGCT^^ 

SEQ ID 4018 

MKITHQQjKKEVQKEPL^ 
. KIMPIXSIWADSPGSRGKSDAGGFTRCRINRSKEFADRRN^ 






SEQ ID 4019 

TTOTAGACTfcCMTCT^ 

TTTTGCGCGACCGOTTGCTGCCGAM 

CCGTCTT6CGGA6ACG6C6GATTT66AAGGGAAM 

GATGCCGTTTATGCGGA^ 

TTGC6GATTTTGTCCACAICGGCATAG 

CTGCCTGGATGAGGTCTEATGCCG^ 

GCXaGACGCTATTCCGTCTGG^TCGCCC^ 

OX«XnamCATTCCCGTTCTG^ 

CTGAACCAGCTCGATATGGAGAGTTTGG^ 

TCXTACTGCTCCACCAAGGCACGCGCC^ 

AGCCCTGATGAAGGGTAAAACCTTAGACGAAGCACGCGC^^ 

ATCCTGCTCGACCGCCTCACGCCC^^ 

ACGGCAMCAGTTGGCAAAAACX^TCATCGGCGAACTO 

GGCA 

SEQ ID 4020 

LLDYI LLYFNPVLKNQGKQMDAFTRAWYALERHYQDKB 
DAVYAK3RDVLPEIRRELNRAL 
GAGFSESETAHHFCAVSADTEAAQSFGE^^ 
l^LDKESI^HCSRASIWSPMa^ 
II^RLTPCNL(34IMAyBHKTFVQGAIWHVOT 
SEQ ID 4021 
ATGACAGACACCCCCGAAAACCAAACACCAAACGAC^^ 
CCTTATGGGACAGCTTCGGCATCGACTACX^ 
AATCGCCCGOmmmX^C^ 
GATGCCGTAGAAGTTGTCGAAAATATGCTGCM^ 
TCATCGCCAAACTACTGCCCAAACTC^ 

GCK^^CTACGCCCO^iam , 
SEQ ID 4022

MTDTPENQTPITOLPAGHSI&IRSFV^ 

DAVEVVENMLQDGSLKIHIFFPDPWHKKRHH^ 
SEQ ID 4023 

ATGTATTACTTCGGACAGGGCACAGCCGCCGACTAC^ 
GACACGGTGTCAAACCGGAT 

SEQ ID 4024 

MYyFGQGTAADYHKARKWFEQATSQKNSMAFYNLACIH^ 

SEQ ID 4025 

ATGGGCAAAACAAGCGCAAAAAACAACAATCC(XACGGGCAATAC^^ 

CACMGGCTTGCCGGAAGCACATTGGCMCTCGGGCTGC^ 

TCCCGCCTACA(XCC(CTI^CGMCTCATCCTAOT^ 

TACCTGCAAGGCAAGTATCTGGAAAGAAACCACAAAC^^ 

TGGGCATAGCCCTCGATACXjGAAAAAGCCCGGCATTATTAC^ 

TGACGGOTCAAAGATTCCGCCATTAGG 

GCAAAAAMCAGTATGGCGTTCTACMCCT^ 

GCAAAAAAACX3TCCTGCAAGAACTGCTG 
SEQ ID 4026 

MGKTSAKNNNPHGQYLIAQYCRHGTPP 

YLQGKYLERNHKLAIjHHAEAAAAERHPEGIiRIPG^ 

QKPQNSDTAKPKPmVCITSI^^ 
SEQ ID 4027 

ATGAOTCGACAAAACGGAAAACGATC^ 

CAGMGCGGCCTCG(X^(^TCATCCCGa 

ACAGGGGCATCCCTATGCGCOTACAATTT^ 

GTCCGCCTGATGTACCTTCTGTACGCGTCrc 

(XGCACGCCGCCGGATTTO 

GACACGGGAOWjGCCGTCAATCATTT^^ 

AACAG(mCACA(X3AAAATGACCC(X3A 

CGAACGCCATCGMAAGGTTTGCGGATACC^GG 
SEQ ID 4028 

KIRQNGKRSIMNPDLPYR^^ 

VRIJCYLLYASRHFEEALE^ 

NFRHRKM1TMPMPHWPTSTCKM 
SEQ ID 4029

TTGACGAAGGCGGCAAOGTC^TATACGTC^ 
(XaCATCX^CCACCATC^^ 
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CCAACACTATCTGGGCAAAAGCAAACCC^ACATCAT' 
ATMSGCGMCGCAAAGTTTOTTGAAMJTGGC^ 



REDLEKVEGISKAIABTIYHHia 




GTCCGTGCGACAAA6CCI 

»IXXX3CCTGAAATGGGTAAAAATCGA^ 

SEQ ID 4032 

VLGHCFSFJ 

IJFAFFYNIIiGIPIAALGFI^^ 

si—-— 




=^ EE 



OTCAAAGGC 

SEQ ID 4034 



J — " « "^j^iiJ\^SNSF6FGGTNUTbWwwRfa 
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ifCCCAAGOGIK^GACGGAGTTTGCA^ 
QUOPVIVYIJCmDIHFSVtRGVDGNmi^PS^ 

SEQ ID 4038 

LQDIVAGCMPAGGySELTKTGTAIiPRPGSKRTIP 

S ATCK!C^ 
AM^WaCAACGACTGAAG^^ 
CCAAATCMCGCCGTCGATCTGOIAAAACGC^^ 
MTBACQWlGCSmSCM^^ 
AOTTGCCAATATOXXXAAACCOTACC^ 

GGAMGCAACCTGCMGMGTACTGGGCGACGATCGGGtKXjTAMTGGCGCATTATTAAAA 
(KBACXn'AAAAGACCATACCTA(XGTCA<Km 

S K^i^PAGmAvm^ 

SEBAPAGKGVSMaTCDSPFYItBtP^ 
KPNTDIFK(^IiEMDKAGYLKTKGCTADHV^ 

S !^ramLlGATCGTGTmTC(mCATC^ 

CAA !l 

SEQ ID 4042

HRHYEIVFrVHH)QSEQVPAMV0WKOT 

Q 

SEQ ID 4043

TOmGccGmcTOA«:a^ 

TCATTGAAAC<^C«3H^^ 
AjflCTTCCAAAOGGIGAAlCTTACOGITTGOCTCOOIW 
CATGTTATCTCCTTATGGATGCTAAAAACACCCTTCTGCCATGCGAA^ 
CTTGCATTCCGCCGAAAATlTCGGTOCAGAOGOCATTraAATGTTTTO^^ 

SEQ ID 4044

HWSLWMVKTAFCHAKAB(3WQ1MII1/jFAK 

AAMTGGGCAGCAAITGCCTTGTCC^ 
AAGCAGAIXTTCCCTGATGCCGATGCTC^GGAT 

SEQ ID 4046 



SEQ ID 4047

^KX?W3CCC6CATCACAGGAACGAAGGC^ 

SEQ ID 4048 

MARQSFKRRKFCRFTAEKIQBVDYRQ^IiOT 

SEQ ID 4049 

GTGTGGACGGCCGOTCnTCGGTTCCOT 
TGGAGAGTACGMGTGGAAGTGGCTTTACACACTGATG(^ 

TGGTTTGT 
SEQ ID 4052 

HP5mPUKH^AFVLFSVY^ 
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ATM«KX5GCAGGGmCAGMM^ 
SEQ ID 4054 



AATCAACGCGG(^CTGOTra 

SEQ ID 4056 ^^^^^nmvnmmr^ 



TMCEKC^LTRTVTQDl^^ 



KAAVCKPIDYPK 

S lre^0 0 -~ «.«n«ni«ll*^^ 

G(mTCmTOCCGGCCGGA<3OTA 
SEQ ID 4058 



AAAAGGCAATGGAATOCGC^^ 
jXXXWrrcCATCCGATOCGACTGCCTCCC^ 




CGGTCTGAATATitjwwuaa i^wu»wwi ^» ~ ___ 



LSSAKUCESFGIHLIAIAGKMIiTfHIjL 
SEQ ID 4065 



GAAAAAATACAGAAAGGCGGAT 
SEQ ID 4068 






SEQ ID 4069

TTGTOTCAGACGGCATTGCGGGCAACCTT^ 

AACCTACAAACTCXTTGAAAATAC^ 

TCCGGCAAAATCCGAAACAACACACCCGGTGGCAGGCAG^ 

SEQ ID 4070 

LFSDGIAGHLCGGRKTLSYNLSRPKISIRSEM^ 

SEQ ID 4071

ATGATGAAAMGATACTG6CAGTATCGGCACTATGCCTC 
AGOTAAAACCACAAAAGMGATTCGGCGAAM 
GGTOGTGTCGCTGAACMTACCTGTGTCTCG(^^ 
. CTCAACCAGTGCATAAAAAAATAOGGCGCAC^^ 

SEQ ID 4072

MMmiAVSALCI^AAAQAAITPYGmV^ 
UJQCIKKYGAQGQCGIiETVYCKSSYY 

SEQ ID 4073

ATGOTACATTTTCAAGTAMTCCAAAa 
AfTOC 
SEQ ID 4074

I^TFSSKSRQKCRLNVEAKVQTAFFVIGSNGAGSNNCPIPYS 

SEQ ID 4075 

TTGAACO»TTGAAAGAAG^^ 
CIOTX5CGCTTTT««X^ 
CmTOKSGGCTGCOT'CCACa 
GTCACCGGC(XTGTCATCATGGTCATCGGTC^^ 

(xrmAccTTTGCCxsmaxxrATOT 

CACGACAAGCATTGCACACGttXCCTGGTTC^ 
GGCGGCATCATGGCAATCGGCAATGTAACGGGAAAAGACTACAC 
CGGTTACAACOTACGGCGAAGTAACGGGTGCGGTGATGATTACCAA 
GTTTTTGGCTTCCATTCCGATGOT 
AACCTGCTCATCGT^^ 

TGCTGCCCGATCCGAAAGAATCC

SEQ ID 4076

IMLKEDGMNQLKIA^^ 
VTGPVIMVIGIiSVAVAASSMft^AKKQVIDm 
GGIMAIGNVTGKDYTKDPGLDKTLAGDGLGVCV^ 
NLVIVSmTOIGGm^KLGSFSFVGVGirAVIAIVI^SIxLPDPK^ 

SEQ ID 4077 

ATGGAGTATAATCCGACAOTTCCATA^ 

SEQ ID 4078 

MEYNPTLSIFNPQAVPNPYFRRHFPVIKHroKIP 

SEQ ID 4079 

TTGTCGGCTTTCTOTTGC^ 
TTCG6GACXX3CTTGCG6ATT6MTATGGAAAGTGT^^ 
nTTACGCAGCCCCTCGGCAAACCAGTA 
SEQ ID 4080 

LSAFLLRADWGIXjLPALPFAVFI^ 

SEQ ID 4081 

ATGGGTTTGGCGGCAA(XCrcTATTTGGCGATTO 
GCGCGTATGTTGCC^TGTCGGCTATTATCT^^ 

ATGGTCGGC^TACGTGTCGGCGCAGTGTTT^ 

MTTTTTGAACGGTGCGGGCAGGGCGOTCAAAAC^ 

ACGGCGCATGGTGAAAGMGCCAAAAACATCACTGCC^^ 

TCGCTGTTTGAAGACMTGMGTAC^ 

CCGAACTAATCGAATCCAAACTGGCA^ 

AATTGTTGCCX^rcTCGAAAGATTTGG 

GTGATGTTGAGCGAAATCTTGTCCTCGOT 

ACCTTTTGGTCGCCGGCATGACCGGTTCG® 

GCTCGAGTTGAGCATTTACGACGGCATTCCGCAC^ 

CIKTOX^CGCCGGTGTACGCM 

CAAATGGGCGCGGACGAAM^^ 
ACTACGTCAMTCGCMGCCC(^GCCGACTATA^ 
AGTCGCCTAT(?rTTTGGAAA(XAAAAAMCCTCC^ 
CCCTCCGACCTCAACGGacm 
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SEQ ID 4082 w*,,*^^ 



PSDmGSEKIWHKDHL 
SEQ ID 4083 




^ ^„ - - - - — - — - — - * - » Aftfifit (wwunw * a fli "i -n «-i-n «ahhm .iJsu.tMUiiunjw vnnw* wv«»»»jC A 3CCGCC 



KVGVQSHLRMCMLLKAANRLSAPAAA 

SEQ ID 4085 



SEQ ID 4086 

MRPKyRTVALFASPCREEGLSFYQHHKPKGIj 



AMWX3ftCGGTTTCCGTCCGG6TAAXOCACCTTTAAAaM^ 



j^pi^FAKAKIS^G^^ J " D " 

KLEPTEHQIKAWANFAJSYEDPQBVIIMYADTSRIflAPTSIAVESHTO 

ACCGCCTGAT 
AAACCGCGCTTOTT^ 

SEQ ID 4090 „,,™. IOT ™.™* B ^^ 



FALPBSRDHHQPLISG6WGQASDIBI 

S5=55S5SSSS^ 

SEQ ID 4092 .^^^iWAmimGunn^^ 



ilTTTGGGCTTGG^TCGGTftCGCITCTGGTCGCtATC^ 



TMGTCTATGSCW0GGGGKTGOJKK6A 



ACTTO3GCAACGTGGTCGCC 
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SEQID4094 

MAKK(XjFSLPAKKBKRFIFK 
LSRAFS^KYFDRFYCM^ 
FLKLKARSIKIORRMDAILLRMPIFG 
YEDEVDHAVGRLSAMKEPIIIVII£^ 
SEQ ID 4095
ATCCGGAACAACCTTCATTTTTGGGGAMTC 

ACTG(^CACCAGCAGCACCGCGCCGCC^ 

TGTGTbGCTATGCOTACACCGAGTAACACGA^ 

CGCAT 

SEQ ID 4096 

IRNNLHPVTCNPRHERVQQYGYICT 
CYAMRyi!E*HDKPAVIffAAGFRAGF^DIHUITyRPLTOMH 
SEQ ID 4097 
ATGATAAAACCGAAC£TGAGGCCGAAGCT(XXX^^ 
TTAACGACACCGGGGCGGACATCTTCX^T^ 

SEQ ID 4098 

MIKFNIjRPKLGSSALIAn^LYSSbVl^ 

SEQ ID 4099 

ATGAAGATAACACACTGCAAATTAAAGAAAGAAGTA 

ATTOCGAAACGCAACGGGCGGOCCTATA! 
AMTCATGCCGGACGGTATTGTTOATCCCGA^ 
CATTAACGGCATT(XX3AACTTTTGGAAT^^ 

SEQ ID 4100 

MKITHOCLKKEVQREPLRSFVPEVTARSA^ 
KSCim/IiFMPIARAAAASRmVIiPVAASTVP 
SEQ ID 4101 

ATGMCaCACTCCAAOKXX^ 
CCGaTTGaGATTAAGGAATTGAGO^TG^ 
TGCaCACCACTGATTGGC<^TTTCCATm 
AAAGGCGATGAAAAATITCCCmATGATTC 

GTCGTGCAAAGTaGCGCGGTTCAT^ 
GCATCMCGGCGGCAWATCTGTTTTGCTO 
TACAGACTATGGGTTTG(X3CftX3TT^ 
MTGTCTATATGGCGTACCCTTTATC^^ 
AC(XXK5GAAACACCCGTCGCACCTGm 
GCAAAMCAGTTCTAMCGCAATAAAGGAAAAGTCATCCC^^ 
SEQ ID 4102 
MOTfcQRRKTHQVLIDHIWG 
KGDEKFAFTCIKTAAENDXAVRIGVNTO 
ASTAALSVUiQEGIGOTIRISLTPBPGSPRTQEVVVGQE^ 
TGETPVAPVYVWSERKVTLKGKN^^ 
SEQ ID 4103 
ATGGCAGCAGATTTTAATCGGACMCAGGGGT^^ 
CAGAAAAAGCCAATCAGGTTTCCMTATCAAM 

CGAAATCTATCAATACCTGAAAGTTAACGACAAGG 

ATATTTCGACAAAGCCCTGGCOGACCCCAOT 
GCAGCGCAAMCAGGGGCAATTCGGATTCGCG^ 
GTTGGGC GATGCCGATTACTACTTTAA^ 
GMTATGAAGCA(^?TCCAGGCAAATTTCCCC 

SEQ ID 4104 

MAADFNRTTGVIMPPKPSKHSIjUjVIALGACSTSYRP 

YGWFLCGRLNRPAESMAYFDKALADPTOT 

EYEAQLQANFPYSEELQTVI/rGQ 

SEQ ID 4105 

ATGAAAACCMTCTGCTCAACTACGACOTC^ 

ATTTTGACGAAATGACCGATTreGCAAAATC^^ 

TTTGGAIGTCGGTACQGGGAACGGCGT^^ 

CGGCAGGGCTTCAACCGCAATTTMCCGCTGC^ 

GCATGGGC^GCCGATGGCGAAOTCGACM 

CCAMTGGACAGGTTGCGCGATGTCATGCCGGTO 

ATGGCCGCATGCCAAOGCTATCT^ 

AAGATGTTCCCTGCAAATTCAAm 

OGTTACCGTTCGAAAAACT^^ 

GGG 






SEE™ 
6 

OCRMR 

SEQ ID 4108

IiIQnAQFLFEEHB3YK3UiKTIjtflKTSNQTaiPCMETiF^^ 

SEQ ID 4110

MAIERTISIHPDAVGKHVIGKIYSRPE 
VQBMiEIAYFFSQTEICPR 

GGCATCGGGTTTGATGATOGATATGGTACGTTCMTCGC(OTGOT 

GIGFDDGYGTFHRHAISLFLLKKNQIGILPKK 

TCCflOTGttCCTGftGCTO^^ 
SEQ ID 4114

TTCTCAAG 
S 23p!™ 



SEQ ID 4120

UlPPVEKHND&HItGGISKGGVLIMCRYBPKCIiIiFQMP 

GGGAAAC(X)CAAGTTTGAAGGAMTCATTTATAATACCAGCGGTAAG^ 
SEQ ID 4122
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SEQ ID 4124 

IiEGAKTQQEQVAYANSAE 

SEQ ID 4125 

ATGAAATTTGCCGTCTATTTGCX^ 
M"GCGGCAGAACATCMGTGG?rGT^^ 
ACAGOTTCGGCGGCGAATTATCAAAT6TATGAW 
CACGGCGCATTGGTATTGGCACTACGGAATCKK3GAACATTA 
GGAAA(&OCXTK»flAAMTC 
. TACCAMGATTTTAiTCGAAACCTGTC^^ 
CACGCOGCATTTTTGAAG 



HAAFLK 

SEQ ID 4127 

TTC^GCCCGAAAGATTACGATAAACXIMTCAAA 
GTGCACACCGAGGTIWGGACMTCCATCAT^ 
TCTTAAAGGCCGCTCCGAGCTTCCGAAAAT^^ 
ATGCACGAAGGTAAATCGATC^^ 

MHEGKSIRAVIK 

AAATTCT«GAAATCGAC(n^GAAM 

CCCTGTGCTGCTOSGACACGAAGGTGCQGGTG^^ 

TTTTOCCTCTC^AAW 

GCTGTTCGACTTTCAGTGAATATTCI!^^ 

GGTACAJAACACGGCAAAACTACMGAAGGCGATPCn^ 

GAT&CCAATCCTGCAAAATTCGAGTO j! 
SEQ ID 4130 



DOWPAKFELAKQCOT 

TTACCGATAACGATTTGCAATGGTTGTACTTATTGCAA^ 

ACGCCTTTCCTTGTTTGAAAATCAAATAG 

TCGGTAAAAGCT6TAAATTTGCCGCAAATTCOT 

S tf^AKA^GISWiTLRF^^ 
SVKAVMLPQIPEEA 

SEQ ID 4133 

TTOX^ACTATTTGCC^^ 
TTCGCGGTTKJGGCAGCTTCGATTTGAACCA^ 
ATTGCGCGAGCGGGTC6AOTGGOTTAAAA 
SEQ ID 4134 

TTCGCATTGGC6GCG6CATT6ACG6AGTT^^ 

cTTCGTCtmTcrrrcAg^^ 

AGATTTCACGGAACOTTCAACC^^ 

AATACGACGGCTTCGACTTCTTCTCCTr^^ 

ATACGOCGAAATCGGTAATGGATTTAACCGCACCGGA 

GCGGOTTCGTCGATTTCCAAAAT^ 

OTTGTTCGATTTCGACGMT^ 

TACCmGGW^CAOT^m^ 

CAGCAGGCCCm^lOTXXraGGTCMC^ 

AAAGTGGOTCCAAAACGGCGCGGCGGGAAACAACGA^ 

(XJTCGA(XAAA(MCCCGG^ 

AGCGATCCAATCGGCTGCGOTTTTOGCTT^ 

TGAGCX3TTTTTGMTTCAGCT 

RFHCTFNQRTFVVHADEVAERIAFQ 
AAFVDFQNHDFDmQLYDFARVYVFVGPV^ 
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QQAVDAAQVNECAWGDIPDDAFDDGAFLQVFQQGFAFFT^ 



oxiaramsKxrrccAaTCcaTCGca 

GAACATCCCGATTCTCAGOTAC^^ 
ATCCGGTCAGX 

KKRVAKGRHFAFGPAIiWSGWIiraANDSVWRAVNWWIiTHFVB 



CC(^TC(XGCCXmATGCGMiaC^ 
SEQ ID 4142



TKLKEGDEVEAVIVTVDRKl^IKI^VKAKD^ 

TCAGCTGTTGGAAGAteGCTTT&COTGCAAGAftA 

SEQ ID 4144 

VPSBIOTMJ'CQGLQTPTNraTPHRCTEKSYICI^ 

SEQ ID 4146 

VGVCSPWHGRRVNWSDGTMKFBLSEftOCiaUi'nJCI 



CMGGCGTCGAATGGCRCGATGAAGAAAAOGWTO 
AAGTCXTI^TTGGTATCGTAAAOT 

SEQ ID 4148 
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SEQ ID 4149 

ATGTOTCCCOTimATACCM 
AAAACGGCGGCGGCGCGTATGTGTTTTTATO^ 
CGGCGGCCCGATGMCGGCAAACCTGTl^^ 
TATATCGTTAATATTATTGGAGGAAATTT^ 
TGCTTTTTGTCGCCGTGMCCMTGGATTTTO 
CGTTACCXTTTCOjGGCGCAA!^ 
TTCGGCGTGATGATTACOTATCCAGCTATCTGGATAAAAA 
CGTCGCTOTCAGCTTCGGCXn^G 
GCTCCTGATTGCXXXXjTTGACAAC^^ 
GTTTOGGCAACATCCCXiT^ 
CGCTCGGTTOXXATOT 
GCTGGTGATTCTOCTGATTTTCGTCA^ 
SEQ ID 4150

KSSPSNTNRQWSSRLTYILTVAGAW 
YIVNIIGGNLNISSPVIXSVVTKGFFTEHIENS^ 
FGVIOTLSSYLDKNENLVQTAVITH^ 
VFWPSILSYGPWKDVSVFGKNIFDAFm 
SEQ ID 4151 

ATGATGATGAAAGAGAGGGMTTTGAAATAGAAAACCATCTATA^ 
TTAGACAAGCGGATTATATCGTAAAC 

SEQ ID 4152 

MMHKEREFEIENHLYKGRSIIXHTOT^ 

SEQ ID 4153

ATGTGCGAATCGCCGGCXM^AAGGTGTTGCACGCGGT^ 
ACX3GCATCGGCGGCACAMCGGTTTCX3TTTTO 
SEQ ID 4154 

MCESPAVKVLHAVAGGSNHAFDLVW^ 

SEQ ID 4155 

TTGGAAGCI^TTCGGATACGGCACATGACAGGAC^^ 

GCAAGC^GTGC(^TGGCTGATTAGGGGCGGCTACGTC^^ 

ACTAACCGTC(HTCCC^ 

AACATCGATATGCAGGCGGTCGGCAGGCTGGATGCGGATAC^ 

CACCCTKTPGCTGACC^ 
CPCAAACCC^QGAATGGAAATTTAK^ 

SEQ ID 4156

LEADSOTAHEOTSVLSFSSra 

NIDMQAVGRLDAITITGVLLni^KLNHNL 

LKPGKJKFIBCPKF 

SEQ ID 4157 

ATGAACCttXHTTGGATTATTACCGGC^^ 

CGCCCACCGGTTCGGTCAAGGTCGAIKXXK^ 

MTCGMCGCGGTACGATGGCGTTGCTGTTGAGTTATO 

TTGGCAGGCATEACGCTGCAACTTGCCAAC^^ 

TGATTAGTGCAAAGGTCAAAGAGCGGGGGACGGCGGCC^ 

GGTCATCACOGGXXXXjTCG^^ 

GGACAAAIOjGCCTGAC(XjT0CC0GT^^ 

SEQ ID 4158 

MNPVWIITGKEAIU)SIjRNRWVI^ 

LAGITLQIiANGGFDIAAHKFFAIiIiIAASVILG^ 
GQIGLTVPVLLTAQVLWVIIPLVIiAAGIFRKRRI 

SEQ ID 4159 

aitctccaaactcaamccatcgccc^ 

cmtactcgacctgcca(xx3acaaaacaatatto^ 

attcatcgcaagccgcagctatitcgacagggtcgtc^^ 

atcgaaagcgcgttcgtcaccaaagcc^ 

acgaogtttacgcagctaccgawkxx^ 

CGTCAACCGCGCCCGCGACCAAGGGCTOGAACC^ 

TCTTTC(mTGAATATCA(m^ 

TCmXTimTCCl^TTCAAra 

CCTGTTTTCATGGGAAGTCTATACGCC^^ 

AACGaGGAOKa^TCCTTG^ 

GCACGGTGMCX3TCAGCA3TOXXGAATCCAA 

CGAACCCGCCOTXmCAGCCGCA(^ 

GACGAACTCAT(X$GCTTGTTGCC(Xs^^ 

O^OTGTTCAACAmCCAAA^ 
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GCCCAAAOCCOTA! 
(XATCMTCCGGGACAf3AGGGTCAAACTGATTGGAAGC 

SEQ ID 4160 



S ^°5ta1iw^^ 

TOSMGCmTrcAB^^ 

SEQ ID 4162 

HHPYCVRPKQIACRYFIPHRISVKmiffiPFPHPHKVPVKL^ 

GTGGGACGGGTOTGGG(K6GCAAMCCCTTGCGGCGGGTGCGQCAAGGCITGCGGM 
VHLPEKRFCMIiPniKRflUUMGFBM 

TC^GAMCTTrGGG8CAGC0CGGCACGTTTTATTGCACGA^^ 

TGGSTLRDFVDSIX3KSGYPQQEYTVyGRHNQPCIilCGGLWKETIiGQ 

SEQ ID 4167 

CTGCCTCCGGIGCGGCIGG'nTCGTTG 

CTCCGAAaHTTCAAOlACAAM 

SEQ ID 4168 

VppvEEPGCEEHFQAAEHVIiIiHEIiSEIGHKTVSIXSILSVCR^ 



GAAWXMCCGTGCCGTTGTCQ^GGCGATGCGGATCTO 

SEQ ID 4170 

ESIRAWAGBMJVAV 



SEQ ID 4172 

IlPSGCCSHSAPVPA6PPSlJnwiU3i:riw*ii*v W1 .. iM .«.^* 1 t"^.x „ Tmrt m ™ n rauon D T;iVkr<»nnrc,D9 *T WfJ 

HPIMPAAPELFRUILPH^ 
HALPRHPTOTGDAHPPHRRIPRLVHRRHRDYTIR 
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SEQ ID 4173 

ATGGCGGGCGGTCAACTGGTGGCTGACCCATCCGG^ 
CGCGCATCGATGCGGACCK^jGCGGCGCACT^ 
CATATTGCGTAAAGAAATCTTTGCCTCCCC^ 
GGCATTGTCt^TTCCOTGCT^ 
TGACGCGCG&XSAftjT^^ 
GCTCCTCX^CGCGTTTTATTCAG 
SEQ ID 4174 

MG(^LVADPSGEMTAWVGLT<X;iGSGKSAMQYFADLGVPRIDADAAAH 
GIWPLLTEKRQFISLIRRVLTISAPLEKKIGRVM^ 

SEQ ID 4175 

TK?ICG6AATGTGGCGGTATGATTCGCOT 
KaAGOXXmAGCCGCAG^ 
SEQ ID 4176 

LSE0G<3MIRFCGCIiKMVTGVPQKKCRK^ 

SEQ ID 4177 

ATCCTTGCCATACTGTTGTCTGCAATAC^^ 

ACAGAAAATCC^TGCCGGGCGGCTTATGAGGCT^^ 

SEQ ID 4178 

MLAILLSHLGLVSTTAAAGTSEPAHRH^^ 
SEQ ID 4179 
ATGAGTCTGAAACTGOWAGTT^ 
ATACTGGGACTGGTATCAACAACTGCCGC^ 
AACGCCBTaXttE^ 

m 

SEQ ID 4180 

MSLKIiPTVCPFHPSILYW^^ 

I» . . 

SEQ ID 4181

TTGITAMTTTTGCAMGTATGATTTTTGC^ 

TTTTGTCG6CTGAA jj 
SEQ ID 4182 

LLNFAKYDFCHAAADKFHFLTDW^ 

SEQ ID 4183 

TTCXIAAMTTTMCAATTOGCAGGGGCGGAAAA 

SEQ ID 4184 

LQNLTIRRGGKPEVFLFRRKILI SSPCSRSRSKGKKPTRFLSVKNYQIKQML 

SEQ ID 4185 

ATGAATACACCGACTGATTTAAAAGTAACCAAACGAGACGGAAGAT^^ 

CGCAGOTCGAOTGAAATCG^ 

CCmCCGCACGTTTGGCGATTT^ 

CTTGAGGATTACAGOmAAGAATTTG^ 

GCGTTACCCGCXIAGATTTACGAAACGCCGCAGTTOT 

TKX^OTTCAAAGTATOTTGCCC^^ 

ACCAGCGCGATTGTGAAATACGITOXXM 

TCTTTAAGATGTTTCAGGCG6C6GTCAAATCTTO 

CAA(XX3GGGCCTGGAAGACAACCGTATCCG(^ 

TCGGG6TT6TACGAA6C6TTTTTTGCCGACCAAGA(^^ 

TGATGCAGGAGCGTG(£GGGACCGGGCGCATCT^ 

CGC(X!TGCCGACCAAACCGCTGGA(^CATCAACG^ 

GATTTGACXXH'GCGCGCACTC^ 

ATCTGGCGAAAAACGGTGTCCGCTACAGCGAOT 

GCCGA^rc(n^AAATACGGCCTGCGCAACOT 
TCAAAGCATCGAAAGACGGCAm^ 
MTGCaAAAAOTCGTCGara 
GGCGTCAAAACOCTGTACTACCACAACAC^^ 
SEQ ID 4186 
MNTPTDLKVTKRDGRlEAIDLDKIHRWr^^ 
LEDYSREEFDELNAHD^ 
TSAIVKYVSQRAGIGIHAGRIRGIJ)SEIR^ 
S(&YEAFFADQDEFERLYTKYEQDPNI^^ 
DLTVRALDALU^ 
ADIVKYGLRNSTLTAIflPSm^ 
GVKTLYYHNTRIX3ADDTQTO 
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GTGGTACGOCGATGCOGraCTOGCCGGCAOGGCMCCCTGCTT^^ 

TQGGCG<^TGGGCAGACGGGAMTATflC3GTTT^^ 

SEQ ID 4190

MAESRQTHvQVKCPTCQTAV^ 



CCGACCTTGCCTATTimC^^ 

RI^PRTI^SRV^PIiSDGISmQCQAEE^ 

AMTAAAAATTAATAGMAAGTCGAC 

SEQ ID 4194

HJOTSFLVEYKmFISVUiVFWUWIimiQHVLYGEIiKINRKVD 

GACCGT. 
TTTQGTTTCGA' 



ETVSFSETIHSRSYTHIIRNI 
IIKLIARDEALHLTGTQHMUnJiRSGVDD 



CGGTrTGGCGGGGGAGGATGAGGGGCTGTCTGACAb-riWITCAAX 

SEQ ID 4198

MSRIG 

CQGCGAAACCTTTTACCGACTCATCAGGCATAA 
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SEQ1D 4202 

HGRAN 



GLVNEFPIISIEIXaMDENDWEGWKLLTBKIj^ 
SDRMAKYNQliLRIEEELAEAAYYPGKAAFYQLGR 

SEQ ID 4203 

ATOATGCAfl^^ 

GOiAAATTGCCGGCGTATTGTTCGACATOT 

ATTGTOX3(XXOTGCXK^ 

TCCAGCATGACTTTCATGGCAACCXXX^^ 

SEQ ID 4204 

MQTFRKISLYMTLWLC^IMAGYIAAm™ 

SSMTPMATALLSAVLSWRLSGKEAV 
SEQ ID 4205 

TCTTCGGACTGGTGATGCTCACCCA^^ 

CGGCACGGCAGCCCTGCTOTCCTO 

CACGCAAATCAGCCmTCaAA^ 

SEQ ID 4206 

PKTDLNFHFAAAIiGPTOLATVA^ 
HANQPYQTAKNRRD*TAIQISPFSLRV*NHH 

SEQ ID 4207 

ATGGATATTAAMTC1AACGACATCACCCTCGGCM 

TTACCCGCAAACTGGGCATCCCCTATATC^ 

CAAAGCAGAGTIC(^CATCCCXX3TCATTACCGACCT 

CTGGa3GCAATGO!GGAAAC^ • 

TATGCGAACGCGGCAGCAGCTTCGGCTACGAC^^ 

AACCCGCGATGCC(X3TTCTGCCGCATCC^ 

SEQ ID 4208

MDIKD©ITLGMSPFVLFGGINVLEDIJ)STLQTC^^ 

vaametgnvinikkpqflspsqmkniVekf^ 

IMCDGPSALPIillMNFIJRIKA™ 

SEQ ID 4209 

ATGTTCCGTACTATACTTGGCGGAAAAATCCACCGCGCCA^ 
ACXjAAAAAGTCGCCATCGTCMCMC^CAAC 
AGGCGACATCGTCATCATTAM 
CCGCCGCACACCGTACTG 

SEQ ID 4210 

^TILGGKIHRATVTEADLNYVGSITV^ 

PPETVL 

SEQ ID 4211 

TTCGATGCATTTGCCGCOTCCT^^ 

AAACAAGCCGTGATGCXSGCAAAATTTGT^ 

MTCCTTGCCGTCAAACTCCAIOT 

SEQ ID 4212 

LDAFAAFLSEIiEKADDAARAALTEHiKI^^ 

SEQ ID 4213 

TTG^TATCTTCGGTGTGAAAGTfljAT 

ACAtrPTGTTGGAAATAGGGGTGAAAMrcGAC^ 

SEQ ID 4214 

LGIFCWKVDIAAVPFQIFRQCGACSIVGFFQFGQ 

SEQ ID 4215

ATGTCGATITTTCACCCCTATTT^ 

SEQ ID 4216 

MSIPHFYFQQIiSTBGFDGEEDVIMELTVKQILVEVFL 

SEQ ID 4217

GTGOGCCCCGCX^AAAAACGGCACGGGCGGCAGCTT^^ 

AATCC 

SEQ ID 4218

VRPAKNGTGGSFCIAVATCTNNT^ 

SEQ ID 4219

CAGTCGCATCCAGCATTTACGGTTO 
AATGGGGGTTACAOCCGTTACCTACCGGTC^^ 
CcmTTOX^AAAOXKTGA 
OOTAAAAACAAmTMTAA^ 
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catcaa^maaaacaticai^^ 

TCGATATTTTGAAGQOG 

HjlEKJOmENRMVRLKANASIfAIAKDK^^ 



CTOTGATI^CAOTOCOGCAATCAKXTC^^ 
AAACCCOTATCAGGCGATGAAG^^ 

S ^SraiKA^ 
UJNSPAISIVKWAQAQPPAVUW^ 

>CQIIItG0CT0MMC0G<3GCA6ACITTQGMWK9^ 



ACCCTOGCATGGCOT 
CGAAM 



ATCGAGGKamimTACXOTCAmOCCTACTAAAAA^ 



GIJiBDIiAEIMGDPVVHEroiGHyMVTffill/METSEBMIiIiBYAGE&QI' 

IEVEDfrVKTVIXTLQGKELPKGH 

^ ^TCTCCITTMAAAAGCA(^^ 
TTAGAGmAnTICTAAACmACCACAAAGTGCACCGGGC^^ 
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SEQ ID 4226 

MCFKKMVSMVIXX^SVaiGFLCRFPLIG^ 

SEQ ID 4227 

ATGGACACGOnXSCTTTT^^ 

TGCTGTTGCCCTTCX!TCGACAAAAM 

AGGCGCXjCTGGTGTTGCAOTTCrGCAA 

GGCGCX5GCGATT(XIACGCGAAATGCAGTCCT^^ 

(XAMGAGCGCAAAAACGGCAAACTCGCCAAAGTOTO 

CACGGCAmTO^TAaTCCTTGACMCCTGM 

GCTACGTCTGCCATTCX^GTTTCGCCCGT^ 

CTTGGCXXXXaMGGGTTTTCAACGCG^ 

GTCCGCGCX^CCGCCGAAGAAAAAGMGCAGGCAn 

AAGACATCCGCAGCTTCCTC£A& 

CACAGCCTTCCTGCTGAACGACATCCGCAMGAT^ 

SEQ ID 4228 

MDTAAFLKHIESAFmFSIXSIDmYLPEDKWLALK^ 
GAAIAREMQSCYEYTDEQTI^ 
HGIMEHLDNLNRYVKNDIRFV^^ 
VRATAEEKEAGIKLDKNQTIjLD^^ 
SEQ ID 4229 

GCOTCCCACTTACCCTATTT^^ 
CAAA 

SEQ ID 4230 

LRIAGKyGGRQISGGAKGHARHSAKPPIAAETAPRASPU!RFPTW 

SEQ ID 4231 

ATGGAG6CCCGCACACCCCG6(mATGAA0CCC^ 

CCAATGCAAATGTCOXXSn^ 

(XX»COCTCCACATC(XX^<^ 

GCCGCGCCGGGTTTTTCCAAAAAGAATIXK^ 

GCCTTCGAGCTACCGGiU^TGGGCG(3TG^ 

ACGGGC6GTAAAATGTT6GTGTCG6GGACGTT6GCGCC(^ 

amCAAACGCTCGCAGATGATGAAA^ 

CCGGATGTATGTMCGGCCK5T(^GGGACATTGCCAATO 

SEQ ID 4232 

MEARTPRANEAQAPRADEMKKBSRP^^ 

KYGWAELIVAIIGIETNYGKNTGSFRVADAIATLGFDYPR^ 

TGGKMLVSATLAPGADVQAIIGEKTALTRTVADLKAY 

SEQ ID 4233 

GTGGCGCX^GAAAATGCCGAGTTTGATGAGTGCCGGATAAATGTCT . 

SEQ ID 4234 

VARH2lAEFDH3lINVyUlEPDVKTPAEAGQ 

SEQ ID 4235 

AT6ATGCTTTATTTTGTTCG6CCATACM 
CAiraSCAGCCGTCGTCCACGT^^ 
CAAA(XX3ATTACGATTGAAG6GCCTGCCGACC(^ 
TCCGGTATGAGCCTGCX^GCMTGGATGCCX3GT 
CCGATGCGATGGTGCGCGAGAATAAAATCGTCGGCGATGCGACC^^ 
CAAAGGGCGGGACGGCATTTTTTOCMTACXIAGCACGCA^^ 
AATATTTCCGTGGGCAACAATATGGGCTATGTGC^ 
SEQ ID 4236 

MMLYFVRPYRTKQRDTMHTSALRIWLKAVLII^ 

SGMSLPAMDAGIYLEKAAPRALVE^ 

MSVGHNMGWUffSERI^VFENIAVGSRD 

SEQ ID 4237 

ATGGCAGATATGAAGAAAAITACCCCTCAAAACCTGCGTC^ 

AAAAATGTGCXKX3GGAGCGTTTCGACCTGATO 

ACTCATCAAACTCGGCATm 

OTTTCCATTCCGAAAACGA^ 

(mT(XX»AT(XX3CGTTTG^ 

CGCCCTCCAACGCGAAACC^^ 

CGCAAAGGGACX3(XXXXXXjGCTC^^ 

gcaaccgcctgaccatcgc(xtgatgaaot 
aagccaccacwtc^c^ 
acactcgacctcagwxxittot 
ocagottgocgaacaggtcgatctc^^ 




ATCCTACC 



AAAGCAATCCTTGCATACTTTTCCTWXJGOHX^^ 



CGCTG0CGGACGTTCGCGAC6GCCTA 



DEABDADIB - _ 

SEQ ID 4244 

VMXX^GHGPNVSYGKiailiLIIAYPD^^ 

GASASIPDVFSRYAARSKjFFKTR 
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SEQ ID 4247 _ _ 

CC6ACTT0CTCAW^GCAAATGGAA!TGGC^ 

SEQ ID 4248 

. MMSSTINDAYRTLKNPIDRM^ 

SEQ ID 4249 

ATGGAAACGGGCGGCCAAGGCGCGGTAGGTTTGT^^ 
TTTGGCAGAAAACTGTTTTTGCCTIA 
SEQ ID 4250 

METGGQGAVGLFQVFGADIESGFNPl^ 
SEQ ID 4251 

ACTTCTGTGCOSTC^^ 

CCGAGCAGCCGCTGGTTTOACACCC^ 

TACCX30CCCXX^CGGATC0GAAGC6QCGGQCG6T^^ 

SEQ ID 4252 

LILSRRFYAIJfVEAL^ 
YRPGRIRSGGRFGRHCGraPSEGLYLFS 

SEQ ID 4253 

ATGACGGGATCX^GGTTTCCGCGC^ 
GTGCTGCCGGAATCTCAAGGGATTCTC^^ 

SEQ ID 4254 

KPGCRFPRGRIRHSRFRRNVGFWF^ 

SEQ ID 4255 

GTGMTATCGGAATAGTTGACATAGTTGAGCATGATGCOT 

ACGGAAATATTGCCXJCTGACTTCGCTGTC^^ 

OTGCCCTTTGGAMTGTCGTTGCCGACGA^ 

CATCGCATCGGCAGAACCATGMGGTATACXKXX^CGAAra 

CTCATACC66AACGGGTAACGGTCAGGTTGTO6AG06TTACGT^^ 

TMTCGGTTTGTCGATATAAAGTTOGTTO 

GGCTGCCGATGCCGTCTGAAAAATGCTGCCGGCGGCAAGTAT^^ 

TAAAGCATCATTCAAATGTGCCTGTTTTTAT 

SEQ ID 4256 

VNIGIVDIVEHDALIAAAIX^IVEHFBPFGK^ 

GOICRIiKNAAGGKYQHGIjQP^ 

SEQ ID 4257 

ATGAG(X3TAGGTTTGCTGAGGATTCTGGTTCAAAACCA 
GTGTCATTTCXJCCCMGTCGCTTGCGG^ 
GGTGGAG ITCCACTGTGTGCCGGTTTTCXXj 
TTGGTOTTGTCX^GGATGACC^GTTGGCGGGT^^ 
TCGACMCGACX^GGCA(^GAC<MCCC^^ 
<X3GTTWXX3TMGGACGGGC^^ 
QGGACGGTAGGATGCAGCTGAOTTTCAAAAGG 
OTCTTTGAACATCGACCAGCTCGGTTT^ 
GTCGCTCTATACCTGTTTGAATATO 
. CTGAOTTTGCCGCTGCITr^ 
TC(?K?ITOCCACACTG^ 
a(£CAGGCTGTGTTCGAGCTQCAAACAG 
TGCGACCGTTGCCGGGGGCAGGGTTATMGGGGC^ 
TTGOCTATMGGAGGCTATGGTGGATTTGCGCCGGGCC^ 

SEQ ID 4258 

MSVGLLRILVQNQVVTVERAS 
LVIVEIHJQIAGLLDWVGSRSTSL^ 

LTPAAALKSmQDPDIIMVGEIIU)LETADIAim 
CDRCRGQGYKGRAGVYEVMPISEE^^ 

SEQ ID 4259 

M^CCGTCmACCa^^ 
AAGGCTTGGGCCTACGOTGGGTGTAAAAft 
TTATATCGACCOGAAAAGCTTGGTTTAI^^ 
GAGAGCTTCCACGTT 

SEQ ID 4260 

MPSEPPAASDPSGAVQQGRN^ 

ESPHV 
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SEQ ID 4261 

ATG(X&TACAGCGATAAA(^ 
TGCGCCTGCAAATOAAGTGAACG^ 
TCP6GATGACGC(KITGGCAATCAAAAA^ 
CGCAAACGTCAGGAAMCAGA 

SEQ ID 4262 

MAYSDKVIIHYENPRMVGTFI^ 



SEQ ID 4263 

GTGGATTTTTACCGGCGGCMTTCCAGTT^^ 

CGTCGTTTTTGTCGAAAGTGCCGA^ 

■> 4264 - Q I,mmnKWmm^^ 



SEQ ID 4265 

TTG<&GAAGCTGA^ 

GMCATTGCGCCTGTTTATGACACAATTGGC^ 

CTTATTGGCACGCOTMTTATTAGACAAira 

SEQ ID 4266 

S a™g^ 

GCCCAAAATCTGCTTTTGCAACCCATTACA6GCMTTO 

SEQ ID 4268 

MSKYGGEDVHHNIWRH^ 

SEQ ID 4269 

ATGGAGAAmAGAAAmtXCGACAAACTCAATATC 

SEQ ID 4270 

MENLEISKQTQYA^LLnGK'IWIEKEIRKIKQEF j 

S ATCGGCTO^ 

AATCATTTTTGTTTATCMGGAGGACATCAOT^ 
SEQ ID 4272 

MflSALRIjMSNKGC^^ 

S A^rar^Acccca 

ACGCATTCGGCTGGACGGCAGMGAAGCCGTCG^ 
CGCCATCAAAGGTGCGGCAAAOTCTA 
ACCTACCTC<XXX3TGCAGGAAA 



ACCTACXJTCGGCGTGCAGGAAAACGGTTI^ * ^u^-iww**™-"^*" 

ACATTCCCGCCATCGGCGAAATCTGCCGCGA^ 
GTCCGCGCACAAAGTGTACGGCCCCAMGGCATCGGCGCG^ 
AaTTGCCGACCCATCmTCGTCGGaTGGGC^ 
GCATCGAAGMGTCTATATCAACGGCGACCTCGAACAOT 
(^G(X3GATCCGCCTG(^CCTCCXjCCTC 
GAAGAAGTGCAATTCGCGGCAGAACTGATCAAATCC^ 
A* 

SEQ ID 4274 

MTVKTPVYLDYAATTPVDKK^ 
mGVQENGIJDLEEIJaA^^ 
TLPraQIVGMGEAPRIAKEELAQDTAHyiiKIjR^ 
EEVQFAAELIKSKIGKLRELSPLWEMFKDGIDLNSIEWAAH 

SEQ ID 4275 

ATGCAAACCGTCAAACCTCGCCAACAGGAATC6M(^^ 

SEQ ID 4276 

SEQ ID 4277 

ATCGCGTCTGAAGGCAACAGGTCT^ 

SEQ ID 4278 



SEQ ID 4279 

GTGTCGGATAAGGGTTGGAAAAGACATCtf^ 

GCCATTCAAACGTCCGGCAGACA 

SEQ ID 4280 

VSDKGYiKRHPBSGLLKKILNY^^ 

SEQ ID 4281
(XJGTTTGACACCCOTITCT^ 
ACIXTTCAKSUSCO^^
CTGC^TTTGATCATGCGGATTTGC^ 
TTAAAGCGGTTOGGACATCXTTATOT 
TTTGGGACTGei^ 

SEQ ID 4282 

RFDTVFFQKGICFQAGFFLPQTIjFQLVS^ 

LQFDHADLPIJKLTQCAFGNRFRT^ 

NCGSGLPLPHGIPCPARRQ 
SEQ ID 4283

TTGGOGATGAACGCGCAAACCGGCGOCGTCAAACTCA^ 

GCCT(XXKXMGCCCGGCGGC(m^ 

CKmCCACG(£GCG(XX?rc^ 

GGCAGCXGCGTOGTCCAATTTA^^ 
SEQ ID 4284 



SEQ ID 4285

TTGGTGGTCAGTCTCATGGTGGTGGTTGCCCGT^^ 

GGCGGTATAGTCI^TTTTGCTTTATTO 

T 

SEQ ID 4286

LWSUJVVVMVGFPIVGGrVQYIiSVSVKYGGAVArc 
SEQ ID 4287 

ATGAAACGT&TTTGA^ 

GGGAGMTACTTCGGATTTCAAAGATATCCGCTTCC^ 

G(XGACGGGTTTTAOTGCATGGCACACGCAGACGG 

GCCGAGAACACCA(KX5CGCCGTTTTGGTTra 

ATTTGCAGGTTTTGGGTCAAra 

GAACACGGAACGCCGCACGTOXIGCAATATCGTCGGAC^ 

GCGC(^TTAAAGATTOTGGGGCGGCMGC™ 

GCCAGCTCGACGATACTGTGTCCGCEATCA^^ 

GTGGGCTTTGGGTGCGAMGGCACGATGATAGGTC^ 

ATGGCGTTTACGGGACACCGCGATATTCAAGATGTC<^ 

SEQ ID 4288

KKRDLSKMTQEDLHRVAKRKMPRM^^ 
AENTSAPFWQLYVMRDREFMmiKRAKDAKCSALVLTAD^ 
AMKDIA^GGmiKGIMEPEDAEKAAKSGADALWS^ 
MAFTGHRDIQDVDASILRSKDWGRETV 

SEQ ID 4289 

ATGGCGAATMTAAAACGTTGTTTGATTGGGTGGAAG 
TTTTTGAATGGCAAAAATCGGCTTTAGAAAAT^^ 
TGCAGGCAAAACGATGATGATAGCGGCATTGATTCTGTATTAra 
GACCCGACGCATCCAAAATTTTTATra 
GCATTCAAAAACTATATMCGATATTCACACOGA 
CAAAGGCAAAAAACAAGGCGAATTAGATTTAGAAAAGGAAATGAACGAGOGTA^ 
GGCMTOCGGCCAAMTGTGCTGTTGGMTTTACCGCCA 
MGGTTATACCAAAGAAATCAATITGGTATCCGG^ 
TTTCAAGCOTTGATGTTGTmGAAGTAAGAC^ 
TCAACAAGCTTAAGO&TAGCM^ 
AAAATTATCAAAAACATAATGTCATTATT^^ 
CCGTGCCATTTTTACGGTGGATAGATO 
GCTGCCGCCACCGTATCGGAAMGCAGTTGATTGG 
GTATCTTGGAAGAATTGTTTTATTACACGCAC^ 
• MCATTTMGCICAAATCTGMTTTGCAGAT^ 
MTOCGCAAACGCOTCATTCCAAGTTCAOT 
CCATAAAAATGAGTGAAATGGAGCGGCATATTTTCAATAAAGCA^ 
CMTGMTTGCAAMTAAATTGTTCAMGACTGGCAGAT^^ 
GAAAAACATCTGAATGAAA(nK3ATATACXX3TTTATCGGTAC 
CTGCTATT<mC<mAATG^^ 

GAOTTTTGCATTACCAAATTTTCATTGM 
AAAAmTAamxamTCGTO»TC^ 
SEQ ID 4290 
MANMTLFDWVEDRRSMLEEMEOT 
DPTHPKFLFTEnLQGDWIPIRKVETFSQYSIX5 
GMPCK}NVUiEPTAWE«AEVQQmm 
STSLSDSNNANEQGKTRTEQAIi!^ 
MAWSFJCQLIGRGVRYTOAFEGXgPNK^^



EKHLNESDIPFIGTKEFTPKKU^ 
DPUffQIFIHKGGHLVEHDSWKEAFIiKSITVB^^ 
SEQ ID 4291

C*mTftCOmCCGCATTCGmGACG<a 
MTTATCAAAGGCMTAATCTGATTOXATCCAT^^ 

<a T »AAmTCAm^^ 

TMAAGmTATOGATISAJAC^^ 
TTTATTCAWTMTAGTAMTAGCCAAACAGAAAATAMGAMTAAATGCT 

AaumTmmcA^pmjrAAAca 



CCAACGAAAACGACATCGTTCTAGACTACCAOTG<^ 

CAACAAATTTTGGCGTG<X3AAMTK^GACGGCATQAM 

CTGAAmt^lOTT^mGACGAACAW^ 

TCATGATGATAAAGCGTOAAGCG5TGCGTTCI&TC^ 

SEQ ID 4292 



kgkvbliyidppvotetdsfa™ 

FIDIiIYNSOTEKKEINGOTIiKEIDIIiLKKISPIS^^ 



aNDDDKALSRATYQSVKHQAEKKDGE 



TOOCGCGGW&B^^ 

TGIGaOTCTCGGTTTGG®^ 

CGCCCGATCTOrTAraQCMT^ 

CTCCAAGGGCGTATGGTCCAGGATtnCTTCGATGCTGTCGGTCAGOT 

GCTTG(mHGCAOTATreCTTGm 

1TMTGAGCGGGTKKGATTCC<^CGATAACAAAAAACAA 
SEQ ID 4294



ELI/nJEIjYSD*GNDNAVKIVIFSI^QKNAAKRH PSENKKQ 
SEQ ID 4295 

SEQ ID 4297 

ATGAGGATTM 

SEQ ID 4298 

MRIYNPTOHFNF TS p ftPA TTOKNM(^RRlQfflFGQKHPVGFGGKNRPVHYRPVIE 

TTTATTTG 
SEQ ID 4300



S TKOGG^TATCA^ 

SEQ ID 4302 

LRKYQHIIQTQPVFLTAAQAEDAAVFIRRKAPIEHISF 

GGCATACCGGCGTGCTCGATCCTITGGCMCCGGAOT 
O^CCGCCGGCTTTACCAT
ACCGTTTTAAACGATTAT 
T(£CGGAATATCAAAAGGAAATATGOCGTAT(^ 

SEQ ID 4304 

IfTNKPAKRPVNGVLLLDKPEGLS SNTALQKARIUiFHAEKAQfl'G^ 

PPMFSALKHBGKPLYEYARKGmEIKF^ 

TVLNDYAVHHLQCXjQRPRFEEDIJ 

SEQ ID 4305 

TTGMGACTGATACTCCCAGAATGAACAATCTG^ 

GAGATACTTTCCACATCCG^ 

TGATGGGGCGGAACATGGCTATMCGCTATGTTCTGC^ 

TACAATATAGACTACGAAGATGAATATGAAGATTTCG^^ 

GCATT<^TCTAATTGATGAAAAACACTCTGTTO 

SEQ ID 4306 

yCTTTTARMNNLIPKHLAAYAHSDNLQIEGGHRCFSLSCQG 
Y^DYEDEYHDFVNSEGQVPLIDGRI I SFDSLKKNGFDMS IDLIDEKHSVREIMEELS 

SEQ ID 4307 

ATGAAAAAACCCCAACGCEGCTACGC^^ 
ACGAAGTCGAAATTACCCGCGATTACAGCCACTO 
ATTGTCCAAACGCATCAAGCTO^ 
GAAGAC 
SEQ ID 4308 
MHU^RG^QlHVKEQIMRBIiAELVRTGOT 

ED i, 
SEQ ID 4309 

TTGTTCATTCTGGCAGTATCAGTCTTCA^ 

TTGAACAGCTTGATGCGTTTGGACAATTO 

TGTAATCGCGGGTAATTTCGACTTCGTTGATAG^ 

GGCGTAGOCGCGTTGGGGTTTTTTC^^ l! 

SEQ ID 4310

LFILAVSVFNRFFRRHLOTGGQAm^ 
GVAALGPFHEYPQIHOQ^AFLPPGRLRKGVL 

SEQ ID 4311 

AATCAATTTCATAGTTTTAATAACGAAMTCTTGGra 
ACGCTTTCCGCCAAATCGTACTGTAAGCTC 

SEQ ID 4312 

nqfhsfmenlgv*shp*flqkenkmsi^ 

SEQ ID 4313 

GK3TGCGTO^3AAAC^ 
GCCCCGGCGGCAGAAATGCCGTC 

SEQ ID 4314 
VCVAETACAVRVAKYKNYII^YQiaTLKSRRFGYTO 

SEQ ID 4315 

ATGTCCAACGAAAACCGTACCTt3TTCCT^ 

TGTACGMGGCX^CAACGGCGGCACGCCTCCGGAA^ 

GGGGCAGGCGAAAMGG(XX^GGC6GTTG0GGTTTACAA0CATT 

a3ATCGGGTAAAA0X?PGTOGCG^ 

TTACCAAACTGTTCGGCAAATGCK&OT 

GTCCGGCGAAGGCGTGCAGCAAGCCTTGCTGAA^ 

ATCCTGTTTATCTGCGGCGGCGC(nTK^ 

CCAAGCTGTTCGGCATCGTCGAACCGGAAGATTT^ 

TATTTTAACCGAGCCGAAAAACGCTTTGGTCAAG^ 

CGCAAMCCGGCGCGCGO^XH^TTC 

AGGGCAGGGAACCGGMTTGGTGTTCGAGTCC 

SEQ ID 4316 

MSNmTCSFCGKSKSHAKHLIEGKNAnCDECVM 

GSGKTLLAQSLARIOiDVPFVM™ 

ILFICGGAFAGLEKVTRQRTBK^ 

RKTGARGIiRSIVERCLL^ 

SEQ ID 4317 

ATCEACAAGCTGTCGCACATTTO^ 

CGAATGAGGCGGCOTAAAGCTGGCAAGGAAATA^ 

CGGCGGACAGCCGAAATACAGTAAGGATTATGCACCGC^^ 

ATCGAGCCGATACAGGGCGAAAGCGGCATCCTGOXX^ 

TGGGG(^TACGGGCAGGCPGTTTGCCTATGAACATO 

TGCCGCCCOTTCCAACCGGGCACG^ 

AGTCAGGGGCAGAAACTTCAGACGGCATTGCTGGATT^ 
TTCGC

SQGQKIiQTl 

SEQ ID 4319 nATOWXACTO0 ^ 



GTGCGGTGOTTWGC6CTTGCP 



SEQ ID 43Z7 ~i™,vYYMRAGGATAATAAAATITITrTCCCATi 



CATi 
GCA« 
GTGGfTTC 
SEQ ID 4330

MNFSFVPLFLVTIJliIiGVVSNN^ 
IGVAFMGGIPVGPLIAADILSFVAGKV 

SEQ ID 4331

CAAAAAGCACXX3CACGTCTGTGTGCTACCM 

SEQ ID 4332 

MVIFLILSCVAESGBLCEGNK^ 

SEQ ID 4333

ATGA(XATGGATGCMTGCCCC^CCGCCGTACAA 

(XX^TCATCAGCGAATCGATGATTC^ 

CGCACGGATTCKXH'ATTGGGCGTGCGACGCTTCC^ 

CCCGCACAAGCCGAATTGACXOTGAACJ^^ 

AACTGCACACCCTACCCCTG 

SEQ ID 4334 

MTMDAMPQTAVQNENI^ 
PAQAELSLNKHKNRRUjTIETEHEVENKGNPV^ 

SEQ ID 4335 

ATGCTTGCCAAMTCCC^ 

ACGACACGCAGGCGCGGCTCGMCGGCASCAGAM(^(^TTGM 
TCCTGCCCAGGCGCAGGCAAACGGCMCAACG(n^ 
GCAGGCGCGOTATCGGCAACGCGCTGGCAM^ 
CCAGCAGGGATTTGAACACGCGCAGCCTCCGTGCAAAACAA^ 
CAGAAGGCGG 
SEQ ID 4336 

MLAMPIXSDnQPIFHPKRTHMKKIIASALIATFALTA^ 
AGAFIGNALANKFTRAGNQDSPVARRAI^^ 

SEQ ID 4337 

A^KXXXTPAACAAATACAAAAATGCCGTCTG 
GTTGTCGGGCGGTAACGCTGCGCCTGCGCCGOT 
GACGGGCGACGGGGCTGTCTTGGTTGCCCGCCXX?rc 
GTCCTAAATATACTGCTTCCCCMCra 

TGATTTTTTTCATGTGTGTCCTGTTCGGGTG^ 

SEQ ID 4338 

MPIiTNTKMPSESPSWIDAPGLPPSAB^^ 

WNiLLPVFAtmTmvcL^ 

SEQ ID 4339 

GTGCAAAACMCAGGCGGCGCAGGCGCAGCGTTACCG<XXX3ACM 

CGTCTGAAGGGCTTTCAGACGGCATTTTT^ 
CGTGCGTTGGC^GOXXSGCAmTC^ 

SEQ ID 4340 

VQNNRRRRRSVTARQFARPSITAVPLCAVSAEGGKre 

SEQ ID 4341 

TTGGGUUX3ATTGAAAAOT 
TTGACGGTTCGATA 
SEQ ID 4342 

IATIENFRKFKYLDSFLRGNDAEGFQ 

SEQ ID 4343 

ATGTCGGCCGCMTCAG(X3GCCCX^CAGGGATArcGCCC^ 
(XX}CCAGCCMGC(^CGAAMTArcGATAAAMCGGCGGAT 
CAAAAGAATGATACCX3AGGTO1AACCCGTGCTO 
ACCACCCCCAACAGAATC^GCGTAA 

SEQ ID 4344 

MSAAISGPTGIPPINATPMTVPISKFVTNTGC^ 
TONRISVTRNRGTKLKnGYDTLTKDSP^ 
SEQ ID 4345 

ATAACGAAAATCTTG(m , ATAATCaATCCA 

AAACAGCC^OCGCCGGCCC^^ 

GGCGCGTTOX^ACGTTTACCGTAACA^^ 

GCGGCGCAGCCGATGCGGAACGCGACATCCGOGGTO^^ 

ATTCCCCGACCTGMCAMGCCGTCAAACXK^ 

JKmCCGOXmTrCCC^^ 

GGATTAAAAACCTGACCMCGMGAAGCTGCAAAAATCATC^^ 

CCAAGTCATGCCTCyU^GCCGATGCCGAAA^ 

COXmACTOPTCGCC^ 
QCTGGCGCAA(XX3CACGCTGACGGA?
GCCGAAACAGGCGGAAGACTACGTCC 



GGCGCGGCGATCATi 
ACGCAAAGCCOSCGACTCm^ 



SEQ ID 4350

ULQSVSVQVAVDVmQAACDSPVTSHSFYKfPSU^V 



CTTCACCCTCTAIGCCM!CACa3l^ 



ACAATACG 




(XXTTTGTCGAGGCCTCCATCGCC 
APVEASIA 

SEQ ID 4356 

VGGRWSFSVPPQSPLQIIiKNGSDSIIiFIiU^mV 
SEQ ID 4360

OT)PSVWEAAPFGAT\nfflII^RYHNVHRAQFEELVPIiA^ 
CGSVn^VAlAKEMADDLNDHIHI£NDILFARVIiDS 

SEQ ID 4361 

ATG(XGTCTGAMTACGGmTin^ 

CGCGTATAGATTTCCAG^CTTCGCGCW 

TGCGTTCCGTATCOmG(^^ 

QiCGCOGCCGTCGOCGTATTTGGTAAAAAAAC^^ 

CGGCTTCTGGCGTTQGCGAGCGAAAQCGGCTGAT^^ 

CATATTGCCCTT(^A6ATTGCGGATGTGC^ 

GACGGCGGAATCGAT(XXXXX^GAA^ 

CCKJTCGAAAATA5tXGa 

SEQ ID 4362

MPSIJRFFRFRRHFHVSAQQFGRNAA^ 

HAAVAVFGKKTHAEIFIK^ 

IXKSimtaiAyDBSPaCLSGVFVQPS^ 

SEQ ID 4363

ATG(&TACGCAG<£AGTTATCACACAT^^ 

MGCGCXSCAAACGGCCAATCTGACCGACACCTT^ 

ATGCTGACCCTGTACTACTACGGGCAGATACACGGACTGCTGGTO 

GCCCGATTGCCGACCTGACCAAAACGCAGGTTTAC^ 

G<^CA(&TGGGC(mGCra^ 

CX^GCGATGCAGCACAAAATCAACCCGATTCCT^ 

SEQ ID 4364 

METQAVITHIVRWLDEIl&Q^ 
MLTLmGQIHGU;VTGTGNKVEDFGVG?Fm 
. RAMQHKINPI PVCRIPPBLLG 

SEQ ID 4365

TTGTTAATCCACTATAAACGCAAAATATC0C<7OTTO 

SEQ ID 4366

liLIHYKRiaSEHSRTIPTGSNRHOTSASPPEKKESGT 

SEQ ID 4367 

TTGAAGCCC^GGCGGCATCGTGCCGCCTAAATTGTO 
ACCCGCCCTCAATCTATTCAAATTGCACCGCC^^ 

SEQ ID 4368 

LKPKAASCRMCQQYYNTRHLAAS^ 

SEQ ID 4369 

ATGCCGCCTTGGGCTTCAAAACATATTCATATTlTn 
T(X5AAAA(XACCTTGCCGGCC?GTG(^ 
CAA«5GTGCGGCG(XX3C(?PTT^^ 
CTCAAAGGTTTGGGGCAGCTTTACGAAGOT 
TCGTTACCAGTCTGGCGGCGGCGGCTITOXX^ 
GATTGCCm(mTTAAAACCGCATCGCAACGCGC^ 
ATCGAAGCCI^CGATWX^ 
CmCCGCCMGAAraCT(X^CCGCC^^ 
CXXGC^CCAAAAACTTGACGAGCAGCGCGCTTCAATCGGC^^ 
OTGCMCATTG(XX3TCCTGATCTTTCCGm 
CCGCATTGCTCGAA^TTTOX^ 
amCOTCTCXIGAOGGCGAAA 
SEQ ID 4370 

MPPWASKHmmiJDSMSDFFHSDV^ 
r^GLGQLYKAYERIiKAQLQAEXjAFAAERKKPIj^^ 
IEACTIPWSGVGHETDFTLADFVAD^^ 
IflHCRBDVSraQDIV^ 

SEQ ID 4371 

TTGOTATAAATTTCTGTMCOT 

SEQ ID 4372 

LFlNFCNFEFQPSSINYRNVSIVIPAQAGIRFIiSFSHF 

SEQ ID 4373 

TTGAAACCCGGCTTGGCMTGCTTGCCTCACG^ 

TATCGGGTTGGGTT^ 
AGGGACTG6AITCCCGTTTTCG(XX5GAA 

SEQ ID 4374 

IjCPGLfiMlASRSKNFITSSPCQI»KNnSP]^TO 

SEQ ID 4375 

GTC(XXXXXXXy^TTAGCGGTGa 
TCACGACOGTAGGTTGGATTTTCAGIAT^^ 
GTCTTTCTCAAAGCACTCGACCTTTTCTCC
VFVKMDtfS 

CC«3mGMCC(XXmTCAGGCGW3T 
SEQ ID 4378



SEQ ID 4379

TTGATGGTICAGCGCGKTTGAGTGTGAACGATi 



SEQ ID 4380 



EVEPFKTAl ILKFKCDHWFVAFA 1 



55552^ 
GTCG 

lYUffiQGSEQVJNDVHffMiENADRGYFE^^ 
(XXOT3AAGTCGCCAACATC^^ 
REVRRIFBSQGIiTVSRLVRVAFGPK^

SEQ ID 4385

ATG^TTTCCACAATAtaGGA^ 
ACGAGCCGA0GGCQGT(^TGTGGC(X3CGG^ 
. CAAGGGGCCTTTCGCTCGAACTC^^ 

TOCACGTGCXGOHXX^CC^ 
CTCGATACGCGAGCGGTTGTCGCCT 

SEQ ID 4386 

MDLHNIREDYSKRELSEAIXMNPIEQFERWLDEAVRAEVNBP^ 

YFESRPYQSRLGAWASAQSOTPNKAVLVAKAAAVGUOT 

SEQ ID 4387 

TOAACAGGAAAAAACATATGCCCTCT^^ 
TATACCGAGGAATTATGATGCC&ACGGCG 

SEQ ID 4388

LNRKKHMPSENSIQVSIXjIFSRGKYGREVPCV^ 

SEQ ID 4389

ATGAACEACACCXSTTACCCTGCCCGACC^^ 

GCGGACAATQCAAAGCCGAACTGGCCAGCGGCGATATTCAAAl^ 

AAGCGATATCMCATCMCATCCCCXXjCTGCAAAGCC^ 

CCCAAACKXmmTTTOXrrTCT 

ACATCCGCAGGCGCGAAAACGGTGTCTGCTCGGAAAT^ 

CAGCGGCAAACCXXaTCATCCTGCCGGCM^ 

CATCMGATGATTTGTATGCCCTCGAAGAAGCACAAGGGT^ 

AAACTTGTTTTraCCGAC(^ 

SEQ ID 4390 

MNHTVTLPDQTTPAAGDGETVI^^ 
PKAPPFAFYAGQHDLLLPGNVSRSYSIANS^^ 

HQDDLYALEEAQGLACMiKKTCFTPVLSRPGBGWQG^ j 

SEQ ID 4391

ATGACTOGGAAACTGTMTCGGCTTGGAAATO 

AATGCGCGCTGCICGGGCGTACTGCCGGTAATGAACCG 

CTTCTMCCCGJiCTTO 

OTATGGMGAAGAOX^GGCAAATCC^ 

CCGCCGAA(KX:GTTGCCTACGCCAAA(XOT 

AGGCCAAGCGGMTTCGGCACGCGCCGCGAAATTAAAAAOT 

GTTC^GCAGGCGACGATGCTGTTOACCC^ 

CAGATGCCCAAATGCAAAAAG(X^AAGCAGAAATCCCCGAGCT^ 

OTGCAGGCTGCCTATTTTGAAGAAGCCGCC^ 

CCGAAATCATCGAAAAACACAGTTTGCAACAGATGACCG^ 
AAAAGCCCTGAATGCGATTOTGGGGCAAGTGATGAAGAC^ 
SEQ ID 4392 

MTWETVIGLEIHVQLNTKSH^ 
HMEEDAGKSVHEGL^TGIDI^CT 
V^ATMLFDPEKGETRVMRLKEDAHDYGYFPDPDLLPVIISW 
SPITAPRLAALVGKIADGTLSGKLAKKAFEAMWAEPETSI^ 

SEQ ID 4393 

ATGMGCAGMCCGCACCmACCTTCCX^ 

AOKXnTOAGCAGTTTtnmamC^ 

TTTAGATGCGATGACGGCAGATITICP^ 

TTGTCXX^TCAACCGCAACGACMCTGCGTCG 

(^GCCTCCGTAC^GGGCCGTCAGGTGMGAAGCCAAW 

CXX!ATTAAAAT1AGACGGCGCGATAGGCATTGCACAAAAACA^ 

TTGGAACGGGA(X5GCTACTGGCATCTTCC^^ 

CAAAGATGAAGGACAGTCTGAAAACAGAAGCACGCGGOT 

SEQ ID 4394 

MKQNRTFTFPDFRTVYSYAPLYRLQHLK^ 
LWLNRlfflNCVEBGAWSLSIiRDEAGNRLYMA^ 
LERWYWHLPQTPARKDI^ 

SEQ ID 4395 

ATGACC(^TACACATTGAAACAGGCA(XK^GCCTC 
ATATCACCATCGACCAAGATAAAACCCTTGCAGMGCCCGT^^ 
AACCGGCTCGCGOiGCX^ 
GACGA(7PTC(X^ATG(&TTCG^ 
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<M<^ACACCCGC<airo^ 



GMGACTACACOCGCGATTTCGAC 

TGGTCGAAGlTTCOCTGCCGCAAACw-™. ... _ >rl ^, 1 __ 

EDYTRDIiDi^IiKGVKIGIiPKEYFSB^ 

CTGnCGACCAraGTGaMMfGfPCGTTl&MTCT 
SEQ ID 4398 



(OTTGCGGAATTTOXXSGACGT^^ 

VGUtntEVNIAPAAAVIGMAATVHVTIGLTAGKWGVIPGREAEF AGGLVLIAIGTWPIiLSHlfililQ 

CGTOCOXraAAGTACGmCCGOTCT^ j 
TTCCCGCAACTGGCAGGT 

UmilTE^AWULWWJjt-HuriaiVjisxitoiAaii^unii . ICYAII jVAGKVSRlJj(^WnjffiI»aJl^VSraiVSG^ 



FIQU6 

GTCGGGCGTTGTTCGATTTGGCTTCAGACK3GTAT 



===== 

CTGGCAGAAA 

35525^^ 



TATCGCCGAGAAAMCGGGCAGCTTGAG 
^TGCGAAAAACAAGGATGATTTGATCCXj 
NAKNKroiilRKYHKAVNKyCR ' ***

SEQ ID 4409 

GTGATGAOGGATAAAGCAAAGTTCATTAGGGTT^^ 



SEQ ID 4410 

SEQ ID 4412 

MMSmMPSEALPVSDGIIiUI^ 

S |tcgJ^^ 
;^XH£ACGATAGCCGGGGCX3AAACTGGTC 
TCCGCCGACCXmGCGTCGCGOX^^ 
TCGGACAAGGCOTMGA(mA 
TTTTGAAACAAAAGG(XX7KX3GCCATCTO 
CmATTGATTTGTATG?KXXTTGGTCAG 
GCCGGA6AGGGCCGT1ATTCTTGGAAAGGTTCGGGCW 
TCtn^GCTClCTGCCGCCGATGCGC^^ 

SEQ ID 4414 

HVARVYK(3)EQI/&VHTTDAW 

SDKALQTASASDVREAAPASETRPRRMANPDKQDILSWDEL^ 

AGEGRYS^GSGraGGIFmmQGENSFRFTDAQHKKVVEI^A^ 

S ATOTOOTGC^ 
AAGTTTTGGCTGARATGCTGACTTTCAAAC^ 
CGCGTTGCGACACTATCAAAAMTCAGTACC<^ 
CTCCTTGATGAAGMGAAACAGAGTTTTTCGGCA^ 
ATTGGAGCGMGAAGAAATCCTCGCTTTCG^ 

CAGCTGCTTGCCTTATTGGTGGGCGCAAAACGCG^^ 
OTTOSACATCGCCGAAAAAC^^ 
AGGCAAAGCCGAWX&GTGTTGGTCGATGCGCOT 
CAC&GTATCCTCGATGCCGCCTCCAAAC^^ 

ATTGCAACGCAAA 
SEQ ID 4416 



QLLAIiIjVGAKRGEIIVDPC^GAGGRTIiAVGAQMANKGRIYAro 
HSILDAASKLVKPQGCLVYATCSVLPE^^ 

TACTGGGGGMCGCGGTTTCCGCTACXHTITAGCATTG^ 

GCTCGCTTTTTGGTTTACCAACG^ 

TATCATATCGAGCTCGCGTGTAAATACTA(X3GCG^ 

TGCATACGTCGCAAGGCAAAGC^CCTTAGCCGCACAAGACC^ 

QOTAMCCATACGGCIGGOGOGGCA^ 

aXX^CmAACATTAAACGCAACCGAW 

OTELACmtX^NLRGLNPKm 
AETEimTEIBRIDQGIi^ 

SEQ ID 4419 

A^TOCTCTCAMCCCCmmxm 

ACAOX^CCATOK^XmTCm 

(XaGCGCGGCGGADGTATTGCACAACGCCGAAATC^^ 

ACCCCGCGCGATTTCGTGCCC^ 

TGATCACCATCAACGGCMCXXC^ 

CCACGCXXX^CCCACGAGCAAATCAAA^ 

SEQ ID 4420 

MTALKPALPDTONIRIILTRTSHPANIGSAARAMKT^ 



FGRANTQTEDIDILRGPFNTVSHRIHKKD 
TAAAGAAWGTTCMSCGGCATTTCJCOGTCOGftACGftA&A^

SEQ ID 4422 

VQSFGAAKSEFIfflfia&m^ 

a^TCGGmAAAlACCGCC^CGO^^ 



am^TCCGCTCGA^ 

' " * ** w ' - - - - — — «~ ftACCGCATCGMTTGA 

GCGAAGGCGCGGCTTC 

ATGTCGGGTGAGGACGCTTGGTTGGAAAGCGGCGATATT 



GTGGCTGCCAAT(XX!AAAGTACTGGCGCi^TGCT 
VAANPKVLAQffiRIISAHV 



SEQ ID 4427 



AATAAAATATTGATTTTCCGCAA 

SEQ ID 4430 

SHLGLIQ*CRRIKCimWQTAFC*T*FSIGIIV*SY^naDFPQ 



SEQ ID 4432

tffilGmCOTUlQKSPIDDSYNIN^^ 
HQCTSMMNffllGCCCGCIMIT^^ 



SEQ ID 4434 

WWIRTPIiCEIGI 
SEQ ID 4435 



SEQ ID 4436 

aOOOGTCCCGGCAOTAAAAATCCTGCGGGATCGGTCTGGAMT 
HMPDGIVYADSPGSRGKSDAGGFTHCRIH^ 
CCCCCGACATCGCCGGACAAAACAAAGCCAAI
jjAjGTCCTOCAACAAGGOTGCCTiCCGGCGATK 

ATGTTGTCGGCAJ^TCiTTCAGGAATGGCTCGGCGGJT^ 
CAGCAGAGAATK?ITO^GT^^ 



.ATGCCMGTTTGRmTKGafta 



aX^jCAAlCMGIXMAACG^^ 
AAA 

SEQ ID 4442 -...^m^iiaB pmreTwmsismMPP^^^ 

AAAATAGGGCGCA(XCTGACCGTTATTTTGGAAACCTT6GAAGtt 

TCACAMm^^ 

AAAAAAATTTCCCCCGTCCGAACTCTAC(WAAMT^ 

TTCTCAAATATOCTTCTAAAAGAACS&GGAAAAAAAAATG^^ 

WTOMAAIMIT^^ 

SEQ ID 4444 vnMVimww^TTntTim^^ 



S ^!^at^^ 

SEQ ID 4446 

HRTIVBPSVYISRSHHJLSGMPSEQPSDGIQFHE 



CGAMAAGTOCGCA^^ 
^(XTOXIATMGGMTACACGATGTCTTTCAA 



SEQ 

TTff 
ACC 

SEQ ID 4452 




IIHDVPQ 
SEQ ID 4454



SEQ ID 4455 ^^^wamira^^ 

CTTTA 



AGGATTTTTGCTTTCAT 



SEQ ID 4457



SEQ ID 4460 

B3LRLVLTKPSETVSDG' 

CCG ^ 1 ™V?: ..^A^^wftOTmn^^ 



cgmuuuuikc^ 

AATGGCGC(M0C0GGCTGTTC<^T6T<KXTroC^TGMT 



SEQ ID 4464 ™™,™ mra n namiMlDSVVCTJ^ 



SEQ ID 4466 



AACCCGTCGTCTATS 
CATTCCTACCGGTAACG(mTAATTCG^
CATMCCGCCTCTAC6TTTACGGCGTMTGGCA0GTOT 

SEQ ID 4468

MNLFPGGFMLNPNFIPLAAVAAQDAVQRACETE3CSVLI 
AGIPDILKGIGQTVLPPLHGGWTTRRKTNHFG^ 
RGLNRKNRNKMAKVKEGLEVSEVIVQEGVYTYE^ 
HNRLYVYGVKARLSLLAASIELEETA 
SEQ ID 4469
TTGAAATACAGATGCXX3TCTGAAGCGGAAATCC(X^ 
ATCAA(£AATGGCC(SAA1K3CGAAA 
CGGTCGATTTGGCGCGCTACCTATTGCAGGAGTTC^ 
CGTGGTCAGGGAAATCGGG(XK3CGGATATTGGAGGMG^ 
GTCAGCGTCXsCGCTGCTGCTGAACCGCC^^ 
MTATGCCGACAGCCTGATTATCGCGCACAA^ 

SEQ ID 4470 

LmCRLKRKSGSDGISDIVRVGTSVS^ 



SEQ ID 4471 

ATGAGCATCAAGCAATGGCCGGAAGGCGAMGGCCCAGG^ 
TGAGCGCGGTCGATTTGGCGCGCTACCTATTGQ 
GTTCGCCGTGGTCAGGGAAATCGGGCGGCGGATATTGGAGGM 
GTCGAAGTCAGCGTCGCGCTGCTGCTGAAtt^ 
TGGACGAATATGCCGACAGCCTGATTATCGCGCAC^ 
TTCGCTGCTCGACCAT^ 
SEQ ID 4472

MSDCQWPEGERPREKLLEKGAAALSDAEIiliAILL^ 
VWSVALLLNRQNQLIAVRELSRGTVAENTinREITO 

SEQ ID 4473 

AAGCCATCXOTGCCTCCGGCAW^^ 

CCCTTTCATCGATAAAGACGACGACTACGAAAACGACC^^ 

T1TGAAAAAGCGG(X5CAATACGCGCACG^ 

CTTACGATGATGTGGCCTATTGGGACTTCA^ 

CTWEGCEATTCm^^ 
SEQ ID 4474 

METQNKPTVTDIDRPILVPPGGHKKVLLHSCCArc 

FEKAAQYAHHIGFPVFTSSIfilSKWKl^^ 
SEQ ID 4475

ATOTrTTTCAAACTCCOT 

OTCCCMGTACAG(XX?P^^ 

GAAAATCCGTCCGTTTAC(X3AAGAACAGGTCAAA 

GTGGACMTATGCC(1AC(&GTCAGGGG(^^ 

AAGMCAGGCTGGCGGGOTTTGGTGAATC^ 

TGAAGACGGCGCGGTTTCGTTTAAAAACATCACT^^ 

CAGGAACGAAAAGGCGTGCGC^^ 

TOm^GATCCGmC(XX3ACA^^ 

CGGCAGCGTGTGGGAGCTGGATAAGGCAGTTACCATC^TGGGT 

G?rrcCGAATCAGCGCGCAGGATGCCGGAACTGTO 

SEQ ID 4476 

MSFKLRYLASVLALSSL^^ 

VDNMPTS QGHKGQFGGRNS PTALNAALLG SQFWDGRAADVEEQ AGG P LVNPVEMANDSQ EAAAAKIAKVPEYQEMFKKAFPEDGAVS FKNI TTALGAF ERTUjTPTKWDEYI^GNVHALSE 

QERKGVRAFMDimACHN^^ 

VSESARTMPELPLTAFMESKPENK 

SEQ ID 4477 

ATG(mGTAAAAAAGCA(KXX3GCAGCACCC^^ 
GCGGTACCAAATTCCACGCAG(X!GACAACGTAOT 
CAT0CGTCCTTACACC6GTTCTGAA6AA 

SEQ ID 4478 

MASKKAGGSTRNGRDSEAKRLGVKAYGN 

SEQ ID 4479 

ATGCAACTTACCGCTGTCGGACTCAAIIX^ 

CAACGGAGGCGGTMTCX^T^ 

GTATCTGTACACXXTGGATATGCAGGAAACCGTO 

GCGGCTCAAGAACAGGAAAGTATC^ 

CCGCGTCCGTCMCaTCGCGGMCAG^ 

GTC(mTCTTCAACGG^ 
AAGOGGAAGTCGGCGATTTGAACGATra 
TTCATGCCGTCGCGCAGAJTrATCATITGGACAAA



>y 11 1 »-»*"»^VSiTOEFVRQ8Q5»QSVWJKMiBDBSEKAItK^^ 



GGCGTGCATGAAAACATCAGCACGGMGAGGMGC<XXXX^TO^ 

TOTCTTCAGOTrCAGO^^ 

S ^^^I»mi/JSAAGVFYTO 
SsiBEEA^^ 

KGEKYFIHWKGSNCIiEACTRYTYRI^FGHSLS^ 

S SL£OTA^«n^ 

A(^3(3TTTTCAGACGGCATTGGC<3GAAGC^^ 

GA^rax^TCArinramrcGcra 



AGAMTCCATQCCCACGATATTCOGTGCATCAAGGTGTACAACAAAACC^ 
AITOCGTI^TGAAAATAC^GTATAGAOGCACTGCGCGAAGCCATTGCCGACT 

MMSUIAIFARRARTOBGRIj^ 

YMOJQIJATIOT^ 

ISVAEWTGIDAIiREAlAEYCAAAHITDBTHIP 

S ?£f5cGa^^ 



ATCGAACAGCAGCWITCCGCACAGACGGCATCGTCCGCATCGACCTCAATra 

TWTOKMGGOSOGWGCGOQSOCTCRAJMKBCCGO 
AGG©&TGTATTTC(MftX&TCGCGATGnOT 

TTTCATATTCXffiCGAGCTCCTGCTT^^ 



QOCQCaCOOOCOWCIC^^ 
ACACAGGTTIGGGCATTGGGGAGGGCATCCTATGTTGCCGAMm 

mwcra^ocoGCKaA!^^ 

CCCGTWRXW^ 



FYQG^KNRRAGWmiAmi^^ 
QWVEGB



LSLATi 

AVLGNAItRUKRGEHPSBQ 

r m |H )LlHj f YfTHnHTVftT * * * I rifw ii rrir" r ■ ■> ~ 

AAATAATCCATATTGAC^ 

SEQ ID 4488 

LHKKNCHNKKSYAENKliKEMVPIENGCSA3^^ 

AIKXX5GCTITrACAGACTCAA(X!iO^ 
GACGGTMTGGTACCGTCTTCTTCAAGGCGC^^ 

DQiGTVFFKAQPFHAGNV^ 

AGC^CGCOTreOTITCAATTCGAC^ 
AACCTOKmArCAGCTC^^ 

SEQ ID 4492 

VYQIGYGVCKKPHM5KMRW 

S l^^TOGACAAAT(^TAAmTCr^ 

OTTCTGCATAA^ fi 
SEQ ID 4494

SEQ ID 4495

TJBGMMAMUIfflK^^ 

(KaTGnAAMATOAmTCAmTGrGMTCKTM* ! l 

SEQID4498 j 

tEIKEKPSIWIPTlffiQSTTVFCRSQSRKTSIiLSFYHDRTABML^^ 1 

SEQ ID 4498 



ccq ID 44S8 

TTOTCCG^^ 

TCCa^AOGCGTCGCOC^^ 
MTCAGCAAAAAACCGACCGGAACTCTQGT^ 

AAGAAAAACTGATTOMSACCCGmOTT^ma 

GiGCGCGCATTTGTGGMCAA 

n^FEHILVHGTOYKKDI^^^ 
ERAFVDQ 
SEQ ID 4501

(JTGGOTACGGACMT^^ 

GA 

SEQ ID 4502

TOGQYFLGIBCCHIiGPLSVKfflDSMSGRfiAESffiCIfDSG 

(mCSGGCACAATCOXXaTO^^ 
AAAACGOCTCTACGMTTGCGTTCGGCAGGGATAGCCGTTCC^^ 

ACATOiGCTTTra^mi^^ 
TGGGGADQGGACGCGCTCWjGCIGGCAGCCTCGAOTCCW 

BABAGKAEAWLAGliBAlARVHKKRQHiSQAPARHlffl^ 
WGRDAIiUAASISIiISLAEaPP 

S TraK^5raGCGa«3GO«X5A^^ 
TT1CTACATCGMTCCGCTTTITATATCAACC 
SEQ ID 4506

IjSSFPHRRESBLVGrBTYRVKRFIflnVPDSBIfiGCMDSGISTSHPIinCT 



SEQ ID 4507 ,„„„ M ^ M » W ~- W «^^ 



OGTAT(?reGGACAAAAITCCCGAAAACGAAATCTi 



GGCSGGTOflfiGTTTOCAGCCGftftCTTGC 
iOCCATTITTOCCGCGRifcTCMQCAGCGTraJ 



GATTACCAAACAACTGACCGGT* 

SEQ ID 4509 ^..^^^^a^TCCCUMaBM^^ 



ATC MACCTGCCMTCCT^^ 
SEQ ID 4510 _^ nm T ^ mn ^.p^ MnA A E ED9G^^ 



IKPiHPMKDGLIiTiEOaGEVD 

SEQ ID 451 1 __ 1 ^ v ^-», mmv ,»ni«ipp»r>»i 
A1^TCT(^TCOTTTTCCT6C^ 

GCC 



A 

SEQ 



SEQ ID 4513



GGGTA' 

GAAAACAAGATTTGCGGCTGCGCTl 
OXSTTTCCGOCTG^^ 



SEQ ID 4515 

(yTGTGGCGATGGGTGCGGTCGGGA' 
(^AACAGGCAAAATGACGGCGGCGGGATTTOT 

SEQ ID 4516

VTCVWRSGCCPVS^ 

(?rGCGt J r ^ 

SEQ ID 4518

LTMLUMLGIiIO 
(&TTGCCAAACACOCCATO

S ^^WPFAVIA^ 
OjPLLEAGMEMOTCKTTQSAAIVVFSSV^ 

CAW3CAGAAAG6A&ACAC&A 

SEQ ID 4522 

OTAEMPSETFQTAFIiYYBPFCGF^^ 

ACGGGGAmTGCTC^ 

CCTMTTGCGTOAGCCGTTATGCGGGCACAGTPGMff^^ 

CCCAAACAAGAAATCGTTGTATTCCt^ 

AATTGGAACAAGATAAAAACAATATGATTQCCGTIX^A 

PKQEIVVFRASGVMASNKRKINEAYIjDYSASSKLP 

ggccgctgattox^tgattcccatcgtt^ 

6MGCTt3ACAAGCAAOGGCGTAT6CGGCG6ATTA!^^ 

TTTACCG'K^^ 

(ETTTGAGTTTC^^ 

PTVIIVTIVMyAliIiEHWaRmATO ! 
SEQ ID 4527 

ATCG^mJ^CCGTCCGGCTTCGGMTTTCCG^ 
CGM7FGGGAGGATTTGATTCCT(X^ 

S ^^^G^PVSKRKVMRPP^ 

SEQ ID 4529

ATGATTAT 

SEQ ID 4530 

MYPSLCGCFLYLOIRHIjI^ 

T(mAAACGCTO3(^G<mTO 

CCTGCACATCCAATACGGJATGAGCGGC^^ 

AACCIX^CCTACGCGCTGATTTGC^ 

SEQ ID 4532 

MDAVQLKSmV^^ 



CAGCAACCGtK^TCACC^^ 
AAAAGAAGGCGCGGGCGTGGCTATOT^ 

KRRRGRGyyRIVPPQYAAQPHUXjRIRG 

($TGCCGCCACACTATG(£OT 
CGTTGTCTCGGACAAC 

SEQ ID 4536 LJ , J tmnmCT , 

LCTYPQTLIDLVAGGIGVAMVQGNKAEAA^^ 

CTTTGCTTTCAAAC 
^^^^
CMCATATCGMGAGGKGAACTCCMOT 

GACTTCGGCGAtMTCTTITOGGCGraGGCTTTGK 



MGGCGTTWCMTOTTTTCAG^ 

SEQ ID 4542 

^AKVIMLKNEIBGGUjIPPDWAT — — 




QGMSMSGCOWaCIG^ 



ClUAWUUUin l«BLWflWWJnnw»v* . - - 



KSTQVGLYSQDGGSINVDRKDNI IEGDAVAU 
ATVSDKGLVAIKPUimmVAOTlHY^ 

TUTQEL^GANKFUiHIiJKDG IHSI^YVKKGTSTPQEVVVKHLSEVLDSMNYGERI^A' 

nun (uniitnnnnnntDfflTT n?1 

NKTYTDNRQNVYLVKQ& 



IITIDSNVANLARQIX3SIHLNYKDDTRITG 



QSNAFRFHNNDPEIGYDRFSU^EKKRKWGISIiDYGH 
KBKIFKYELALYNTTQHDKEGDI^^
VBKFDGGKKLFYIK^PHEPLGSRSFKA^ 

SEQ ID 4545 mm 

TTGAGAGTAC^GTOTTTTAAGCXX^ATO 

AflHXXSCCAGTCCAAC^ 

CMGGGGMTAATGAAGTTGATTTAAATGTAAAAAAC^^ 

AATCCGGACACCAAGACOX^MTC^ 

SEQ ID 4546 

LRVQVFLRGOTAFKSNTVL^^ 
NPDTKTAE^WKEKyPNNSPEPYRVGGSIHD 

SEQ ID 4547

TTGGAACTTATTACCACCGGGGAAGGGGCAGATGGi^ 

ATCATGACATTATTTTCAACMTGATTTO^ 
AGATCCAACCACACATCAAGAAAAGTTAGT^ 
AATTTGTICAATATGAACGTTAA(X3TICATAAAGGCACA^ 
GGAAGCTTGCCGAGCCTATGCCGGCAGGAAAAGTTGAGAOT 

SEQ ID 4548

LELITTGEGMXmVlKSSYGI^ 
NLFHMNVNVHKGTHHFGIHNIrf^ 

SEQ ID 4549 

ATGACCGATTTGTTTGTCXXaGAA^ 

CGGTGGAAGGCGGGAAGOCGCATTCTATGTTGCTGTGG^ 

GTTTTCCGGCGTGMGGACATCCGCGGGGCMTO 

GCGTTTTOCC(^TGTCGAA^ 

TGTCTTCAGACGACXrimGAAACT^ 

CAGATTATTGAATTTGTTGGAACAACTTW 

GGCGAGAGTITCTACAACCAAATCraGCCOT 

CCCGCCGaTCGTGCGTATCGCTTGG(M 

GCTGGCGCAAGCCGTGCTGTACCTTGCCGCCGCCGCC^ 

CX5CMCG(XXXX3ACCAAGCTGATGMGGMTTGGGCTACGG 

CGGACTmACCAAOXXSTCCaSC^^ ! 

SEQ ID 4550 

mPLPVREPDAPIiAEMiRPm 
AFLPHVESGLLTFIGATTENPSFEVNP^ 
GESFYWISALHKSVRGSHPH^ 
RNAFTKUOEI^GR^ 

SEQ ID 4551 

TTGAGAATGACTGCTCTGAAAACCACCCCGT 

ACGMGCCGTGCGCACCGACGCCGGTATGTTTGACGTA 

CGCTTTTGTCGGCAAAGCCCTTFATT^^ 

GCGACCCGTGAAAAAGACACGGCACAATTCCACAAAGTCGG 

TCCTGACCGTCAAACCCGAATGGGCGGATGTGATra 

AGTGATTCTGCCGGATACCGAAGCCGTCGC^^ 

GGCAACGATATGGACGACGA(^CCA(XXXX3OT 

GCGTTGCOTCAAACAAGTOKn'TTGCTGOT 

GAAACAATCTATCGCCATCGCCCGOTAOT 

CAAAAACAGTTTGAT 

SEQ ID 4552 

LKMTMiKTTPFH^^ 

ATREKOTAQFHKVGQEFGVAFNPRYDIiGffiiA^ 
GND)fflOTSPLEA(MGR!EVDLKD& 



SEQ ID 4553 

ATCK^jCXjATAAAATCAAACTGGAATCXGGTGCAOT 
AACACGTAGTGTACAAAGAAACCAAACTGAAA 

SEQ ID 4554 

MRDKIKIiESGAGTGHFYTTTKNKR^^ 
SEQ ID 4555 

TTGATGACTCATTACGFEACGTTTATG^ 

ATTGGGACAGACAAACCAATATCTTTCOC^^ 

TMCGAAGATTTGGGGCAAAAATTTTACGAACGOTC^^ 

ATCAACGGCGACGAAGCATOXXXX^ 

TGGGCAAACGTGCGAAAAATCXXIAAATTCCt^^ 

CGCCTCGTCTGOTGCXSQCAAG^ 

TTATTGAMCAGGCGCAGCGCGAATTTACTC 
AAGCGGAAM

CAACCCCAAAGGCTCATTTCTTGCCAA 
TTCfflKGACTACATCCACAAATACTATO^ 

CGRGAtXCITCGGTGCCftTGGTCGTTTGCGACAGOGCAGAMM 

ACTMGGAftGAACGCGACCARTCGGTTftARGWTTQiARK^ 

GCA(&CTGATAAAA(XXXATAACCTITIACAAAC^^ 

CAAOXX^ATTOSGA^^ 

Tira^TACCGAAAAC^^ 

TGCGCCTGCAASjCAGCCACGAATOCTTCCG^ 



WGCCiaCIOOMCOT 

TI 

SEQ ID 4556 „™„^^™»™on<nfpraPDiJ^^ 

GLAGNFDQDDPEYISLREELERI 



(XAAACAAAACATOGGTAAACACATCAAAGCCAT 



SEQ ID 4559 .^^..^...rnnrlirAPir^M^M^^ 




_,.„ . ATGGA 

GTTCGAAATAATCTAi 
SEQ ID 4560 




VRNNIrf(QNHHLTQUtf3^^ 

AGGAATTTAGAACGCCACAAAGWn'AAGTrCGATOSCWnA 
IFLLVSCSKSFYFNPVFFIiFGYRFYQAKTESGLLLVIiISKQBFIff 

■ssss-------- ss^ 

AAGGGATK^GATTTGTATOGAGAAGCTACAAATACTGAAGTACAGCAA 
VEISOTQLKHLLYGDERYmALGKEKRIiANSVQPI 
S A&SSc^ACCG M cm

GATTGGCGATGTTGAMTTCAATTCA 

VOIJYVDISADEPAQMAGFSADLDKLFAES^ 

GAMTCATCGGAAA 
SEQ ID 4568

ffiPVKLKMPSESLSreiLFGIiENTYAVLSPKERIQKTKATGIHRK 

ATGAiACGKSTOSTTOTC^ 
GTATCCfflMGATGGCAAAC^^ 

SEQ ID 4570

KPSEPILIKlFI/rAADBRQ 



A 



EHHPSEVD 
SEQ ID 4573



TAATGCAA 

SEQ ID 4574

MTQ] 

SEQ ID 4575 



TCTGAAAGGCTO^GACGGCATTT 
ATCACCGACACAGCOX^^

TTTTCAAAAACAAATCCTT^ 

WCCAT(jTCCCATATTTCCCCCA 

TTCGTTACCCCCGMGCCGTCMPM 

CGCAATACGGTAOCA&OTIAWG^ 

AGACATCGTCCAACXCGGTCATATCTTCCCGOT 

GCCGTC&TCTGCGAAATCATCAACGJ^^ 

GCCGC»0C3GAM^^ 

(^(^GGCCGAAAGCGGCGTCGTCATCC^^ 

GCATCOXXXmAATCC^^ 

AAAA 

S ^GUl^ 
FVTPEAVNFMIKHARGLVCLPMEGAMVEKIXSLPMM^ 
AVICEIQTOIXaTKARMPBLMR^ 
QQAESt^LLHHTEIEAS^ 

GCTTC 
SEQ ID 4580

S ^rec^GAW^CAGCcmxxm^ 

SEQ ID 4582 

VREETAAPPAGIiRQCRLKPRIRLQTASVYQKANOTIWR^ 

TTTTTTTTGCCCGTTTT 

SEQ ID 4584 

LIAGYGIGGTSRRNTGNPMMNAAFPIKTG^ 

TTTCCTT'iraXGTCCGTTATGT^ 

niiuuwA,!^ ATCCTCAAAAACGGGCAAAAAAAAGCCCTGTTACTIOT 



AAG 
K 

^ G^(£AAAW^^ 
TCGGCTATAATACACMTTATATGCGCCGTCTC 
CGAAGAATACC^CCGACAGtt^ 
GTATTGGACMCGCC^TCGACGAAGCACTCGOT 
ACCCGAAAGAAGGGCGTTC(^ 
CAACGCC^nxmCT^ 
GGCACGACCGTGCGCTTCCTCGCCGGQ^CGGAAAOT 
T^CXIGACXjAGCGCGACGGCAAGCACGAAAGOTXJGCO^^ 
CGA(^GACGGaTGAGCGTCGMTCCGCM 
0KX1AAGTGATGACGCGCACCAT(^CAGCTACA 
AACTGOXBACaxm^ 
CGAAGCCAAAATCATCACCGGCAAAATCGT^ 
OTGCCGACTGCCMGAAAMGACCCTGCCCTC 
AAGGTAAAATTTTGAACXSTCGAAAAAGCACGTTT^^ 

TACATCGCCCAGCCXXXGCOTAC^ 
CAGACGGCCGCACCATCGAAGGCGCAGAACTTGCCGACA 
GCACGCGTCGCCCATTGATTTGACGTCGl^^ 
COOTC^^^TCAC^ 
TGGTCCm^GGGCGCCM(X?rCT 
AGGTTTCGGCGAGATGAACCCCGAGCAGCTTTGGGAAAC^ 
ATGGGCGACGAGGTCGAACCXKJGCCGCGCCOT 
SEQ ID 4588 



VUMIAIDEAIAGHCDKITVTIHAIBSVSVADNGRGMPTGIH 
GTTVRFIAGTETPGNIEYSFDILAKRIRELSF^^ 
RQVmTINSnE^MVAKKAKVETAGDDMHEGLTCVLSm

LMX^HTOPALSELYLVK^ 

YIAQPPLYKAKYGKQESYLKDELEKI^WIiLGLALEKAK^ 

RPIKITRKLHGNVHVmEPK^ 

KGDBVEPRRAFLENNALIAQNIDA 

SEQ ID 4589 

TTCKXK^TTGTCTGAATGGT^^ 

GTGTCTGCGCCTTCGGTGGTTACATTGTAGTTG^^ 

TCTGTTTTCACATGATTTTCCGAAAACAAM 

TCCTTAAAACGGAACGCAMTATGC(XCM 

TTTCGCCTTCTCOTCCCTG^ 

TOCATCOKAAAGCMAA^ 

ACCACCTGCTGAAGGAA 
SEQ ID 4590

LRGCLNGGSDGLLPFGOTSHAALQ^ 
SLKRNANMPQLTLDKTDIKII^ 

™nafshfvlotllshhg^ 
SEQ ID 4591 

TTGAAACGGCATTGTTTTTCGTGGTATAAA 

SEQ ID 4592 

LKRHCFSWYKSRFTOjEVWR^ 

SEQ ID 4593 

ATGAACAAAACCCTCTATCGTG^ 

TGAAATCCGTTTCTTTCATTCCTACTC^ 

TAAAGCTGCTCCTAAAACCCAACAAGCCACGATTCTO 

GTGGGTAATCGCGGGGCQATTTTAAACMCAGTCGCAGCM 

TCAACAGCAGCCATCCTTCACAACTGAA^^ 

CCCTGCCACTTTGACGACAGGCCMCCGCAATAra 

ACAOTA3TCTTGTATGC^ 

SEQ ID 4594

hTOLYRVIFNRKRGAVVAVAE^ 
VGHGMLNNSRSMTQTQIK3GTO 
TRILVCQQNHLDQYGRTSRHS 
SEQ ID 4595 
TTGTATCGTGCCATKX3TATTAT(M 

SEQ ID 4596 

LYRMRIINIMSLLTIVNNQLLIGRYFAILIVQAA 

SEQ ID 4597 

GTGATACCGATTTCACATOTATTCTTGT 

AT6CAAATGGCCGTTTG(nmTAGTGGCACG^ 

OTATCGCMCAAGGCACTC^^ 

GCGACCAAICyUOACTGAIlM 

ACGGCACACTTGCCGCTGATAATAAACTGGATATTGCOT 

ACATAOTTGCMGCAGGAAAACGCATTCGGA 

AGACXSOTGATTGACGGACAACAAACCAAAATCCA^ 

AAAACGGTACAGGTGCCGCCATTGCGGCACGCGAm 

AGATACX^TGACCAAGCCACAG(X!AAAGCCCAMGGA^ 

CAGACCACTTACGCACCCCTGATGGAGTGGCGOT 
CGATTTGATTATTGATAGCAAAGCAGTCTT^ 

ACTGGGTTC^TTTGCCTATGAATCGC^ 
SEQ ID 4598 

VIPISHVFLYANKmiSTAEQAGIRNQGQLFASSGNVAIDAN^ 
AraQQIiIIHIX3QQSTVVIDIfITOIQS 
RGLIDGQQTKIQAC^MNHGTGRnGDNIAIMTRLDNQDEN^ 
K^LVBTGRERIVDYEAFGRH^ 

KSSAKMVSCTTTGVRVVKr^QGIVNKI ILCRRKSHATFHWVHLR4NRIAKH 
SEQ ID 4599 
TTGGGTTTf&TGATAGGTTTCGCTTTG^ 
TCGTTTCCTGOTGCMTACAAC^^ 

SEQ ID 4600 

IXSLMIGFALALSlOTmmACLNPGAIIGNNITAK^ 
TTOC&CWOiWJTQGCGTGCGCQTOT

CGCATAGCAAAGCOT^ 

TTOACCCCATTACCCGGCAGCAGOTATACA^ 

GGCAGCCTCAAACTAGACCC&AACAATTTACM 

AAAACX^CGAAGAACMTTTAAAGOTTAATGGA^ 

TGTOGGTTGGTACAAAMGAAGTTAAACTTC^^ 

GGCAGCAATACACAAATCAATGTTTCAGGCAGCCTGAAA^^ 

TCAAAATGCACAAGGTAGCAGCAOTACCTAGACCGAATGG^ 
ATOGCAATCMTCAGATCMGGGCAAACCXXXXTGCAG 

GTATGCTAAAMTGACATTACTATCAGCTCAGGCATCCATO 

A(3T(&TCACGAAACTGCTCAAAGCAG 

CAGGCAMCMXmC^^ 

O^mGAAAACCAMCCC^ 

CCTCAGGGCAACAACCTTATCA^ 

CXX3TTACCX3ATTTGGCACAACAAGCGATT^ 

CAAACAGGCAAAGGCGCACAAAACT 

GSLKLDPNNUIKia^YYEQRLINJQIAEL 



pEGNNLISTQS©lGAAQNQIflSK^^ 



GGTGTTTCAGGCAGAGCAGGAACCGGCX: 
SEQ ID 4604 



GCAGCGACAGCGACAGTCAAAGCAGCATCACAAAAAGCGGCATC^ 
CCGATAT 

SEQ ID 4606 

U»SVAVSGKrifiQGA^ 

AAACAGCAGCAGCCGAGGCATTAGCCAATGGCGATATGGAAACTGCC^^ 
CKTGTTaGCXITCAGGOT 

GC6MT66CAAACTAACC6(XAGTCAAGAAACCGCTCA^ 
SEQ ID 4608 « _ \n*r » nrrr \ oTnnrtfvi»nT» » \tn\ ^ppy^YAIQQHPKDLAGQN 



ANGKLTASQETAHVLAHAVLGAAVAAA 

TGGOTACMCTGCCGGAGCAG^ 

SEQ ID 4610 

LGAGGSEAAAPIIGKWLYGKGIX^SIiNAEEKETVSAITRMIiCT 

S ^GG^ 
TOATCEAAAACAATGCGCTGATTGCGCAA^ 
M(XGCCTTTCGGAGCmACCATG^ 
GftOXIGCAAACCGOCCGCATCGCCATCGACATO 

axrroccGGamxxsAACGcc^^ 

GOXmCACCmACGGCCTGCG^ 
OG-raSGAAACGGAAAAATC 

CttCCXXXXGCAGOT^^ 

GTCOTACCGOTCGACCAACGGC(KX5GCAGGQ!P^ 

CAATCGGCATCCTCTACAAAACCAACGCCTCCATTTO 

GCCCAAACAAGTGGAAAACGOKCCG^ 

AAAGCCCTCAAACKXm^ 

AAA(XTKX3CTTGCCGGACTCGCCG(X^ 
DAOTAKtMDIYGSLALTGYGHGTFDM^fljGLEGSLPHDI
nSHBKWCTETOSQyBLM 
VVTAPTOGAAGIIPAVLHYFRKFNPHAT^ 
. KMiKLGTLMiLn)GTIKKVSIiDE^ 

TTGAAACGGCATTGTTTTTCGTGGTA 

SEQ ID 4614 

KQKLPKSAI IAPHLFRTCKRSADAALGRLTR^^ 

SEQ ID 4616

VGLPSIKFRFRFLFFISYKHVIKre 

TTTGGATGGTACCGATACGGGCATAGAGAAGATG^ 

AAATTCCGGTTTGCGCTCAACM 

OTATACCAOJACCCGT^^ 

CCATGCAGGAOTATTGCAGCATTCX^^ 

GATCTCCAAATCGTTOX^ 

^^UIlS - 

KFRFALNHAI£QTHFVSGKC1^ 

DLQIVPVEREPHQSDVWQHR 
SEQ ID 4619

GTGACTGTGGGG^ 

CAATTGGCGGTACGCCCGATGAGGAAAATTTTGACGGC^ 
TGOTACCGTTACCATCAGGC6GTTTCCG6ATTATC6GAA 
TATCTCAGTGTAAAAC^ 

GTCGCAGTACGGC^GATTTTMGTTGAAATATAAACACGG^ 

GGCTGATGTAAATACTCCTTTTCAAATCGGTA^ 

ttCACCGTACGTGGCTTCGACGGTGAAATGAGTT^^ 



TGCATTGAAAAA6CCCG^TATTTTCAGAC6AA6AA^ 

SEQ ID 4620 

VSVGMDNSGSKATGKYQGNITPSADNPFGL . . ^ 

Yli^WTRETKSYID^^ 
HTVRGFIXSENSIiPAERGWYW^ 

MATTmTTTCACTGTT^ 
TTGGAACAGGGGAAATGCOTTAAAGAAAGTAAAGCTMtX^ 

G6A 
6 

ATAGCTTTACTTTCTTTAAAGGCA^ 
AMG 

SEQ ID 4624 

VIYFSFGGIIILSnilAGICPYCHKFQKMNYGTSVGIGSDSPTFFKG 

AAAAGAAAAATTTC(X3AAT(£^ 

SEQ ID 4626 

MIPPKUCYITNIIYIIVICRKIiEP^^ 

TCCMGACGTGATAACXKXnTCTAATTTT^^ 

SEQ ID 4628 

MNTNLNDKDKAMDTAIRFQKRMRIPKFFFIJW 
SEQ ID 4630

LSLSFKFVFMFLTLFVSKItf 

SEQ ID 4631 

AT6ACCGCAC6CATAAGCCGTCCTCOT 
TCGATTTTCMGCGCXKTTTTTCC^^ 

ATAACGCTTCATCACXmTCAAAAGACAGCrc 

TAAWOTGAOGCftCTQOGCTTOGGAA 

CCAMCCTOCTTCMTGCCC^^ 

GT&TTOCCTGTCCTraSCGGCG^ 

CMTCCGTTCCGCCTGAACSTTGtt 

CGTTGOCGTAAATTCGCCTTTTTTCW 

TTGAGCGAGTATTGTTTCAAMCGGCA^ 

CTTTATAAAAACCOTGATATTGTOT^ 

AACGGTITrGMGGACGGTTTCTOT^^ 

ACCACGGTGTTGACCACTTCGTTTTCC^^ 

TCTGACCGTTAACGGTTGCCAGCGTTT^^ 

raGGCACSACTTCATTCAGTCCGCOT 

SEQ ID 4632 

MTARISRPPYIiNVQCRIim^ 

ITLHQPFKRQPFEQELLRLTW^ 

I£DLRHQRIHQSRVDFGCFGCRKTC^ 

RCRKPAFFQVIOTiIKRRNTLPQIFQRM 

NGFEGRFLVVARLFRFGTQFGKRIFKIX31PW 

SGRLHSVRRMEIIiVRGSRPDAERVNHAA 

SEQ ID 4633 

TTGTATTGTTTGTTCAAAGGAAATTOATGAAAAAM 
AGGTCAAAGGGTTCGGCAGGCAGAAGGGGCATACGGAAATCTAT^^ 

SEQ ID 4634 

LYCIJKGNFMKKIEAIVKPFKLDIMIEALTEIGITGMWS^ 

SEQ ID 4635 

ATGAATTTGGGTCGTATTTTAGGCAGCGGGATMTG^ 

TTAAA 
SEQ ID 4636 

MNLGRILGSGIMFKWDl^fNG^ 

SEQ ID 4637 

TTGATATTAATGAACACyVCC(^TCCCCTTCTC^^ 

TCATGCTCCAGCAMCGCAGGTCGCCGCCGTGTO 

GGGCTTGGGCTATTACGGQ^jCGCGCGCAAT^^ 

AGAAGCACCGCCGCCGCCATTTCTGCCOT 

AATTTGAAAACTCGCTCTGGACA^ 

OTGTOXBTCAAT^^ 

GTCCGCAACXXKK^CGGCGCAATCTTG(^^ 

TCTtXXTTGACGATGGCGGATATGGACGAACAAAC^ 

TTGGATAMGCCXJGGCCATTTGAAAGATTACGGC^ 

SEQ ID 4638 

IJLMNTPIPFSmiKWQKQH^^ 

RSTAAMSAFAFHRRETIIiWlMRVLa 

VBWHJGAILLEKRTAKGIW^ 

SEQ ID 4639 

ATGTTATTGTATAAAATTGTGAAAATMTTCTTATTGAOT 

SEQ ID 4640 

MLLTKIVKIILIDIiFFVGAYNSyKETIiUff 

SEQ ID 4641 

ATGAAAGCAATGGTTTATCACGGCGCAM 
ACTTGGGTATTOXSAMGGCAAAMCCC^ 
AGTCATEATTTCATGCGTCAGCAAATG^ 

TACGTCCmCOXm^ 
GCGATGTGAAAOTGGCGATACCGTATTC^ 

GTCGGTATCOTXXaACT^^ 

ATCOTGCCATCACTACCGGTTTG^ 

GGAAAAGCCTACGACGTGTTCAAACACGCC^^
SEQ ID 4642

MKAMVYHGANDIRFEEKFRPQ^ 

YVRTPYADNSLVPPHDNVNEEIALL^ 

VGIPAfl^CQDIVKPGGKIAVV^^ 

SEQ ID 4643

TKmAATTCCTTTTC^^ 
TAAATTTAAACCACTACGGCOTA^^ 
SEQ ID 4644 

LQNSFSSNSimPTTGFSAVFVSNNKRSHQNIPLIPOT 

SEQ ID 4645

ATGGGATATTTTGGTGGGAACGGCGGTTATTTGAAACGAAAA 
CGACGGTTAiOTGTTMnTO 
SEQ ID 4646

SEQ ID 4647

TTGCCGCTTGGCGACTCAGTC(KX!ATACACAGCAG 
TATATOATCMTTAGTCATAAAAATCGCCCTATCOT 

SEQ ID 4648 

liPLGDSVGIHSSTAPKMPIHLQNNANDARYOT 

SEQ ID 4649 

CCGCTTCMCGCCAACG^^ 

SEQ ID 4650 

PLQRQRQHARRNQPQAYPFLRVBAFVQEHHSB 

SEQ ID 4651 

GTGCCTTOATCX^ATTGACGACTTAAG^ 
GCCTGATTGCCAMGAAATGCCCMCX^ 
ATCCCTWXX^(X:CGGCC(m 
GTC(nTGGCGGCAMGCGATGCTCAMGCOTOT 

TATCK^GTCGGCGGCTCTTGGCTGACA^ * 

SEQ ID 4652

VPVMAIDDLSTAVDLSW&V^ 

VVGGKAMLKALYGPYADV^

SEQ ID 4653 

TTGTCTGAmCTTAACXrCCC(X^T(XC^ 
CCTGTGATGGCGAT 

SEQ ID 4654 

LSENI/TPPI PFDLNKNOTPHliENHIfVQTOPPRNPDRRRS^ 

SEQ ID 4655 

ATGAACCACACTCCTATCCACCCCAMCTC 

TAGAGGXyUKXXGCTCGOT 

C(^CATGGTTTCCGCACACCA(KXK5TTT^ 

ACXX!AAGGCTACGCCGGCATGGAATT(nCACTGT^ 

MAT(X?ITCCGGGT^ 

TTTC(KXX3AAGG(»AAGTCGGA^ 

ATGGGCOTACACCTGCCTGCCGCCGOT^^ 

TGGGCGAAATGTTGACCGAAAAATCCTTCATC^ 

GMCTGGGA(^CTTCGACGAMTTTC7IKX^ 

GMTTGCTGGACGCAGGCTTATTGCACGACGATG 

A(»CCAGC(mA(^a3ACATCCT(^ 

AfiGCK^CATaTCGAAGCAOT 

CGTGCCAACGGTATGCCCGMTTGCACAA^ 

(XXjCGTOCATCCACATGACGCXXXJAAGCXX?^ 

CGAATGGMCGTCCGAGAAGTCGAG^^ 

TTOGCGGAGAATTTGCC 
SEQ ID 4656 

Mhnn^IHPKLAEITGRIIERSRPTREKYLAn 

TQGYAGMELSLFSRDVIAMSTAVGL^ 

M(mPAAAFVHPYTDLREALTRYAAGHLARGIKNGTro 

ELLDAGLIifflDVDro 

RANGMPELHKLTPPLGILQDRGFKV^ 

FGGBPA 
SEQ ID 4657 

GTGGATAGGAGTCTGGTTCATGTTCXSG^ 

TAAACCEGTCX^GCGTTGCCTCGCC^^ 

GCGGGTGCTATTA 

SEQ ID 4658 

VDRSVVHVRSPGGTGWYV&PVCAFLOT
SEQ ID 4659
ATGAGTACACAGACAAATTTOAm^^ 

CCACCTGAAAAACGTC 

siS===s====s=^ 



CGCCAGCATCGGGCACGAAGGTATGCCTCATGTCCAC^GAACTC^ 
AGGTTCTTAAAAGAATCCGMCTCGAAMCGGCATCGCXaCCCTGAT^ 

(^TACACAACAAGCACATCGATTACGACA^^ 

ELYTPMVSRIiLQLAV^^ 
ACGGAAAACCTCAATCGGGAGCAl^^
SEQ ID 4670

MAETP PFKMGFGPMTPSKSGRYAGLRIjMCASKCGNAPRHGKPQSGAYR 

SEQ ID 4671 

ATGAAACACCTTCACGACTTACCGGCCTGGT^ 
TGCAAGTCGGCG(5ACTG^^ 
CGGCGAAAAAATCAATACCACCGAAAACC(X^ 
CAACGTATGGGC(^TTTGCACACGMGTCX)GCAG^ 
TGTCTACCGCGCTCAAACC^^ 
CATCGCCTCCAAAACATmCCACGtt^ 
ACCMTC^AAAAGCCGTCGCCGAATTCGGCATC 
TCGGCGAAGAAAACTTCA303AAATGCTCW 
CAACTACTACGGTGGCGGCAGCCACGTOT 
AAA«^GTCG(yOCGAAACCKXXm 
CGCTTGAAAMCGCAGCMCCTGCCCGGACACCACG^ 
GCAGGGTATGGATGAGGCGCGCATCGAAGAGCTGGTITO 
GCCATGTACGAACACAAMCOTTCGTCCAA 
AMCCGAACCTCAAAMC^CGACAGCTCGACC^ 
SEQ ID 4672 
MKHLHDLPAWSKLWNHFDD^ 
QRHGEFAHEVRSGSWLGTTKQVITDVVNI6IG6SDIC 
TNRKAVAEFGIDIANMFEFWDWVGG^ 
RAVGHETSPII^ETGINGQHAFFQIiLHQGT^ 
AMYKIKTFVQGIIYfGINSFDQWGVELG^ 

SEQ ID 4673 

ATGAAAAAAAATTTACCCGCACTCKjCATTGGCAAffT 
AAGAGATTTTGGTCAMC^TCCCm 
GffTCGAACGTTTGGGCAmCCGTC^ 
GCCGTCGCCGC^CmTCTATAGGCAACAGCCATMGA 
CAGACGACAAAAGCAAAATCTTTO 
CGAAAAACTTGMGCGGCAGAAGCCACOGCGCAGGMGCGAGG^ 
TACAAGCTGTCGCGCAGCGAGTT(X5AGCAGTTTTGGAAAG^ 
ATGCGAAAGCCGAAGGK^^ 
MTGGTCAGGGAAGAGGACAAGAAAGAACTGCCAAAGCGGCTC 
SEQ ID 4674 
MKKNLPALALASMLILSGCDRLGIGN 
AVAANQSIGNSHKKTPDFFEPYYRKEGAYmTISYSVQPTO 
YKLSRSEFEQTOGIiPQTVQNK^ 

SEQ ID 4675 

ATGCCCCAGTACCGCMCTGGCAAMCGGTTTGG^ 
AAGGTTATCTGATOAC 
SEQ ID 4676 

MPQYRNWQN^IXMCAIXSEYGVAQSraTRmAKTKEIQGYI^ 

SEQ ID 4677 

TTGACGGTTTTTTGGMGACTTGGACGAAGAACT^ 
CGATATGCTCGAQTTGCCGATGCCGCCGT^^ 
TACACCAAOTTTCTCOTATCCGACGC 
AMTGGGACATGAAGGCTGGGCAAAACTCGGTATCACATOT 
TTGGGGATMTttX&ATTTTTTTGCCGC^ 
TCGGCCCGATGGTTCA 
SEQ ID 4678 
LTVFWKTWTKNCSDWMHPPPN^^ 
KWDMKAGQNSVSHPAHRTIRKIFQNDPL^ 

SEQ ID 4679 

ATG(£GTCTXjAM(X&AGGACACMTGGATO^ 

ACGTCGCCGTCCCCGTCATTGCCGGCMOT 

ACCTAICGGCGAGGTCmTT^ 

AICGCCGG(X£GTTGATTOT 

CGATACTCGGCGGTTGGATTTCCGGAMCTO^ 

ACCTTGGGCATAACOTXX^GGTGTGC^^ 

CCACCTCGTTGGGTTTCATGGTGTATATGGG^ 

(XXXXSTCTTOCTATCTCO^ 

GACATGGATATCGGCMCGTCATOTX£C^^ 

OTGCAGCCTGTCGAATTTO^TGCGOTK^ 

CTATTCCGC^CATTGCACGAMtf^X^^ 

GAAATATTCCPOGCCGGCAGCATTTTGOT
EIFIAGSILFTVLIPIGWIiAKPPFHSGGGGH

S mcSS" ^^^i^i^Mmu^^ 

GGATAGTAAA' 
GTCAATCAGA 

SEQID46C 

IiRRLWHRQARl 
GCCT 

S OTTC<!gLc^^ 



ATCGGCAAATI 
GCCGCXACCGAGGOTCGMn'TCCXXSTGGGGTCG 

juSraOIQGM 

IGRSASGNAASPTPIVPEIPEIDEEI«RESGRSST(MKPRBH8GU)PPVGS 



CCGACC 
SEQ ID 4690 



PT 

SC(aAAC«X3ITACCG<mAiJkA»COT 

S ^twSavri™svwvgpgaw 

I^SEAEOTAiTmVKTMJ»ABAAAAr» 



GAAAAATATATTGAAA(WXXX&TCOTAATCAAA 
S |^ASttTCFC'STIKGtfn^IFim^
SEQ ID 4697

ATGAAATCAGGAA^^ 

OCGCCXmOGCCGGATCGGCGTTTT^ 

GGTmGCACtoCGATACGGATGCCGTGAA^ 

AACXXXXXmAAAT(XXXmCTTCG(» 

TGTCC«XmGAGCTGGCACACGCCCGTAC(X^ 

CAGCAGGTGGCGGCAGGCGCGCCGCTGATG^ 
TGTCCGATTTGTACGGCAAACAAATTGTTO 

AAAACGCCGGGTGCC^TOCCGGAAATGGA^ 
SEQ ID 4698

MKSGNSEPKLHETHTDETKI^ 
NRR^^TSQAGAQVALRRADMRAQDDliRRRSALAE 
QQVMGAPLMAVVPLSDV^^ 
KTPGAALP^ESTDWSEVDRTVDEILGQSAP 
SEQ ID 4699 



t 



TTGTCTTAACGCGCCCCGTCAGCATCCGCCA^ 
GAACGGCAAAATCCGCmMCGOX^T^ 

SEQ ID 4700 

LRIGEIQIIERDGEFKWDGSVLP^ 

SEQ ID 4701 

OTAAAAAMraCACTGCACAGCCCCGCTG 

SEQ ID 4702 

VKJOTIiiSPAADIECGNGBYIRITSTLERE 

SEQ ID 4703 

ATTTTTTTCACGCCGGGGTGTCTMCXXIGC^ 
T 

SEQ ID 4704 

LQGAVVRGDGAVHLFALERGRDSBVFAVAAF^^ 

SEQ ID 4705

TTGACCGCCGCGCGCGMTACCT<^CCGC 
CCGCCAATATTCAAGCCGAACGGCAGACCGCCTGCGAAG^ 
GGCG

SEQ ID 4706 

LTAAREYliPAETRRPLCDN^ 

SEQ ID 4707 

ATGGGAAACAGC(X&TTAA<X!CAACT^^ 

AMCCGTCGCCCAAACCCTCGAATTAGMTCAGAACCOT 

GGCATACGCAmGGaGCGACCTCGACCACATC^ 

GAC(XATTCCGCGCCCGCGTCmGCCCATC^ 

CGCCGCCX3GCACGGTGGAGGTTTACATCAAMCC^ 

SEQ ID 4708 

MGNSRI^QLPAPAAIEETDFBGIFARKKAALT^ 
D&FW^VQAHPEKYAAGPRTAYEAHAIEAPPKSHT 
SEQ ID 4709 

ATGGAATTAAAAGCCGTTACMGCCTGACCATCGACAC^ 
ACGtmAGGCGGCAGCXW^^ 
SEQ ID 4710 
MEIiKAVTSLTIOTPQfrTIT^^ 

SEQ ID 4711 

TTGGAAGCCTGCCGGAAACGGCAAAAGCCGCCCCGTO 
AWmWACCGAaXOXE^ 



OTACAACCACAACAGQX5CGCGATGGAATTAAAAGCCGTT 

CTGAOTACCAAAACGGCATGMCGGCCAAG<XXj^ 

AGACAttCTC&XXAC^G^ 

CAOXXXmTCXKXX^TCC^CCM 

TMCCCTAGACCTM(XCTTGCCGA^ 

SEQ ID 4712 

LEACRKRQRPPRSMTGRFPTSRRDRNI^ 

LQPQQRRDGDCSRYKPDHRHPPNHHKRTPDRQPNHHRPRA^ 

HAAIALIHQAAVTALARWKPRITVRHOAOTADI^ 

SEQ ID 4713 

TTGATCAAGTCX^OGCTAAAACACAT^ 
TTAACTTTTGGCGCGGCGGACGGGGCia^
SEQ ID 4718



SEQ ID 4720 



fcCTKA' 
SEQ ID 4722 



TCGCATATTTITOrPGGCT^ 
SHIPCXSCmSUISTPIPQDP 

===^ 

CAC^CGTCX^OWX?EAAAAA 
SEQ ID 4727



(^^GCOSGGCAiaCAAAAiAATACGGTCTOrACCifiAT 

S ^SLigqdgasrfi^^ 

SEQ ID 4736 

mJEAAWAHSYINElYFQFCHMECFIiNIASKII/r
-AIJOQMQOCGOCGCCCAAAftCACITGCAMM



uoc 

SEQ ID 4738

SEQ ID 4739 



GGCGACCGCOHTCACTCGCCTACCTOACCG 
CGCCCACAACOXMCCGACKA^^ 



HAAHNKI^RQIBAAKAKSQSKLAIIGOGLYliLIilPNG 
GCTCGATAAAAATGCCACTGGCTTTCTCCreCCCCMC^^ 

ATccGGOGCGmcGaiaronmccT^ 

(SAMOaCACtKlTOTOAT^^ 



AACGGCTTC 
SEQ ID 4742 



IRQRFmmiHSVSFEMJFM)^^ 

NGF I 

TTACTCTCCTOGAOTnATGAAAAAGWAGGCTTAACCGT^TO 

SEQ ID 4744 

Vvra.THKIFSSVKFASGEAAKIjIiSIiSVF^ 

^ATGAAMAMOTATMTrTATAT^^ 
<7KaTATMCGAACmAACT<^^ 
GAAAGKiAAAGTTCTACCTCAATATCIATTWftTlCAffl 

S ^^^TmFFIFEKI^SKISFDYllKKE^ 

^rreGCTSGMHMTTHGMACTB^ 
CAAATATMTCATTTACTB^TM'EACCAGTACCSjAOT 

SEQ ID 4748 

LAGLILKLIMtKFYQAEBHSKGHYDIVRKHnniKHITKI^^ 



^TTOCAMTMjlfflOTAAAAIAAATra^^ 

axxacGAAGoc^Aicsoaxxmso^ 

SEQ ID 4752

MAOTAAaWPSEENYTNTAOTTIAAPHFEHLPH 

S J^aTAM^ 

CCGIAAGCCG0CATm3A»GC(am^ 
C<aCAAACAGTCCCirauaS3^ 
ScmW3Tffl«^CAC(OTl^^ 
AOGTCTTGCGTITOGTGCATATTGTTGAGOGTAACCAC^^ 

mTCAGOaCTCaXXSA^
ACCMTG03CCCAAC<m^^

A(XATOXSACATMTTTTGCTGCCGA03AAAAA^ 

AMOiGAATCATCAGCGGCACGACGGCGATGTCCTGCM^ 

GACGACATCGCCAACXCGCCCC&TACGGCAAAaXX^^ 

CCAGCXKCCTCATCGCCCTaGCTT^^ 

GAAAATOX3TTCAWKACC(?KXBAACAGGAAAATACAGAA 
CTGATAAAATACGCCCTTITGACCCGCACCACCGCAC 

FFGVRHGFFETHIGKRFHIIVIGGASrfGHIGlffiGQinV*HVIiRFTO 



♦KLQTDAEHQEBNPDFAQIIRElffiFRHQAB 

S toiKtSc(x^aaaatatcccot 

ATTTGAAAGGGCGGCCGGTGGTCGTCGCGTG(S5AGGGCGCGCGTI^^ 

GCTGTGT0CGCAAGC8STCTATCTGOT 

GCCTATCTTttCGTTA^ 

CAAGGTAACGCTGAAAAAAATGCAGTCQCTOGGTATGCGGAOGGCQGGCGATTroC^^ 

GCGCGCGCTACGGAOSAACACCCCGTCAAAG^^ . 
CCGAAGACCTGTGaxmAATOiC^^ 

CAGGATTTGT600CG 

S S^JSsAEPKSI^IHIimFyASVEI^PHIJ(GRPV^ 
AYLDVTRNFKMPYAGEVAKEIRAAIFAETG^^ 
ARGTDEMPVXAERERLQISTEITIiPEDLPI 



GTGCGGCAOTXA(?KXXa<XjAAAGCGGCGCAG 

TTGCCGGCGCACGCAQCGGCAAAACO^GTGATTACGCAAAAAATCAAGCAllTC 

HSMJflGCAHiAGOGOGSCGCCAAAJ!^^ 

AAAAAAMCnCKXATOICWCTmCCG 

ATITAAAAACOCCTGAAQATGTCGTTCAQACGGCCTCTMTGT^^ 

CTTAATC03CCKm«^^ 

(mTGAAACTCTTGAOrnXKOSAAimTC 

CGCAGATGMGGTCATCAAACTGGAGCAAAACTACO^^ 

jCTGCOGIOTMACOGHXsCAAKaTCACG^^ 

ACGTGTTGTOTATGIWXMKTGCTTOTCAAIXCWACGA 

GCAOGAGCACGAA^K^AimGCCa 

AMOXX3AAATCGAC6AAGCGGGCaffITOTCMCAaX^^ 

GCGAGTTGGTATCereGTOOACGAAAAGGCG^ 

CGTCTCGCTATCCACGCI^OKX^^ 

AAA? 

CGGCCCGGCGGAT 
SEQ ID 4758



uooj/tcak^avgi^ 



ERHlMYVGITIUtftRQI/ILTHCVKRKKQG 

S ATCCACACCATCCI^^ 
TCCCCnCGAIATOJ^GTTCC^^ 
gjTCMftMO^XXSA^^ 

XGGAGGACTGCQGOXXjGACATCTGCAAMTTGCGGTGATGOCGCA^ 

GATGTCGATGGG6CAGA0GGGTG0GGTCAGCC<XX3TGC^ 

CGTGCGGCACTCGACTGCCTCGAMGCGX3XAGAC
SEQ ID 4760

MHTSLTVTOPIVIGSGRTKIA^ 

ETAVRRAVANAQKlffilAAIiLCNHEPHR^ 

RAALDCLESGAD 

SEQ ID 4761 

ATGACCGCCACOXXXX^^ 

GACTGCATCCATj 
ACGAACGCAAACAGCGCGAAGCC^CGAACG^^ 

CGC(X}CGCCCAACGCGATATGAMTACGGCAGCGACAGCGAAAM 
SEQ ID 4762 

HTATAMLDRLIi^^ 

DCIHMQPVADTVLPR^ 

RRAQRIMYGSDSEKAAALEYLRQYKAKQEAAQKTAS 
SEQ ID 4763 

ATGTCCCQCAACAACGAMAGCT^ 

ACTATTTCGTCAAATTCGTCTGCCCC^ 

CCTGAAACTCTACCTOTCAGCTTC^^ 

TTCACACCGCGCGGCGGCATCXXXXST^^ 

SEQ ID 4764 

MSRNNEBLQGISLLGNQKTQY^ 
FTPRGGIAVHPFAHYGKAGTBFEALARKRLFSIDAQ 

SEQ ID 4765 
ATGACCGACOGATTCATATTCCGCGCCGC^ 
CGCAAATCGAAAAMGTTTCGGCAAAGGCGCCAT^^ 
CGGOCTGCCGflXXXXXECATCGTCGAAATC 
(XXXJAACACGCCTTC^TCCCGTO 
CGGGCGGCATAGATATOTAGTCGTCGATT^ 
GCGCAAACTGACCGGACACATCAAAAAAA(XM^ 
AAATTCTATTCTTCCXjTOGCCTO 
GTCAGGC&GAGTTOAOTCCTCT^^ 
GAAAATCGGTCAGGGCAAAGACAACGTCO^^ 
ACGCAGGACGAAACCXjACGGCGAACGACCCGAAGAA 

SEQ ID 4766 
MTDRFIFRAAAKVAPQPTYRICSGTO 
AEHAFDPVYARKLC^KVEELYLSQPDTGEQALEI 
KFYSSVRLDIWtTGSIKKGEEVL^^ 
TQDETDGERPEE 

SEQ ID 4767 

TTGCCTCGCCTTGC(XrPACTATTOT 

CTTCGCAGCAGMGTACGjTTTATCXMGATTT^ 

TTMTCGATTTCGTCGGm 

GTTTTTCACGCCGATCTCGATCAATTW^ 

TOTTGCOCAATACTTCITO 

CGCCGATCTTCATTCGGATTTGGTTGATGAACACAAC^ 

COCCATATCGOTTCGAOTC^^ 

TCGGGCTGGGACAGGTAMGCTCTTCGACITITACGCC^ 

CGGCTTCGAGGCAGAGTGTCGT^^ 
SEQ ID 4768 

IJ>RLAVLFVLSAALM^ 

VFHAOTDQFAFPTDAPSVEDVKm 

PHIAFDPGFGDBCGyGIHJYHnAARTDERVADFQSLFAG 
SEQ ID 4769 

SEQ ID 4770 

IJJHnmRGEKYSCysmARFASILRRLYG 

SEQ ID 4771 

GTGCAMGTATTGCACATAAAAGGGCGCAGGCTTGITCCOT 
TAGTGGAT 
SEQ ID 4772 

VQSIAHKRAQASSVKSAQIHIKAGVGRAVATWFTPSGKNI^ 

SEQ ID 4773 

(?n£AATACTTTGCACTTGCT^ 

SEQ ID 4774 

VQYFALAEQNSNDPLYQMQKICRHSREGGNPDLSARKFIB
TTCAGATTCTACGTTCTAGATTCCCGC^^



CTTGGTCAAAA'rtfiUiHiuAv-v. ~"~ nim ,„ r , mvwYvywYaBMtHflTrTWTBC 

ac0Q0CCGflG ^^ 



AAA' 



ffCAC&ATGACCTGC<XXXaCGAS^^ 



KSARGKAEETANK'HGAFTQDLPPGVGDPFR 



^^^^^^^^^ 

AAAGCraTTCAACGCCATTC 



AACCCCCGCGATGCAGGACAAGCGTGTOCAAGCCGA^ 
GMTCACTGGAATTGQC66CAAACCGQCCA 



ESLELAANRP 



CGGC«?ITCAT<a^CT^ 



M(KCTGCATGiAO^TCGCOK3CATTGTCGGCGATGMTTO 

GACAGCX^TTGCTCATCGATAffGCACGCCGCTGC^ 
TOCOmCCCACGAAGAATC^ 
GTTGGGCAAATCCGATGTCGTCTCCCT^^ 

GGCTCMTCCOXXX^^ 
OTTGAAAGAATTGGACACACTGTTCTTGC^^ 






COTGJOTTTGCTGTCTATCG^ 
TGGC6(XTCCGCCGGTATCCATACCGTC^ 
AiCGGC^CXmGCTCAACACCGCC^ 
GIMXXEOOGCCTGCAM 

CAAaQGAAAAACCCGAAAAA^ 

ACCCATCGCCGMGTGGGCAAACTCTPCXH^ 
MXXEAGGTCATCOGATTM^ 
TMTCAC(XX3TCCCCTCTTTC 
COKX?EACC<^TGCCGA 

NGTPKUCTAmVGTAL^ 
qqEKPEKIAIFGKMNM^ 
YAGHPINTMYFWiSGIIiSAFL^^ 

TCGATOEATTOmTC^ 
TTT(?TO1^AAT<X^ 

FVVECFENFRRVAGRIIOTilTQKRnAM 

AAlCa^GCAAAAAAAOTTGAATCGGCAAAAff ji 
SEQ ID 4788 



GGCG 

SEQ ID 4790 

mwmS&GSVLVIGIRPFRHFCRFKVPI^ 

S aSac^ 

AAATGGCX^TTCCGCGCGAACAGGCG6CGGTTTT(^^ 
GCKXXXXSAACACOXXX^ 

TTCCTGACATTGGATGATTTGATTCCACAQGGTGAAGCCGGA^ 
MGGCGGC^TCATCGCCATCGACMTATTTTGTT^^ 
TACGCGGATTGTCCXX^TCACCCT^^ 
SEQ ID 4792 

U/TUMMiIAQGE&OSTOI^DMKPE^WEBCIiKLVRQW 

SEQ ID 4793

ATCAAGACCAAAmOXOTmAT^^ 

raTOCCTATCCCGTraXACTCrG^ 

AATACACCCTTCCGGCAGGACATAO^^ 

TACAAlOGaaCTCAAACACTATCAAAAC^^ 

ftAAOMGGCGCGTATOEG^^ 

OGGOTCAKaAAAAGACATOSC^^ 

SEQ ID 4794

^KLDIMLlCSYLOTEGGSASAHIVETAOHli



SEQ ID 4795 

CflXX3GATTTQAIGTTO 

CGTTTTGGGC^^ 

AATGCCGCATTGTTCGCXlAlircTC^ 

NAAI^AI9fliANENPSLAQKLADFW«
SEQ ID 4797

TTGACTATATTTGm^ 

ATTCTTCCGGCAATG*^ 
SEQ ID 4798 

LTIFVISAASSPCPDLNLIHYIFQKI^Q 
SEQ ID 4799 

TTGTACGGCAGCCGCCGTTTGCCATGCGCC^^ 

GATTTTCCGACTGCCMCGCGTTTGTCC^ 

CCGGCGCTTATCTITGGTGTCCGCACAAGTC^^ 

TTGGCTCAAAAATCGGTGATTATTTCAGATMTK 

CCAACCCGATTC&GAMGTGTGGGCGAATATTAAGCGCT 
SEQ ID 4800 

LYGSRRLPCAQTDADGAQKKTAAYRQQDPAKVTHYLTRPAGPSIX^RVCro 

LAQKSVI ISDNARFRRMGALRGTAEKLGSKVLPPATC 
SEQ ID 4801 

CACTCTTTTACCCTAAAATCCGCACGOT 
ACOBGTCECGGCGC^ 
AAAATCATGCCGGACGGTATTGTTTATGCC^ 
ACATTAAOTGCATTGGGAAOTTTGGAATCAGGCAAAAC^ 

CAC^TCCCGOBGCTAW^^
SEQ ID 4802 

MITHCKLKKEVQKEPLRSFVPEVTARSAADIIX3IHPDSAA 

KIMPIXjIVYADSPGSRGKSDAGGFTRCIUN^^ 
SEQ ID 4803 

TTGAACAAACAATACAAACACATCCGAAAAACGG^ 
SEQ ID 4804 

UJKQYKHIRKTGTTOQIVNILNQNTQAIQPPI^ 
SEQ ID 4805 

GTCCG(XX3TATGGAMTACTCGT(^GOT 

AAACCTTCAAMC«^GCTCACGACCAA^ 

GCAGGCGTTCGCGGATCAMTCGACAGCCGGCAT

SEQ ID 4806

VRRMEILVRGSRPDAERVNHAA*K*IFEQRTO 
SEQ ID 4807 

TraATCTATTOGTCGCTAGCATGAT^ 

CCTGCTGCAAGCCTAAAATCGGGCAGGTTATTGTT ^ 

GGCGCAGC^GGAMTGTCGGACTACMCGGTACGGGTTTTT^^ 

AMGTGCCTGACAACTATMGATAITGTTTCT^ 

GGAGCCGTATCGCCTATGAACAGATGAGCCGWX^CC^ 

ACCCCaTTOXmT^ 

CTGT^GTGACrTTO^TGTTGCCGATO 

ACX^ATTCCCGATGTGTTCAACTACCGTTCT^ 

CGGTGTGAAAAAAATTGMGCXSGTCMTCGGCTGAAGGC^ 

<?rc(nmau^Acc^^ 

ATGCGATGCCXK^TTGAAGGCGTGCGGGOT 
SEQ ID 4808 

LYLFVASMiniPVl^nDIISRVRAAWNRA^ 

KVPDNYnLFLQGGATTQFTiMAAMNIJ^ 

LSREFDVADYGIJYAGAQKNIGPAGVTVVIVRHDLL^ 

WFKTASEDLDRRFVLEAEI^ 
SEQ ID 4809 

ATGGCATACTCTGCGGAOTAAGAAACAAAGOT 

ATCACCTTAGGGAAITCGTTCCCTTT^ 

TCAAGAAACACGCTTTAOCPGCGCa 
SEQ ID 4810 

MAYSADLMKALNHSGLTKIRTRWUUUX 

SEQ ID 4811

TTGOTTTTAACAAAAAACAGATGCCGTCTGAACTGGTTM 

SEQ ID 4812 

IiFFNKKQMPSBLVKVQAAFSYGCAFYSIFN 
SEQ ID 4813 

WOTTCAAATTOTAT^ 

AAATAGTACAACACTCGACGTTGAAGGTT 
SEQ ID 4814 

LFEIVIVYCtfLTKNRCRLNWl^
SEQ ID 4815

TTGCTCCCT(£CGTCA^ 
MTTTCXAGACCGTCGGAACCACATTAAC 
(mTTTOSAOTAACTTCG^ 
SEQ ID 4816

UiPAVKKKIMPDGmADSPGSRGKLDAGG 
SEQ ID 4817

ACACATCATTAACMCACCCGAAGG(^CTACAm 

CGG(X5GCATCATTACGOTGCCMTCAATCCTATGCAGACAMCm 

CM(X3AGGAAATTCTCAATCTGATT^^ 

TTTCCGACCOCTACAAACXOGT6CCGCXX3TTGTr 

TAGCGGMCTGCCAAMCCACAGGGGCGGGGGMTTGCAATO 

ACGCCAAGCGCACCGGCCTACGCACTCGTTCCACGT^^ 

GCMTTATCGGCAGGATGCGGMC&AACAGA^ 

SEQ ID 4818

MRFPLPITNAVLIOTHGAMEFKSM 

QGGNSQSDLLEITGDASGKTTVISVGK^ 

TPSAPAYALVPRQNLESGYAMLDTI^ 

SEQ ID 4819 

TOGGGGACGTTGTGCC(ffiATGCXX^ 

GCGATGATTTTGTTGCTGTGTCCTGCGGCGAGAT

SEQ ID 4820 

LGDVVPIXSVNGGLHPAAEVEFAEQVL^ 

SEQ ID 4821

ATGACCATCAAAATCATOT^ 

TCAAAATGATCAGTCGGCTGCAACXXX^ 

GCTGACCGTTTCCGAAGACAGCGACGAJXTTACCG^ 

GGCGATAATGTATTCTCGCCCGAGATGACCXKXIAi^^ 

GCTATCTCGCCGCAGGACACAGCAACAAAATCATCGCXXX^^ 

GGCCGCCGTTTACGCCATCXX5GCACAACGTCC^
SEQ ID 4822

MTIHILIDDHTLFRSGIKALLSR^ 
(ENVFSPEIjTAKLVKSI^^ 

SEQ ID 4823 

ATGGGCATCAAAGTCGCCATCAACGGCTACGGACGCATCGG^^ 
TTGAAACCAACGCXX^TCTGACCAMTTC^ 

cAAaxnxxxmACTGcasawi^ 

ATTTCCGCACCGGGCG^ 
CQ6TT(mAAGTGTTCA6CmAGa 
(^GCGGCGTGGAAAACATGATO^ 
TCCTTGCTAGATTTGAGCTTC^ 
QTTGGTTTCCATGGACTTCMCCACACCAC 
(^GCKmAATGCTGAAC^CC^ 
SEQ ID 4824 
K6IKVAIN6Y6RIGRQVI£AIYDyQIQIX}LQI^ 
ISAPGGDDVDATVVYGVNDSVLTADMWSN 
SLVDLSFQAARDTTVEEIN^^ 

SEQ ID 4825 

GCCTGGAAAATGCCGCCTCGGTOTCGMGAGGCGGGCAA 

ATTTGAAAAMGOTAAMCGCATTTCCC^ 

ATCCTCCCCCCGTTGCAGGCATACCGCC^ 

TCAGGCGGTTTCAATGGGTCATTATGC^ 

(XSGTCAGCGGCATTTOGACATCX^^ 

CAOTTGCCGAACAAA 

ACCACATCCTGCCCGCCGTCGG 

TTGGGCGTATCGCTCGCCGGGGCGAMCAG^ 
TOAACCTGCAGCTACAGATGCTCGAAAC^ 

TTCCT<mACA<a<^CG^^ 

GCTTTACCATGACCATCCAAGACAACGGACMGGTTT^ 

TOAGAAAT(X»TTCCaAGC^ 

SEQ ID 4826 

mPTRPSDGIPLSLRLKLLTGLWVGLAALSVVL 

ILPPLQAYRRPTQIELYRPAGNIELFLQALENAGEKNTi^ 

QVAEQTHNLEKQlffiNLTLLY^^
I£VSIAGAKQEEEKRIMVI^
FIiPTQDEQLQ^FILQESIiSN^ 
SEQ ID 4827

TOGCAGTATCATATTCGGAGGAAAGTOT 
GTGTTAAATCACGTTATGATAAAATK^AATAACA 

SEQ ID 4828 

IAVSYSEESRFKRCKQYHSLELOTAQKGQVKBHLS 

SEQ ID 4829 

ATGAAAAOCTCAAAOTTCCCAAAA 

GACCGCT(&TTGAC^ 

GttGTGGCAGCATTTOGGCCCGCTTTCG^ 

GTCGCGCGCTTTGAAAGOGTCK^^ 

TTCCTTATCTGTTTTC6GGTATGAAAA(^TTAA 

GGAGGGA 

SEQ ID 4830 

MKTSNFPKSPPAIJCTPALIIjAGGLADRMGGEDKGIJ^ 

varfesvskrtplcnafyvetpvm™ 

SEQ ID 4831 

GTGCAAGACX!6CC6ACCGGCGCGGAAAGGAAAGGTCAA(^^ 
TA(^TTOCACCATTTTTACCGACACAGGGAAACAG^ 
CGAGCTTCCGCMGAGGCGGCAGAAMCCTGCAACA^ 
ACTTOK^AAAACCMTTTG^ 

CATAGGCAAAGTC(^GACAGCCGTTTCAATG^ 
(^GGCAATCGTCGACACCGTCGAACGGCT^ 

SEQ ID 4832 

VQDRRPARKGKVKAYRAWARTGGTMPS^ 

TIAKTNLGLLKKjDCVNIE^ 

QAIVmVERIMAQWOK 

SEQ ID 4833

ATGCAAAGTGAGGTAGAGATGGAATTGAACGAATTOT 

TCCGCTGGCAGAGTGCGGGCAAAAAAGGTTTTTTGGAA^ 

ACAATTOATTGCCGGCAGACCT<Km 

GAAGT(X3ATAAAAGCGATTTGATCGGCCTGC(X^^ 

AGGCATTGAAMCGGCATTGGATGGCGGTCTGTCCXI^ 

AGGCGTGCGCGGAGAAATC(^TCAAAAAGAGGCCGTGGMGAAAAAGTA 

CGAAGCTGGTTGGAAGATTTTGATGTGCCATATGACGM 

GGGCGGGCAGGCTGCCGGAGCAACGTGCGCCG 

SEQ ID 4834 

MQSEVEHELNEFLDKAYAVLR^^ 

EVDKSDIJGIiPYLIiMjLREI^ 

RSWLRDFDVFH)EAAOTAALQ^ 

SEQ ID 4835 

ATGATTAAMTCAAAAAAGGTCTAAATCTGCCC^ 

OTCGATGAAAATCMGGAAGGTGMGCCGTCAAAAAAGGCCM 

TGGCGAAAAGCGCGTACTTCAGTCAGTO 

CTGATTCAATCAGGCTTATGGACTO 

ACCCTACGGTCATCATCAAAGAAGCCGCCGA^ 

AAATGCTGCCMTATCGAAACACATGAATTTGGCGGCCC^ 

CAAGACGTGATTGCTATCGGA(X3TTT^^ 

AGGTGTCTCAACTTACCGCCGGCGAATTGGT^^ 

GATTTCCGTTATCGAAGAAGGCCGCAGCAAAGAGCTGT^^ 

TTCACGACAGCCGTCAACGGCGGCGACCX30^^ 

ACAGCGCGCAGGOTTGGGTTGCTTGGAATTGGAC^^ 

GAAGGAAGGC 

SEQ ID 4836 

MimKGIJ^IAGRPEQVIYIXJPAITEV^ 

QDmiGRLFVTGRIOT^WAI/MLQVW 
FTTAVNGGDRAKVPIGlTEKVHPIiD 

SEQ ID 4837 

ATCTACGCaTTGACCGCCXKAtttmCAGW 
ACAO^OT(X3GGCG(»mTCCT^ 
GCTTTCATACGTCTITOrGTTTTGTO 
GGACAMTCCTTGATATTTTCGTATO 
CCTTTTACGCAAGCAGCGATGAATTTAT 

ATCCG
SEQ ID 4838

GQIUJIFVFDKLIlRliKAWWIAPA 

SEQ ID 4839 

ACAMGGMCAAACCATGGGCATCAAAGTOGCCATCA^^ 

ACGCCAGCGGCAGCCTTGAAACCAACGCCCATCT^^ 

OTCTTCTCGACCCGC^ 

GCC&AAAAAGTCCTCATTTOXXACCGGGCGGC^ 

CCAACTCOTCTC^^ 

CCGACCGTCAACGXATCCTTGOTAGATT^ 
OTACAACACCCTGCCATCGGTTC^^ 
CAACGAATGGGGCTTCAGCTGCCAAATGCT^ 
SEQ ID 4840 

TREQT^IKVAIKGYGRIGKQVLRAIYDYQIQDQIj^ 

AKXVLISAPGGDDVBATVVYGVM)SV^^ 

PTVNVSLVDLSFQAARD?NEEINAI21KAVSEA 

SEQ ID 4841 

GTGTTGGATTTCCAATTCGTC 
CGGCGtCTGTC^ 

C<Xl^GTrcGGTTGAAMTCGTTCGGGGGGATA 

SEQ ID 4842 

VLDFQFVHRICIHAGVVAAFDIIARLVFIKNIRRN^ 

SEQ ID 4843 

GTGGCGGCGTGGGAAAGCTGGCAGGAAAGCAAAGCCATAOT 
CAATATACTTGATTGCCGTG 
SEQ ID 4844 

VAAWESWQESKAI IJCKIMGIiAKRQGAGSGTAFPLIIUTRTIYLIAV 

SEQ ID 4845 

ATGGCGGGCGACAGGGCGCGTTTGAAGGTTATGCATTC^ 
GCC<mCCACTCCA(XaT^ 

SEQ ID 4846

MAGDRARLKVMHSEHSRRRSVVEIISSOT 

SEQ ID 4847 

ATGGTATTATGGCAGTTAACTTTTGTTMT^ 

TGTCGGCAGACTTTGCCCGCCTCGGAGAAGAGGTGG 

GGTCTGCGCGGCGTTGAAGCOTATGCM 

CCCGAAGCGAGCCGCCATATCGACCG(^GCTT6AGCOT 

TGGATATOGTTTTGCTGATGrcGG^ 

CAGACGTATTGCCATTGAACmACGGCGra 

GCGGTAATCXKTGCGATGAGGGCGGMCTGGAAAM^ 

SEQ ID 4848 

MVLWQLTFVNNAVSRLSRVESAHKGI 

PEASRHIDRSLSIiIW^^ 

AVIAAKRAELEKAARAEP 

SEQ ID 4849 

ATGCCGTCTGAAACCGAACCGGGTTTCATACGGCATT^^ 

TAACTGTTTCCGTGGCAAACGG? 
SEQ ID 4850 

MPSETEPGFmCYQTFAPAPVSSI^^ 
SEQ ID 4851 

ATGCTGCAAACCGACAAOCT(^^ 

TCGGGCAGCATAMGCC^AAGAGCAGCTCGCCATCTO 

CCACATCATCGCCAAAGAGTTGGGCGTAAAT^^ 

GACGAMTCCACCGGCTAAGCCCCGTCGTCGAGGA^ 

CGCCCTTTACCCT^^ 

CAGCCOTCGGCACAATOTTGCAGC 

GCCGACGTGAAAAACMCGGCGTAATCGA^ 

ACAAATTCGGC(^(XXXX^ 

CCCGCGCGGCAGGATGGCXjACCGAACGCGOT 

SEQ ID 4852 

MLQTDNLTAAQl^RIVAAQTASAQEE^^ 
DEIHRIjSPWEEILYPALEDYRI^^ 
ADVKMJVIDAAVADAALSMLDV^ 
SEQ ID 4853 
ATGACCCTGAAAACCGATTTATT(X^ 
AANCAATlTCGAC(n i TGTTCAG6C6CA60CA
SEQ ID 4854

IfiTiKTDLLPKINNEDYQRIjILKH 

SEQ ID 4855 

ATCTTTO&AGTTTCAGAD^ 
CTATITGTACTGTCT^GGCTTCGCC^^ 

SEQ ID 4856 

KFSSFSRHFSFLTAILPGCRHFQnSGLNIMYGVASPC^ 

SEQ ID 4857 

ATCGCXTTTftAAflCAT^^ 
C(XXK^GCG«X3CACGTCCG(Xa 
TGCCTTMCGCCCGATTTGCT6CAAC^ 
GCGATTTACTTOCTGCC^TXTACGCGACCGTAT 1 
TTGCCTTAATCGTTpCGCCCACGCTGCC^^ 
GGCAGGCCGCGCCTTOTGTTCTTCGCC^^ 
GACGGCX^TGAAAAACGCCGTAACCGGTCAMOC^ 
TCGT(?ITr6CCCGCATCGCT^^ 
GTACTGGCACTTGGTGGTCGGCGGCTTCGC^ 
TGCGTGTTMT(X)GCGTGGCCMTCCGGOT 
6AAAGGCGCGCA6CAATQGC 

SEQ ID 4858 

MGI^HFLmEFHFLPGGKHEKW^^ 

AlYFLPmTOIVGGF^ 

DGLKNAVTGQTITWMDAFIGKLPGSIGEVST 

SEQ ID 4859 

TTOTTCA(XX3GAACCGATT^ 
TAGGGTTTGCGTGTTTCGGGCAGGCAAACCG^ 

SEQ ID 4860 

LFSGTDmiXtfAGFPLANGWNQM 

SEQ ID 4861 

TTGCA(X£GGG<^TT^^ 

AGCAGGGCGAGCCAGTGCGCGCGTTGCGGGTAGGGCGTGTGTTCGC^^ 
CGOTTTTTTTCAATACGT^ 

AGGCTCAAA<XCATATCX3GCGGCX3CGTO 

ASACTTOAGTGCGCCGCATTCGCGCAACATTTO 

TTCCACCATCTGCCCKIATCAGCT^ 

CGCAAAATGQ^GCCGCCAAATCCCGTTGOCOGC^^ 

OmrmCTCGGCGTGGAA^ 

SEQ ID 4862 

LQRGDLRQPRPVDIiFGDKVRFSLRAAFGGDFMIM 
LIDFAHGFYAVDVVSVVARQDVGRCFGFAQWQQGGIA^ 
FHHLPHQLYGFFGDFKAVTRGKAGGTKD^ 
SEQ ID 4863 

CGGTCGGCAMGATTTTCCTGTATTCCTO 

CGTTACGCTGGAACAGGACTTGATGCXXXX^ 

CGCCACGTCTOXXXXm^ 

AAAACGGCGAAGCGGACGCATTGGTTGCCGAAC^ 

CTTGCTGCCCGAAGTCGACGCXXTCTTC^ 

CTGCttGAAC^TOXX^^ 

AACGGCTGCGCGCGCCGAGGCAGTGTGCCGAGCT^^ 

CTCGAAGCCGCmTCAGGCGG^ 
AAAAAACGTTTCGGGGCGCG 

SEQ ID 4864 

MQTYLVGGAVRDYLLGLPVKDRDWVTO 
RHVSPAFAEDPVmRAARFAARYG^^ 

LEAANQAOSGKIAAECRAQGKAHPIAEQIDRARIiAQI^ 

SEQ ID 4865 

TTGGAATGCCGTGAAGGAOTATCATTTGGA^ 

MTCCGAAAATTOAAAAATTTAATCIOT 

ATTCGGCGAWjGCAACCGCTACXJGCP^ 

GTGCGTTTGGGTAAATTGCGCCAAGACC^GTT^ 

CmTCCCGAAGCC^raSAAGACGACT 

TTCCGAAGAOTGCACAATGCGAAAATGGCG^ 

MC™GACA^TACX^TCCGCT(mG^
TGGACTACAACACCATCACTATTTATCCG^

MCCX^ATATCCCTTTTGAAGC^ 
AAGCA(£CCAA(XOT£GC<^TOT^ 
CATCAAATGGGCGGAAAAAGCCAGTGTCGGCTOT 
CAAGTCATCCGAGGCACTTTGGGGCGTT^^ 

SEQ ID 4866 

LKREGLSFGNIAPEIIXPIFY^^ 
VRW^QDQFAYEDMRVIiI^^ 

NPYIRPEADEKAKLHRQAVEVEK^ 
QVIRGTLGRLLFGSNDV^^ 

SEQ ID 4867

ATGATMTCCTTCACGGCATTCCAMTTGCG^ 
AAATCTGCTCGTGGC«^^ 
C^C!KaTGTCXX3AAA3^ 
AGACAG6GA 

SEQ ID 4868

MIILHC^PNCDTVKKAKNRLAGYGLEFGPRDF^ 
RQG 

SEQ ID 4869 

ATGTTCQilPrCCATCGAAAAATACAGAACAC^ 

MGTGGGCGACGAAAAAATCAGCGAGCACTCAATCAA^^ 

GAAACAGGGCGCGAAGCTGATGGGCATTTCGG 

TACCTGTCGCAACGCCATATGTCre 

MCAGCTGATCAGGCTGAC(£AGGTC^ 

CAAAAAAGACTATCTGCTGCCGCAGGCG(?rC^ 

GCGACGATGCGTTCMTCATCCCTC^ 

AAAOTAATCAATaCGTATTCAGCGACGA^ 

MCCIACTGTTTGMGMGCCAAAGATGCGGTC 

TTGACGTGAAATGGTCGGAAGTGTCCGTTTTGGG^ 

aGACTCACCGGTCTGCCGGC^ 

ACTTTCGACCTGCTGATCCGCTATTT^^ 

SEQ ID 4870 

HFHSIEmTPAQVLWLIALTFTO 

YLSQRHMSEDQFVEEIRDQFAIiQ^ 

VARLPAHEAKPSPEQEKMVENELKMKKA\^ 

IfliLFEEAKDAVRQAYIR!^^ 

TFDItLIRYFNGKIKQTKGAQSVDNQDGQ 

SEQ ID 4871

ttccCACCCACTTC^^
SEQ ID 4872 

LQTIjPERINPDSHPLRHSRESGNPVRWGSVISDKLLPHCISRFPLSRK 

SEQ ID 4873 

AAGCCCOTGCACGCCACTTCGCGC^^ 

TGTTCCGTATCCTGCTGGCAGTCTAra 

ACACCCTGCCGATCTGAACGCCGTAAAAGCCGC^^ 

CT(^CTCGGGCGCAATCCMGTCC^ 

TTCGCCAGCGAGCGCGGMCGTGCAGAOTTCGGGAACA^ 

O^GCWSOCBCXXm^TCCAAA 

SEQ ID 4874 

KPIARHFAPPDFLFQEHGVQilPQmiajra 
LKLGRNPSPILIW^RCRTM^ 

SEQ ID 4875 

ATGATGAATTCCAAAAAACAiSGATGCATTCCAA^ 

AAGAGGCTGMGAGCTGGTGGATGCCTTACAGACGGCATTGAGGCAGG^ 

CKMSGCQGCGGCTGATGGTCGTGCATAT^^ 

GTATCCGCCATCGCATTGGACAGG^ 

OrnOAAAACCTGCCmCM 

AGAAGGCGGGGCG 

SEQ ID 4876 

^SKKQDAFQRLIGALKVLPWGPKSAQm 

VSAlAl£RIiAQRLD(&RIE&^
SEQ ID 4877

TTC^TCTGCAAAGTGCCCC^^ 

SEQ ID 4878 

I^SAPAADRfO)A™AAAG^NAARCRRSG 

SEQ ID 4879 

TTGCACAACGCGCCGAACTAC^ 
GCCTATGOCAAAGAAATCAAAKTOGATG^ 

SEQ ID 4880 

SEQ ID 4881 

GTGCAAGTTTTC^^ 

SEQ ID 4882 

VQVPRPVFPNLPHTASIjFTQPAVimiPGF7(^ 

SEQ ID 4883 

TTGTCGTCTTCTC6OTAT(M 
TCGGTTTG(XX3AAAA0C<£A^ 
(aaX5CAM<m£T(X^TAC^ 

gcgioaggttacotaattac(k:cg^^ 

ATTGGCXJATGACGCATATCCAAGACGCGCGCGT^^ 

CCGGAGGCGCAAAGGGACACGGTTTGGGTG(XX^ 

TGGCGGCGTTCAACOTGTCT^^ 

GGGC^CGTTTTCAGGOTiTra 

TCGCAGGTTTATTnATCGACTACCTGCC^ 

CAAAMCCCMCCGGOGGAGGCTTTGCGTTATGA^ 

SEQ ID 4884 

MFSLEAVttGIiRYIiRAKKRNGFMOT 
ERKVVETCDIOffAGKFEDU 
PEAQRI7EVWVRDWTYSNRSYFEAVEIOT 
SQVYFIDYlJSDViaiGDVAfc^ 

SEQ ID 4885 

ATGAGTCATITGATITrGAAATGCGAA^ 

TCGGCTCTTCGGGCAGCGGCAAATCGACGCTGCT^ 

GGGCGATTOCGmCCKCATC^^ 

GCGGTAGAAACGGCAflireGCGATGCTCGAC^ 

CGCAACCC^TGCCPGCTTGOT 

j&CGCA£X&(XiACGAACTCX£CG^^ 

SEQ ID 4886

MSDIJLKCEGVGKKYRDGGIdJVPVLTC 
AVETAMAMLDKVGLKHRSTHRAGELSGOT 

SEQ ID 4887 

GTOT^GAC<mTTTCGAC^^ 
TQCTTQCARRCTGTAOCGAAC^ 
CGCGCCTTTGATTTTGGAAGGCAGMGGCTGTTTTO 
GGCGCGCGGCAAAACCTCGCAAAATGGTT^^ 
AAACGACAACGGTTGCCAAACTO^TGGCACTGAT^^ 
TTCAATCAACGGTITTGftCGC^ 
AT(XX3TCC(KTOXX3T^ 
AAAACCAGCKTCGTC^^ 
CAGCGCAAATCOGCCOGTGTTGGCGCAAAACAOT 
TGGGCAITGTWGACCGGTTTCCGGACGAACTG^ 
TCGAAG(XXXATACGCAGGCATCTO 
TTTGGCCTATTTttXEGGAGAGCCXftT^^ 
TATTTTGCXGATGCGGACGGATOA 
TGCCGCCTTCCGACGCACCTTCG{^ 
GMOTITTGrCAGGCGGTAAA 
SEQ ID 4888 
VPSDGISTBl^U^ 
(^QNUKWFQGAGSBGQR33AAALA^ 

WALFDRFPDELEHSECSPNAR^ 
YFADAIX5FKKVAVSCLPEFEPAFAIfFVHKSQG 

SEQ ID 4889 

OTCCCAACCTGTTTGCCCCGAT^ 
^TGTEGGCGGTCAGQCT^ 
TTTGATGGATAAGGACACCGACATCGGTAAA 
GCAGGCMAGACCCC^^
AMTCATCCTGCCGATGCACGGCAACGGTTTGTO
CTTG&EGCmAATCGG^ 

hk 

SEQ ID 4890 

LPTOPRFSTISSHKRTSNAERRAAMAK^ 

AGKDPAQSIRIKPEDDIAGIRSRMyTEVy^^ 

<M>AI>SGASI,TSKGVQGSFAYWF^ 

SEQ ID 4891 

cgccgaccao^^ 

AT 

SEQ ID 4892 

REE 



SEQ ID 4893 

GTGCCGTTTGAAAAAGGCATMG<KKXJGGTGGOT 

SEQ ID 4894 

VPFERGIRGGCVLPTVGIGTPAPPAEHHSGG1JL 

SEQ ID 4895 

GTGCATTTGGGGCTGCGCCCTGCCGACTT^^ 
CGCTGQCTTTTGCXXmACAGGGA 

SEQ ID 4896 

VHLGLRPADFLEKAFIRAGLRVEDVIOTP 

SEQ ID 4897

TOGGCGGCAfiCATl^AOGTTOCT^^ 

C6GGCGTACTTGCCGCCATTTTGGCGAAACA6G(K^TTC ~ 
GGTOAAAl^XGCGAMCCGATTTATTTCCCGAAG 
TCTGCGATGCTGGCTGCGTTTT^ 

SEQ ID 4898 

L(&SIHVPFGVFSPI£GEHiDIiLAH^ 

SAMLAAFLRDAOT

SEQ ID 4899 

ATGCTGCCGCECAACTGCGGAACGGCM 

AGTTTGACAACGATTCAAAAGGOTAGATCAG 

CTTCGCraGTAmCAGCGmACGTAGTGAATOT 

CG6!TCGGCGCAGGAAAGCGA6CTTGTAMGAGGCAGAAGCCTA(X3G^ 

GTCATCAOGCGGCAAMGATGAAGAAGCGGCX!^ 

CAAGGAAGGCGTGMGCGTCTTGAAACCATACCGG 

GGCTTAAGCCCGCMCAAAMGAATCCGGGACAAGCGTAAG 

TAAGGGCATTTCCCGACTTCATCAAAAGGCTGGAA 

AGATTACGATCCATCXXXSTTACAAATCGGCXj 

SEQ ID 4900 

MbPPNCGfTATPHIiRl^VGIiDISKLTFDATAIVGNAE^ 



GLSPQQKJSGTSVRGKGKLTKFGNRKLRAVOT 

SEQ ID 4901 

ATGCACACCC^TAAACCCCAGCAGCAGATTGAGCGG^ 



GGACCTICPTCTTCATTGCCGAAAGCACCT^^ 

kTACCAA0CTTTG(XXX3GCTTTO 

AAACCATACGGA!IWTACA<^ 

SEQ ID 4902 

MHTHKPQQQmQKKIffAVIUiSVHV^ 
IPTFGRLIjQTFVAPFEPVIGKNKFR^ 

SEQ ID 4903 

ATGGAGTTK^TGCCATCXXX^ 
TOGCGGTTCTGCCGCOTATOT 
TATTATGTCCXjTGAAAGCGGTGCAGACGGCATTT 

SEQ ID 4904 

MEFDAMPSETFRLQTAWAWKDKAGTO 

SEQ ID 4905

ACATATAAAGCGCGTGATCGACGATGCGGTAGCCGTO

mttogttacctcgco^ 

. tgcttgccx&ctatcac 

SEQ ID 4906 

TFALGLTVAAHAVHKKVIDDAVAVFI^ 
LQTRAGNLQTAWQIiRNVABFFHNIQVTCn IDAI IKNFSIJjAHYH 






SEQ ID 4907

ATG(&TACTCTCTTGTCTC 

(3CTTCAGAC(mTG(XKrPTGGAW 

GCAGGTATCCCATGTOTCTTT^^ 

SEQ ID 4908

MDTUiCLTGYGIJ'RKIiGLKVI^ 

SEQ ID 4909 

AR&CACAAAAAGAAAA(XATTT^ 

CCGCAOTGGAAAACGCC^TOaXXm 

(X^AGTCTTGAAACTGCCCXjA^^ 

GMGMGACACG(5GCGGCMTACCGACCTC6MGCC^^ 

CX^TTGCCGAAAACAGCCGOTAACCGATACG^ 

AAAGCTGGAATCCGAACTCGACATTATGCAGG^^ 

AAAGAATTGGGCGAAGAAGACG&AAACGGCGAACT^ 

TGATGOCGCCGATGTCTGCCGAATCCACCGTCCT 

GGATGCCGACCACTACGGCCTGGAAAAAGTCAAAGAA 

GGCAAAACCTCTOGGGCGAATCCATCGCC^ 

CTAT6CCCGGC^GATTTTGCAGAATAT6GCAAAA6CA 

TGAAGTGCTCGATCACGAACAAAACAACAASTTCGCC^ 

CTTGACCGTATGGAMTCATC^TCTGTCOjGCTATACCGAAGAC 

TGGTCGAAGAMGCGCGGTACGCGATATTATOGTTATTACACTC 

GjAAGATAAGAAGAGGTTGTCTGAAACCAAGAAAACCAGCAA^ 

GCCGAAAGTGAAAAWGCATCGGACAGGTTACCG6OT 

AGTTGGGCGATGTGATGMGGAATCTGTATCCGCT^^ 

CGMG<nXX^CGCC(^&^ 

CGCGGCGAAGTCCKKXGAT^ 

ACGTCAAMCCGGATTMCXATCCATO 

CXSCTTCAAAAOWACCCCGCA^ 
SEQ ID 4910 

MTQKEKHFEEYAALATLPLRDVVVyPHMW 

EEDTGGMmVRRTLlAQFEQYAKLHmPAEIIGSIlW 

KELGEEDENGELDALEAGIKKAGOT^ 

GKTSXjGESIAKATGRKYVHMALGGVRDESEIRGHERTn 

LDRMEIIRLSGYTEDEKINIAMQYLVPKQM 

AESmiGQVTGIAWimGELLTVEAAA^ 

RGmPIGGLKEKLimRGGIKHmPKDTO 
SEQ ID 4911 

GTGMCMGTCTGMTTGATCGAAGCGATTGCTCM 

CCGTTACTCTCGTWKnTO3GCT 

AGCTAAAGCATTGAAAGACGCACTG 
SEQ ID 4912 

VNKS^EAIAQEADISmAQKAKDA'nm^ 

SEQ ID 4913 

ATGGCATTTTTCCATCCGGGTTTGACGGGCAK 
AAAACAGTTTACAGTGCGTCTTTCAATGCTO 
CGAOGTAGAAAGTACCGAAACCGACX^ 
TTGAGCMTCGCTTC(^TCAATTCAG^^ 
ATTCTGAAATTGCCGTGTCAAGCAAAAAAT^ 
SEQ ID 4914 
H^FHPGLTCSPSSESDC^ILI^ 
LSNRFDQmVHNQSLPVliKraJEM^ 

SEQ ID 4915 

ATGACTGAGCCGAAACACGAAACGCCGACGGAAGAGC^^ 
CCGCCCTCCTCTC^ 
GGATGCGGACAATACCC(mmCCGTCG^ 
GACCTGCTCGGOXX^^ 
SEQ ID 4916 

MTEPKHETPTEEQTORKKAKAXIRTI SSPGKLGAQSQL 
DIiWSGAKAFETRDKQCVADUCAD 

SEQ ID 4917 

TTGATTCAAAGTGOGGCACMTTATACCGGCAGCGGATAOT 

OTACGGTOOXXKXSGTGCCX^^ 

AATOKXXXKSTCCGCC^ 

ATTGAAAACGGCAAACACATCGTTACCGCCAAC^^ 

GCXKSATCXXIAATCATCAAAGCCCTGCGC^^ 

CGCGTTTGCOGAOTATTGAAAGAAGCGCAGGC^^ 

GOTACGCCGATCAAOTCTCCGCC^^ 

CCGGCAAAGGCATCGMTTQCXXXjTCCATCX^
CCCGCCCAAGTCCMGCGCAAACaTCCT^

MGAAMCGTGTCCATCGAAGCCTTMTCCAAAM 

CGAAGCACTGGATTGTGTGGAAAAACCGAT^ 

SEQ ID 4918 

LIQSAAQLYRQMPSSFNinSVGLQGI7IMKPVNIGLI£I/3 
IENGKHIWANKKLLAEYGKEIFPLA^ 
GTP^SACYLEGISKLDSRDIKYAEE^ 
PAQVQAQTILPMDEITSSYYIiRVQAKDEPGTLGQ 

SEQ ID 4919 

ATGTTGTTTGMGAAMTCX^ 

TCCCAAAATCATGGCGGATTTTTCC^ 
CAACCG 

SEQ ID 4920 

MfcPE^IDGQFAEYEC(&(H3IRIAG^ 

QP 

SEQ ID 4921 

GTGATCGACGATGCGGTAGCCGTTTTCCTC^ 

(XX^TGTCCAACTCATAMCCKXX^TO 

(X^GCAGAATGCGGTACACATCTTCCG^^ 

GTTGCTGMTTTTTraTAATAT^^ 

OJGA(XK3GCAGTCGGACACAGACAGGGTAAAATA 

SEQ ID 4922 

VIDDAVAVFLGDFVLQGFDFGIVEFRYLAAI^^ 
VAEFFHNIQYPCniEAIIIOTSLLAHYHTIA™ 

SEQ ID 4923 

ATGGCTGATATGAAACGCTTGAAACATTT^^ 

CCATCGTMTG(^ATTTCCGTAGCT^ 

GCTGGTTACGCTGGTCGACCAACTGCTTC^ 

GCGATGAAAGAGCCGCCGCTGGAAAGCCTGATCGACGGCATCGGC^ 

GTTATACCGTCTTTCAAACCGTACAGGACGGCGGCTOT 

ACCCGAACAGGCGGAGGAA 

SEQ ID 4924 

MADMKRLKHI^SPPIDNNPIALQVLGICSAI^ 
AMKEPPLESIJDGIGKGAGYGMLLVVArVRELIGSGKI^^ 

SEQ ID 4925 

ATGGCTGTCTTCOaCT^^ 
CGCAGCGCGTGGAACAAAAACTGCCGCCGCTGT^^ 
GGGTATGGCGCGGGACGAGATTGCCCGMTCACGGMA^ 
GAAGTGCAGTTTTTTACXX&CGMGACGGCGAC^ 
CGGTCGTGGTCAmOra^ 
CGATGCCGTGCGCCTGCTITACGACAGCCTG^^ 
TCGGACAAGGAAGGCGGAGGGGGCGGCAATTATTACG^^ 
ACCGTATGCACCCCATCCTGCACACATGGCG^ 
(XX&TACGGCAACGCGGTGATGATAOX^ 
GGTTCGACAGGGCGTTCGACCGGGCCGCACCTGCATTAC^ 

SEQ ID 4926 

MAWPLSAKHRKYALRAIAVSIOT 
EVQFFTDEDGERNIiVALTO 
SDKEGGGGGNYYDEKJKVLQEKGGFNIEP^^ 
GSTCmSTGPHLHYEARINGQPVNPVSVALPTPm^ 

SEQ ID 4927 

ATGCATGMTACCGATTOGGCATTCAGGACAT^^ 

GG 

SEQ ID 4928 

MHEYRFGIQDIGFPALGRAQGLGLASTVNFPFQTASALTFR 

SEQ ID 4929 

ATGATTGAAATCAAAMCCTC^^ 

CCAGCCTGTTTGCCTTMTCMGGGCGAAATO 

GGATTACGTTTTGCAGGGCGATGC(X&^ 

GCCTATACCGCGCCGCKXX^GCGCGGCAAAATTGT^ 

AAG(XX^TTTGOXXGCT^ 

TTCCCACGACCGTGATTTCCTCA^ 

(XXjCAACAACAAGOCGCATATGTO^ 

TGGCGAAGCTCGWiCGCA 

AGGTAAAACXOTCCTCCATGACATTACC^
TTACTCT(XXXX^GCATCGTCCATTCCGAAAAA
AAGTA(X£GMCAAGAAATCC 
TAOTXXsCAAAAGGCGAACC^^ 
(^CGA(X£CAGCCTGCTTGMGCCACGAC^ 

AGCCGAAAAAGAAATGGTGaOT^^ 
GCAAAAGTCAAAACACAGCTTGCCCMATO 

SEQ ID 4930 

HIEIKKLTLQRGLKVIiIjHCASMVNPGQRVGLIG 

ATOPARAAKLimGFSQEEHSRPVKSFS^^ 

AQQQMYVKQQAQIKHLQSFIDRFKAKATKAVQAQSRMKALA^ 

IaLSGSIVHSEKI^GYFAQRQItDTXRADQSPVWHIQ^ 

HDRSl^TTDSFLLIDKGRLKNFDGDLNm 

AKVKTQLAQIEETWIiACQEELKRIETEIEKQPAHl 

SEQ ID 4931 

TTGAAACTGAAATCGAGAAACAGTTO 
TAAACATCAGACOTTT^ 
GTGTGCCGATGCTGATTTACCTAAAATC^ 
CATGCCGGOAAAMGAACAGACAG^^ 

SEQ ID 4932 

LKLKSRNSLPSDKEARA(£l£LPANra 
HAGKKNRQAAKAPKENQK 

SEQ ID 4933 

ATGAAATCAAAACTCCCCTTMTCCTAATC^^ 
CGTCTAAAACX^TGCCACTCMCCGATTTOT 
ATATAMGCCCCGGTCAAMCAGTATCCMGCXX3GCAAAATCCM 
GCCCTGACTGAAGCCCAAAAAATGTTATCACMGCAC^^ 
CACKX^GAGAATTGGGACGTATO 
SEQ ID 4934 
MKSKLPLIIiINIiSUSSPIiGAKAAKI^ 
ALTEAQKMLSQARLAKG<^ 

SEQ ID 4935

TTGACAGAAGAAATGATGAAAAAGCAGGAGAATTOT 
GGCTGCCTGTMTCGTATTGMGGTCAACGTAT^^ 

SEQ ID 4936 

LTBQiMKKQEHFWIKLGDLI^^ 
SEQ ID 4937

CCACATCAAAG(XX3AA2K^^ 
SEQ ID 4938 

ffliAVPPQALPVKSWlAEEFTARWHPffiXm^ 
SEQ ID 4939 

TTGGGC^TATGCCCTGATGCTCGCTGTTCCGC^^ 

GAATGGTTGCCTACCACCAC^^ 
SEQ ID 4940 

LGVCPDARCSAAGIARQIGK^ 
SEQ ID 4941 

ATCCCnXXK3C03A^ 

Ajm»GCA(^m^^ 

SEQ ID 4942 

MPAABQRASGOTPHAQ^^ 

SEQ ID 4943 

ATGGAACACTATTTGAGCCTCTTCATCAM 

!KX»GTGTGGCGGTMTTTTCG^ 

CTTTATCGGCGTGATTGCCGCTTTGGTGQ 

TTCGCH'GCCGTTTCGTTTATGGCGCM 

AMTGAAATATTCGGACGCTCCCAAAGGCTO 

SEQ ID 4944 

MEHYLSLFIKSVPIENMALSFFIiGMCTFIiAVSK^ 
FGAVSFMAQREYDFGESWYGFGAGI/^^ 
SEQ ID 4945 

ATGTATGC(XXXX3ACACCAGAATAATC^^ 

aOTATMTATCCGCCCGATCCTCA^ 

(^CAAAATCCTGCAATAC(X&TC(^ 

TCTGAAA(XX3TC(XX3CTC<^ 

TGTCGGCGTTTTACGGCGACATTCCGTG^ 

GTTGGGCGCGGACGATCAGCCGGACGGGGAAGACGCGT^^ 

(XXJGAATTGMCGGCTTATGGCTTTGGGAC^ 
TCCGAGCTTAT<KX£AAAC^^

GTGGGMGAACGCTGtnro 

CGGMGTTTTGGC(X?KK^CGAAM 

SEQ ID 4946 

IffiAADTmETASARSAY^ 
SETVAIATPWQKMGLHQANVLTAEYLDVOT 
PKLNGLWIM)SI£GSAQ«3TLFAD™SR^ 
RKFWROTTFDGIff 
SEQ ID 4947 
TTGTGCAAAAGARGCAACATCATGAAATT^^ 

SEQ ID 4948 

LCKRSNIMKFKQFIIAT^ 
SEQ ID 4949 

ATCXECAATTTMTGAOX^ 

AATCGTCCGCCCMGCGGAmGGCTGTCGC^ 

TGGCTTreCGGCAGTCGCAGCTT^^ 

SEQ ID 4950 

MPNLVTAVIAGRFLRWRIPYP?^ 

SEQ ID 4951 

TOCTTGAATTTCA^ 

GATC^TTGCATCAAGAGCTGTTCGAATTCCG^ 
SEQ ID 4952 

LIiBFHDVASFAQSVRNSEARSIMiCGSRSFTSGIAGNW 
SEQ ID 4953 

TTGGCACGATGCACTCGGCGGCGAAAAACAGCAGCOTAOT 

GCATTCCGGCTGACAGCGTTCG^^ ' 

CGTCTTTACTCMTAOTACAM 

GGTGCOTGCAGGACAGGCGCATTCGCACGCCCTT^ 

GCACAACAAAAAGGGAGGCT^TAGACAGTCAAAATCATTO 

GCTAWPGAGTCAACAD3CT j 
SEQ ID 4954

LARCTBRRKTMWSGNFKCGQ&GTFVGANYI^ 
GACRTGAFARPFRLGTLYRYRHQMCDTPQAPCLR^ 

SEQ ID 4955 

ATGGACACTTCGCACGATGCACTCGGCGGCGAAAAAC^^ 

CATACCGCCGTCTOACTCAATAmACAAAG^ 
GTTTTGACGGTG^^ 

GTGGGTATGCACMCAAAAAGGGAGGCTGATA6ACAGTCAAAA 
GGCAAACAGCTATTTGAGTCAACACGGTATCGAACC^ 
SEQ ID 4956 
MDTWHDALGGEKQQPYFQK 
VLTVRAGQAHSHAU^VfiaFTDTO 

SEQ ID 4957 

GTGCATCGTOCCAAGTGTCCATATCTGCCCT 

CCGCTGTTGCCGTOTAATaTCmm 

CTTGACCGCCGCACTGCTTTCCGCGCT^^ 

(^TACACAGGCGGCAACCGTTACGAAGTTTTTTATC 

ACCTTCTTATCATCGCAAACCCT 

SEQ ID 4958 

VHRAKCPYIiPYPFKPGYTADIALPCQTTR^ 
RnXSGNRYEVFYRGTH^QKTGQEVFEPG^ 
SEQ ID 4959 
ATGGAATTTTTCATOTCTTGTOSCAG^ 
CGGCCGGTTTGMTATTTTGATTCCCT^ 
TGTTGACtXjCATCATCTATO 

TOCGCCGCAJmATCCTT^^ 
OSTGAAGCCG^ 

(KXGAAGCCMTGCCGAAGCaTCCGTCAAA 
MGAAAGCMTACGCTGATTATGCCCGCCM 

SEQ ID 4960 

MffFinJAAVAVFGFKSPVVIPQQEV^ 

MEIiDKTFEERDEIHSTVVSAIJDEAAGAWGVKVliRYEIKDLVP 

AEANAEAIRQIAAALQTQGC^VNLKIAE^ 
SEQ ID 4961

GTGAACAAMCCCTCATOTTO 
CGCGCGCCGTTGCCGCCCTGOXXXXGGCATG^ 
CCGCAACGGCATCATCAAAGAACGCACX^AOT 
GCAGACAAACAA 

SEQ ID 4962

VMrniLALSA^SLTACSVERVSLFPSYKLKII 
ADKQ 
SEQ ID 4963

ATGGAGATTATmWKSTATCGTGATGTTPAt^ 
AAAAAGAACTGACTATGCCCGCCGGCGGCAAACTCT^^ 



GAAGAAGTGTTCG^GTGAAAAMTGGGMTGCAOT 

CHJOTACATTCAAATCGAAGCC 

CGAGCCGATTTTGCGCGCCTACTCTA!^^ 

TCGTCCTACATCTGGTCGCTCAAACtt 

OXXXlATGCGTrCCCACATTTC 

CACGAAGCACCGGAAGACTGCGAATT^ 
G06GT 

SEQ ID 4964 

MEIILGIVMFTVIVIiAIJ^LFAKSKLV 
EE^GVKKWECTVISNDmTPIKELKLU 
SSYIWSLKPGDKVTISGPFGEFFAKlffl^ 
HEAPEDCEFYMCGPPIM^SVIKMLKDLGVnJE^ 

SEQ ID 4965 

ATGTATCGGAGAAAAGGACGGGGCATCMGCCGTGGATGGGTGC^ 

TGCTGGCGTATGT(?rTGGACCCTT^ 

TATTGTCCCTATGCTGGTCGGGCAGTTCAATMl^^ 

ATCGATCAGGCATCTATTATTGCGTGGlTraGO 

ACCTG<naTGCCXXX}CTT(^ 

TTTGAAC^GGTATraGGCGAATTT^ 

GTTGCQ^ATTTTGGTGTTTGTCX^CCTAra 

TTGCCGTCGGTCAGTTTCTCmAG 

TGTCGGAATGTTGGCCGGATTGCCTITGGC^^ 
SEQ ID 4966 

MYRRKGRGIKPMGAGAAFAALVWLVYALG^ 

IDQASHAWrQAHTGELSNALKAW 

VAGILVFVPYLGAFTGLU^^ 
SEQ ID 4967 

ATGCCCCGTCCTTTKHKXGATACATAAM 
GGTTTAGGCTAAAATACTGC^CATCCGATGCCGTC 

SEQ ID 4968 

MPRPFLRHKSTLPniLRASIAEPPRI^^ 

SEQ ID 4969 

ATGAGGGATGTPTIGTrTTCTTCTTC^ 
ACGATMCAGGCTATCGGGMGGCGGAGGCTTTTT 
SEQ ID 4970 

MRDVIJSCSVMHCCLIWGMP 

SEQ ID 4971 

ATGGGGGCGGTTCATOCGAGTTCGAGCAAGCGGTTO 
CCACCGTTCAGGTAAAGGTTGAAMCCMGMCMTTGGACM 

SEQ ID 4972 

MGAVHPBFEQAVLETVRIJIAPDTQAHHITTR^ 

SEQ ID 4973 

ATGAAMTCATACACAAAGGTOTGGTCGAATATCTGCC^^ 

TCTTTACGCMGGACTGGCGGGAAAACCCGAGCAOT 

TACGATGATTGATTOAMCGGCGCAAMCKAGCGTCCGCAA 

OCCGGCATTTACGTCGGCGAACGCAAAATOGC^^ 

ACCCCTGCGOTACGCX^TATGGAAATGAC^^ 

AGCAGACAACAATGAG 

SEQ ID 4974 

MKIIHKGLVEYIJTFEAMKTFNA^ 
PGIYVGERKIASLCMIKNGSVYHGIALN^ 

SEQ ID 4975 

TTGAAACACMCTCACACCGAMGCAGACMCMTGAGTGAAATCAA^ 

CCCTTCAGGAAAAACTGAAAMGCTCGMTGGATA 

CTGCCCmCATCGGCGAATGC^ 
(XCGACX3AGCXX3AAAAACCTCGCAGMTOT

TCAAA(XXATCCGCGAAAC(^ 

CAACTTGGAAACCCATCCGAGCCTCTACAG 

ATTATGGTAGGCTTGGGCXaAMCAGACGAAGAC^^ 

CTGTCTT(XXXrrAa3raCACCCGAAC^ 

GGCGGCGGAAGCCTTACGCGAAAGCCACGGCXSG^ 

SEQ ID 4976

LKHNSHRKQTTMSEIKTTDPKR6IKI1RGADKTARIPI 
PDEPKNLAESVKAMNLRYVHTSVD^ 

SEQ ID 4977 

ATGCGTGCATTGTATGCAAAAGCTGTCG^^ 
CG(nOTnTrGCTQCT 
SEQ ID 4978 
HRALYAKAVGfimmQTCII^^ 

SEQ ID 4979 

ACAMCTCAAAAraTCCAAGGCAAC^ 

CCATACCGACCGCTGGGACTATACCTTO^ 

CAAAATGCCGCCGAAGCCCTCOKXSCGAAA^ 

SEQ ID 4980 

H*T£^AQKGSPVHKTLILALSAliPSLTACSVER^ 
QNAAEALRAKQNADKQ*GTH 

SEQ ID 4981 

GTGTTTGMCXXX^TATTPAACCGCCGAM^ 

AATTCGCAGTCTTCCGGTGCTTCGTGGTIT^ 

MGTGAAGTTCGGGAATTCTGCCGCGAGTTGGTO^ 

GTCGAAAATGTGGGAACt^TCGGGGCCATACCC^^ 

GGTTTGA0CGACCA6ATGTA66ACGACA!ITr6TCCC^ 

TAGAGTAGGCGCGCAAAATCX^CICGTTCACT^^ 

CGGGGC1TCGATTTSAAT6TAGCCGCC<K3CGC66A^ 

CATTTTTTCACGCCXiAACACTTOT 

TCGGCAGMTGTCGCCGCCGCCOr^^ 

GCCGGCGGGCATAGTCAGTTCTTTTTCATCATTGA^^ 

SEQ ID 4982 

VFEAEYLTAEIVQEDVFVFHAQVPW 
VENVGTHRQfPRTAAKCYHFGIGVFGEK^ 
RGFDLNVAAGAERHFPAFGNGKLEF^ 
AGGHSQFFFIIDFDGDVAFADQLGFGKQNHQCQS 

SEQ ID 4983

TOTCAACAATATTCJmTTTCCCATO 

ATTTTTCMCTTTTGACGAAAAATGGCCACGG 

(X&TTTCGAACTG(KX!G(X£AA^ 

GTCAGAAAAACTTTATCGGTAAAATCGAAGGCTGCGAAAACGA 
ATTCAAATTC 
SEQ ID 4984 

LSTimSHYKTVHPLFSVPKTQ^ 
IGYKNLEISSPGLDRPLKKAADFVRFAGQNAKIKTRLPI 

SEQ ID 4985 

TTGGAAGGCAKEGGGAATACGtX^ 

SEQ ID 4986 

I^IGEYALnLIIVCIU^RKFVMPTAFAVNARIPNSVTH^ 

SEQ ID 4987 

ATGCAAGGCAATCAAGCTCTOTOAT^^ 
TTGAACGTCTCAATCACGAGATGGAAGMGAAACC^^ 
CGTGGTTTCCTGCCTCAAAGCCGAC^^ 
CAACTGAAAGACACCGAAGAAGACCACGCACACT^^ 

SEQ ID 4988 

MQGNQAVVDYMNELLSGELAARDQTC 
QUa)TEmARmiEQQLRLIEIJGEGmQSQL 

SEQ ID 4989 

TTGGCTITOTAGTAGTTGCCTrcGCCMTC^^ 

TAGTCTKIAGOTCTT^^ 
GTGCXIAOTTCGGCGTAOCGCCCAGCATCA!^ 
GTAGAGGCGGGAGTGGATGAAGTATTGGTOCXXjTGCC^^ 
TTTCGTGOGGT 
VEWM^VACRQIiW^FVHraOBLIAI^SIJIJWVSCQ



GOsAACATTTCTTCAMCAA^^ — ~ — ^^i^^x*™™*™** 

KQlQOTEEHIDWLETQQEliIGKIGLPNyiiQTAAQED 

SEQ ID 4994 

UQOTJDPIEKPMKHSimmMPSmWaXXraWFVP 



GCTGCCGCCGCM^a^^ 
SEQ ID 4996 



CJAACAAM^TCTAGMTTCMXTOGGTA^ 
njJMUHIKAKKH^PHHOTTHQEEK 

CMCQGC^^ 
GAVmWVNGXTOjlffiHIJDVLGIMHR 

gtcgagntaacaaaacgttc 
S !^f\?5fcLfkpar^ 

IALTft(X3IIAQABGRVKEAVSHYREIi^^ 



VEFNKTF 



MACCTlOTCGCXm 
GATAGCGMGGCGCMGGCTTGGCGGCCAACCTGCCCGAOC^^

(X3(K!TGCATOAAACGCCA^ 

GCOTTTTTACCGTCCGOT^ 

CCAAGTCAACAACCAAATCGCCGCGCGGCA^ 

ACCCTGCGCGCAT(X?rcGGGG(mCG 

TATCTGCCC6CGT6GGAAAAM 
TAGATATGCACAGCCATGAGCTGCCCTACX^^ 
GCCAAAGGACGCGGCGGCGACCATGCCGT^^ 
TCCmTGTCT(Xraa3CCCAA^ 



CGCCGCCGGCAG 
TCTGGCACGCCGCCAAOCTCGTACACGAC^^ 

GAAGACTACCCttJACATCCAAAGCG^ 

CMCGTCAGCCTGCOTACGCCAAAATC^^ 
CTGGAACAGTTGGCA 
SEQ ID 5004 
KPENLSSALETFKQQRNAAEAHYliKANRVSWFREYTMVCT 
VRDDITGOTAFLKARFLFC^QTMELAEKMNVQRNVA^ 
RLHLNAKRAEERLLFDIjQPQ^^ 
TLRAWWGATRKIHSFYQNSENR^ 
AK6R6GDHAVQ6IADAI^FAAfflFLTEEESDIiIiAW 
RRRQEAADLLl^VPEKQQKKLWNALGSAYFARHQSRBIL 
EDYPDIQSALEAELNSFIHGHWft 
LEQIA 

SEQ ID 5005 

GTGCATCTTGCACAAACCGAMGACCGAGCATTATGCCT^ 

AAACCAAGCTTACCCGCGAAATCACACTCAACC^^ 

TAAAAA(ATGCCGCCCGAAAT(X!A^^ 

ATGCGCGCGCAGCGCAAACGCAAAATGTCCGGCX!^^ 

CCATTATGACCC(X3CGCGAAOT 

GAGC(XX3TCAAAGCCTK£ra^ 

AAGTCATCGGCGGO^TCtmCTGCCAAA^ 

O^^XnX^(mACTGACa^TTCAC^ 

GCAGGC^TACAGCGTCATGCTC^ 

CGATGAGCCAAGGTTCTGCCGAC(XK^ 

CCAACTGACCGGCGGACTGCGTTCC^^ 

CACGACGTTCAAATTACCAAAGAA^ 
SEQ ID 5006

VHliAQTERPSIMRIVmYTFDD^ 

MRAQRKRKMSGLPVVQWKWGIVTNRDLRFE^ 

ERVKALVEAGADVIVV^^ 

AGAYSVffl/»(^AGTEEAPGEIELY^ 

HDVQITKEAPNYER 
SEQ ID 5007

TTGAAAGATTGCGTOATTTCCACCGCCGTOT 

CCGTOXX£CMTCA(£CCG^^ 
SEQ ID 5008 

LKDCVYFHWKGRRYSAD^ 

SEQ ID 5009 

ATGAAAATGCCGTCTGAACG^^ 

GCCGGCCGCCATATCAAAA^ 
SEQ ID 5010 

MKMPSERRLHPVCRIPSAOTAVBINAIFQyGR^ 
SEQ ID 5011 

TTGCCTTTCCGTTTTI^ 

TGGITOCXjATTTCCCTTO^ 

ATOmCGCGCGACCXEG^^ 
SEQ ID 5012 

LPFRFLTliIGTGFFAFFPDSATAGT^ 
SEQ ID 5013 

ATGMTAAAAATAmAATCTTTAAATTT^ 

CCGAGGCXXIAGTTOCGOGGCGGACMGCTG^ 

TACGAAOGCCAGATGCGTG<^ 

AAOTtHCGGCCGTT^^ 
TCAGGTTATCGTCGGCAAAATTGAGGTTTATCCCG
AAGCATCATTTG^^ 

CX3ATA6ACGGCGAAACGGCGCGCGATTTCGAC6ACGC 

GATTGATGC/^TGCTCMGAACGCAGTACCAGCGTGT^ 

TGTAIiyjGTGTGCGATATGGTCGTTAOT 

iOGGCMCGQG^TCCGCIftCAA^ 

TTTCGACGACAACG^^ 

AAACATACGOTTTGTT(X2GCAACX^ 

ACTATGCC(mrTOCCGAACMTOAAA 

GGCTTATGAAGCATACGCCCACTTTACCTCGC^^ 

GCTTTGGGCGTGCATACTOGTTT^ 

TCTOOGGGGTGGCAAATTTTQGAM^ 

CGGCGGAAGGTCAAATTATCCGCATCTGCCAAAOT 
CTGCCCTTOXmTCGGGGAAAM 

YERQMRGVHHGDTVTVR^^ 



OTattWYAGNIKE^ 



KHTALFRNHLGPTPEKLAT^^-^ * — — ^^nT.Awm 

M/^FCERRADDAGTO^^ 

RRKVKLSASAKPAGAAGKGKSRraAEKKTSECGKVKSGVPAVA^ 

S 1£iKiSaTC^ 
GCQATATGCCGCTGGAAAAATGGCGGCajJITIOT^ 
ACGGCATOGCGTATCATTTAAACAI 

SEQ ID 5016 

MFOSMmOTmTQKAYPPKQICSTCCR^^ 

S Mra£»AA^^ 

TtaTOSACAS^CGACOTQAAAAA^ 
CfflmMIGQMCGOGm^ 

OCGATTGTCGTCATCAACAAAATCGAOUIACCXjTCCGCCCGTCOGAG^^ 

OTrarrTCAGGTCTCAG^^ 

(XCGCTCCAACTCCAAAmCCCAACl^ 

TCG(XGTMCCAT(a(mCaU^CAACCCCAAA 

ATTOrrGACCAGCCGCCAMTCCGCaCCGCCTO^ 

CACCWftCGMTTTGCTGGAAMTATGCG^ 



mxmrraTrarrTTCCA^ 

GTACTGGTGTC0CAAGA6CAGGGCGAGGCG6TTGOTACGCX?nBTGG^ 

ACAttOmCAACGATTOK?^^ 
C(XG<^GmATC(^CGAl^CGA(X:ra?ra3AAATCA^ 

mVIlTOKPSARPSWVIDS^FDraXaTOEQ^ 



HLTIUjENMRREGYEIiAVu.. » - _ 

— - — •■ — ™-~ >-»~™~>»m ""jyjsELEjmiJHPKKU) 



AAGATTACGAATAC^CTAOACAAATG^ 

SEQ ID 5020 

HVDSITKKVSINPFYEVRIMSDS^ 

TQGOCTTTCCAACCGGCTTCAATGO^ 

SEQ ID 5022 

VQVCRLCVKTFPGASKQPFDR^ 
SEQ ID 5023

TOGCCGTAGTCGGCGGATCGGCAGGGATGAGCGGCGOKC^ 

cGiairiwx»nmcca^ 

(j[$XXXXAGTGCTGGCGCGGMTCGAG 
GGG«amAACATOGTTOX3GCGTCCGW 
TATAAAGGAATCGQCAGGCATTG^^
CG(XX£CAAGGTGGCTGC^ 

SEQ ID 5024

MFPVFHLSGESSREMLQTAIiRFPHVFKARAEDSHKG 

VAAVLARJmDEAVVLDADAm 

AQGVPVFEAACAG^ 

SEQ ID 5025 

AT(&AAAACXXKAAACATTTC(X^ 

CTTTTCOOGGCTGCTTGCC^ 
SEQ ID 5026 

MQiGKHFPSNRLiaiRLENXPIU 
SEQ ID 5027 

ATCTGCTATAGCGCAAAA(£C(^^ 

GTTTCTTCAGTTTC^^ . 

CGCCGTCTOTTTCAAGTCGG^ 

CTttACTTTTTOa^^ 

AAGGOttAAOTCAATCX^^ 

SEQ ID 5028 

MWYSAKAGGDMLQWHTHVG^ 
LNFFKLLAVFSAVHmVAAYRI 

SEQ ID 5029 

GTGMmmTCCC^TTTOG^ 
CCGCAAAACACCTGAATGCCG^^ 

GTTCAGATTGCCGTTTCCAGCTCCGGC^^ 
(XX^TCXXX5GTTTTTAT^^ 
(XXXIATCGAACCGAACGGGC^^ 
TCGCGC6GCGATCTITTOTCGATAAAAATTTC 

CMCOTCAAAAACCCOTACGGCAACGCC^ j • 

SEQ ID 5030

VNYFPIFANIAQlPVLWt^VAAIUaSIiLm 

VQIAVSSSGSAPVLARIJ^RKRLEALI^ 

SRGDLFVDKNFPLAPQKHPEATI^ 

SEQ ID 5031 

TOTATCAGTTTOTATGCCGCTGT^ 

TCATTA!TCCXXXXXXX?I^^ 

GACCGACMTGCGGOTCTTOSTCG^ 

SEQ ID 5032 

LYQFFMPLF^LPPmTMPSEARIGF^AAKNERFSAGFIIHPRP 
SVCFSPSISMPKDFDLCKTUCAAKG 

SEQ ID 5033 

GTGTCGGGGCTGTCCGGCTACGC , nK3GGCAAAAGCAG^ 

CGCAAACXSGCAATCOGAAATOTTTGCXXjACA^^ . 
TCGCCTCCTGOTGGTTACGTCC^ 
CTG(K3mraGCGACA^ 
CCGCaXXmGAOJGATGa^ 
AGGGCAGCH'AOTGCAAGGCAGACCCCTITO^^ 
SEQ ID 5034 
VSGLSGYAWAKAGNGASAG^ 
LGLGDSSYPNFCRAGKDFDKRPEELGAKRIiKRVD^ 

SEQ ID 5035 

GT6CGCCACAT(X5AAATCGM^^ 
TmiXXXXmCGGAAM^ 
CGATAAT0ACGAACa«5ACa?rAa^ 
G0CGG0CWOTGCGCXX!GCTTGCGCXX^ 

GKKX3GCTCC6GTACCGGCGT^^ 
TATCAGACCGAATGGCAGCAGTITCCCAAAGACGGCTO 
GACTTTGGCMTGGCTCX^^ 
AGACGGCGCAGAAG^ 

SEQ ID 5036 

VRHIEIDIiSGSDIiHYLP(a)AL6VWFDNDPALVGEI^ 

AGLLRPLAPRLYS I S SSQAEAGDEVHLTVGAVRFEHEGRARAGGASGFFADRLEEKT\^ 
YQTETOFJH0X3FLHRYDFAtfSR^ 
SEQ ID 5038

LDCYNREBKOff?! 

TCMGAAGGC^ 



ftCGGTATA(X3CACiaGGAWCACACTRTGCCAGAATACCGOT 
CCGACGCGCTGGTGTGCAT 



GGCWmTSQTftMSACCaaGGCGT^ 



SEQ ID 5042 ^ ^^^^^^jTTHGRHiaGARMm 

HFFFAARVSIPQNSmWY^ 



GCXXXTO3CTGTCGGTAAAGCCGCCGTO 
SEQ ID 5044 




CCCAGTCTGTGTAATTOGTGTAGGGCAC^ 
GTTGCCATTT6TT6 
(yilG(&TACGCCTGACQGAAAG^
TCCTGTOCGTCOGCMCGCOGGfPGW 



SEQ ID 5052 

t^IFVWAHPFGVPPSSHSVPmDPnX^I(X:i£VRMI5VRl^ 



SEQ ID "J"*--.^^ 

GGGAACGGACTftOTtn^ 

GNGIjDVSPGGDITBR 
SEQ ID 5055 ^^nw*™™™*^^ 

SEQ ID 5056 rmrrm n.m^.mmncmniPTOilimT^ 

GTTTGCCTGTCAGCG^ 

SEQ ID 5058 

LPGCAAQRRKNRLPIHFKK^^ 

ACXICGCAftCCGCCGTCATTGCGCCGG^ 

CCGMGT(X3AAMAGCCTCCGQGATTAAAGCGGATTAT 
SEQ ID , mMO _^ RPm ^^ 
MLAWVLEYASLAPGFtlGGVPEKFGVS/^PQTPRQDFNSKSPFFVIE^
KGCWTPVBKFGTGHGBQIGEVHAIXjSFDVLIjIXjKKAGOT^ 

TITGTMTAOGTGCKCTTCGGGMAAT&AICCi? 

SEQ ID 5064

IS*miF*P*SHS(«FCFIlBISS»ILIW*{aiHHNFSPPFCHICCFGKnH 

TCmTraSGGGCTGTTTICAAAGCTGTCAGAam 

GCMGGGSSJlCGGQi&AA^CGGAOjGCGRCAiMARGCC^ 

SEQ ID 5068 

LHRSKSFQffiMKSEBJTIAGRlftliEG^ 

CGTCCAAAATCCGA&AGOJAAW^^ 

^ TTCATCGGTTOMTCMTG(X^jrCT<&^ 

M^TAAACATWX^^ 
rrCMTAGAACTTCATCTMTTCTTTCATTOTTGCTCA 

FHRTSSHSFIPCS 

SEQ ID 5076 

I^AAHKHRLPAKPIUVKDADNAATLraPTOUiFA 

SEQ ID 5076 

UJWIMPQIiFRIYRQTPYIiUlAVIiFFTKL 

TCAAAGCACCCGCA 
SEQ ID 5060 



SEQ ID 5082 

IJMVEHVIiTDIAPEANGIRTHKEFGKKFVKSTD 

GGCAGTOjAMCTCATCTTATTGTCCAATOCCGTGG^^ 

SEQ ID 5084 

HPSEKIYYGVIJH£IAattJiSPFTCAGMiKPK 
SEQ ID 5085

GTGMACACATCAGTOGACC^ 

TTTTCCTGC^TCCGGCA^^ 

CGGCATATTGAAAAAMTCA6CAGCTTGA6TTGTGCA 

GATCCGGGCAATCTCGGCACCGTOT 

CO^TTTCTTGTTG^ 

TTTGTGCCaAACCGACAGCTTGGGTGTTCGGC^ 

GTCGCGATGGCGGCGACAATCTGOTGTI^ 

SEQ ID 5086 

VKHISSTNN^IRHLHRLI^QGOT 

DPGITOGTVLRSAAAAGW^^ 

VAMAATICLFEQ&RQRAAY 

SEQ ID 5087 

ATGCCGTCTGAAACACGCCTGCCGAACCTTAT^^ 

TGGGTA(X3ACCTATACCGTCAAATAOT 

CCAGACCGATTOGAAATCAGCX^^ 

GCACTGGACGTAACCGTCGGCCCTTTGGT^^ 

AMTCATTTTGCAACAAGGCAAAGATTACGCTTCCT^ 

AAAATACGGCATTCAAAA^ATCTGGTCXjAAATCGGCGG 

AATAC(XAGATTATCGTCCX^^ 

GACCCATCACmCAACCTCGCC^ 

AGAAAAACTCGCTGTTTTOCTM 
SEQ ID 5088 

MPSETRLPNHRALIFALGFIFTiNACSEQra^ 

ALDVTVGPLVfflWFGPDK^ 

NTQIIWLNNRSLATSGDm 
SEQ ID 5089 

ATGCAGTITTACCTCCAACX^ 
CATTCGGGCAAAAACCWTTTTGC^^ 
CTCGCAGGCAG(&CG(XX^ 
TCAAATGGTTGTTTTCATGCGCCGGGAMTC?^^ 
SEQ ID 5090 

MQFTLQPQAQFmGVNGGFTDSBGRRSGCSAAVSGKSAPAFGQK^^ 
SNGCFDAPQ^THREHTCTAPCAAI^^ 
SEQ ID 5091 
TRXTTTTTTCOTATCGGG^ 
TGGATTCCGTCGTTTTGCTGCATC^^ 
CCGGMTTATTGCGATATOTAGGGGTGGGG(nK^ 
GGCTTTGACGTITPGGCGTTGGCGCACCACA 
GGGAAAACGGCAOTATCTGGCGGCCTTTG^ 
GCGAMCaCTTCCGGCACOT^ 
<3T03TTCAAGACTGC(X3TTQG6TTO 

GTKGMCAGTTGGCAAAAOCGGCTTTTCT^ 
ATCTTGGAACAGGACGGGATTTOAGGAC^ 
CAATTTGGCCGCTCGTT(KXGACA6CG^ 
ATTT 

SEQ ID 5092 

LLFLYRDBAVLTLDAF^^ 
GFOTIAIiRHHRD^ 
VVQKttWVCGAGY^^ 
MLEQDGILRTVAASIMiAV^ 

SEQ ID 5093 

TTGGATTTTGAAjMCCCATAGCCGMCTGACCM 
A3TTCACCm!mOTTCAGCAAA 
GCACGGO^CamOTTGCCG^ 
OmCTTCGGTATGCCCC^^ 
TCG«XXmAGMCGCGGGCAGTCGGAAGra 
GGTCGOCGTAGGCG&TTACGTCAATA^ 
TTGGGTATTACTGCCGACCGCCTGCA^^^ 
AAAAACAACTtXaCGAAGra^ 
SEQ ID 5094 
IflFEQPIAEMMm^ 
KNFGMPRPBGYRKAIiRIiMKTAEKFGIiPVMrPIOT 
LGlTADRLQKLDLVDtfTO 

SEQ ID 5095 

TTGTTCAAAA3mAAAAAACAGOTra 







CATACG6GTTCACCGT 

G^TTGACCTGATGCRCSMTraGOCCiCGCGfTTCGCGCC^ 



EKTCRPKPFVQPHGGRIVLVFifriLFEriEFVSSIA 

SEQ ID 5105



ATCAAATGGACCGATAOwwvv^^ 

TTCACt&CGATCCCGCCCGTTQCQSOG^ 

^KKHTOTRRLA^ 



TCCGTCTTCACmCGTCTGT^^ 
SEQ ID 5110

VS6RSVKKPIKINIESKVPSSQTSVFR



LHEjQKAFBATQIKEINPKAGDKIiDPNIH^^ 
SEQ ID 5113

GTGTCTTTTGATTGOTC(^ 

ATTTCTCCAGCGGTAAAACCGATCAAm 

ATATATTACAGATMTTACCGMCTGATTTAM(X1^ 

ATAAAGCAGTTTCjGAOTTC^^ 
SEQ ID 5114 

VSFTCFGAFSYQPGSDTRP^ 

IKQreiSH3WNKLQQKKPQKIlJSLIEEQyiiTPYII^ , 
SEQ ID 5115 

ATGTTGCCGAOTGCTCGAACCGCGACOT 

TATCCXXXXSAGGGTGCAGTCATC^ 

(XAraTGAmTCCGCAAA(X?TTTCGA^ 

GCCGGCGAGGATTTGATTG^^ 
SEQ ID 5116 

MIiPTCSHRDIRffrvyiiKTrott 

AGEDLIVKEPKYTINHAREEH 

SEQ ID 5117 

TTGAGGAGCAGTATGATGGGCGATTCCGTCATTTATTAT^ 
GGGMCGCCGCCramTTOCCGO^^ 
TCTTTATATTTACGGTGTGCTGMCMTGAATC 
CCCTAC(XXX3GCTTTA(XX^ 

AAGAAAAAGAACATCCCGAAMTTTGGMGAGCATCCGA^ 
TACECC<£TTCATCATGAAG^ 
CTGACGGOTGGCAAAATOT 
SEQ ID 5118 

LRSSMMGDSVTHVEQADEFVNRAG^ 

PYGGFTVQAMRGQMTEEERTEHOTAWGI^ 

I/TGWQNVADHiEPRH 
SEQ ID 5119 

TT^CaTAAGTGC^TTTCCTGC^ 

TICCGGCACATCGGATGCTCTTCCAAATT^^ 

CTTCCCTCATCTC^^ 

ATTGGTCA<m»GCC^^ 

ACTTTGACCATGGCAAAATCCAAG^ 

OTKjTTCTACATMTAAflTGAC^^ 

AAATATTCGGAA 
SEQ ID 5120 

UfflKCISCHGFMMKRm^ 
IGQSAGPIVQinWIKTVKINIVDPHMFANFAW^ 



SEQ ID 5121 

ATGCGCCATTTGO&AM 

GGATAGGCGATACCGMTTGTTCGCCGTGGACGAAMCCTTC 
CGTCCACGAACGCAGGCAGGAAGG^^ 
TCCTACAAGCCGAAAAAATCGAAAATCCGOCTTW 
CGGATTTGGAAAAAGCC^^ 

SEQ ID 5122 

MRHLENNGXANVAGLmiAVKl^^ 
SYKPKKSnRLPKEAYFGKSGTRLTDEIADGIHIYTCT 

SEQ ID 5123 

TTOAAAAACATCGATATGGAAAAATCAGGCATTGTCTATT^ 

SEQ ID 5124 

LKNOTEKSGIWSMKTVKGVYSEU{LNRYGTO 

SEQ ID 5125 

ATGATGGGAGGGAAAGGCGGATTTTATCGGTATTTG^ 

CCTTCCTTTACACACCGCACm 

CAACAGClTOTCGCCACCGTCa^^ 

ACCGCCCTGTCCGCCOU^^^ 

ACAATCAACGCTTTATCGAACTGCATACGCGGCAS 

ATCGCCTACGGGCTGG^^ 
GC6GCAACAGC(XX3TTGGGCGGCGATGACTrCGAC^ 
CGCCGCCAAAGMCAATTGACCACGCAAACXX^^ 
GTC^TGAAAACOTCGAGCCGGTCAAACAGGCGTTGAM 

GTOGACCCTTACCCCCP^^ 
OGTCAGA(XGCX3ATGACGATACACGTCGTG^^ 
GCAGCACCGACAAWCG(X5jCCAAGC<XMGAACC^^



DSDLLDAEEFAQIQRDIMIiQGRIiKre 
GCG&AOTGGACGMCTGGTTGACGAiTGTTCCAMXX!CGST^^ 

(XXKXX^CACGCCGGAACXXSGCA 
ALIiNEIDHX&QNVLKYK^ 



GYYPSKIRTFSRNFKQRQEISHPPPNTLPQKPm 

AGTGTCATGAGAAATTTTCC™ 

SEQ ID 5132 

LPLISASLTGS^ASEPQAYGVQNSDKTDNVKRLTRPI^FQ^ 

SEQ ID 5133 ^„ Mflrwv ^^ 



CGGGACOrmACAAGGTGCGGCMAACGCTAAAATACAT 
SEQ ID 5134 ..^^^^.^.vnmmSFI^^ 



.PAKPERCRLKTPAGIGTLYKVRQKAKIH 
SEQ ID 5136

MmTDmLVTGASQG^ 
APIffiLTRAIiFPIMQSPDASVIFVGESHGETPKAYW 
SEQ ID 5137 
ATGCOXXX&AAGGE^^ 
CACCAGACAAATGCGCmGTACGTCOOTAGC^ 

GATGACGGACECGTCGGGCGACTGCT^ 

Om<&GGGCOTAAAAATAGC^^ 

AACGGATGGCGMGGGTTCGGGGTGTCXXjGCTTCGACAAXC^^ 

6ACCT(nTCXKXCAA0CCTTG0GATGCG^ 
SEQ ID 5138 

MRAEGTDIFQNGSAmGCLPFmEVNDF^ 

DISRVGRLLQQRBQ^ 

DUAQALRCAGH 
SEQ ID 5139 

ATGAAAACCGTTTOCGCCGOCATCGCTTTTGC^ 

ACGCTGTTCGCACCCC^^ 

CCTQCAACAAACCAAOGXITTCAACGT 

6ATGTAACCGAATTCGGACGCA6A6ATGTCX3GCGATO 

CCGAAATCGTCTATTCTACACA(^GCXK^GGC^ 

AAWXIGCGAAG(XX7P(X3ACAACTTGGTTCA^ 
SEQ ID 5140 

MKTVSAAIAPAAAAVSI/KK^TESSRSIiEVAKVASQ 

DVTEFGRRETOJHQLFGII^ 

SEQ ID 5141 

TT(KXWAAGCCGCCAACAAAAAAMTGMT^^ 
GGCTGTTTCCCGAATCGGGCGTATTTATGGACT^^ 

SEQ ID 5142 

LIUCPPTKKMNAAPGAHAHLGLLI^^ 

SEQ ID 5143

ATGAGAAATTTTCCm^TCMGTm 

GAATC 11 

SEQ ID 5144 

MRNFPLSSVKVIHIIYnrafflllEKSG 

SEQ ID 5145 

CATTTTCGTCCCGTTTTC^ 

MCATGCGCTTCAAACCCAATTTCTGCAAAOT 

CCGTTTTCAGCGTTGCCCGCACCAAA^ 

raTAraTcmTCGcc™ 

CGGATAGC1TC6TGCAGTTTGACGAGCGCX3CTGC^^ 
GCMCTGTTCGATAACGCCGAMCTCACGCXXXTGCCAOT 
GGCGGCACGCCTTTCAGACGGCATTTCC^^ 
CCGGCGCGGATMTGCGGTGGATATAGCCCTT^^ 
TGCTTATTOGTGCGTTTTCAT 
SEQ ID 5146 
HFRFVFQHHRKLRQII&HIjIUjQHQRLT^^ 
AYHLFALCAGIPBQGQFDMEIVVRQPAHPQRKVDFFGVWTO 
GGTPPRRHFLLRHAHRRNPHRISLRNTAVGPAAPLVQPHLAGA^ 

SEQ ID 5147 

TTWAGAimCAmTGAGTCGTGAAATGTTA 

AGAAAAAGGCAGACCGCGAACACATGGACGTGCGC<^^ 

CSAAAAAACCATCGAGGAAATCCMGAGGAAATCCCCGAT^ 

CAAATCATCCTGCAACGCATCC^ 

TCGAAGTCGTTGCXXmAACTGGAC<XX^ 

CACCXSGCCGCAAACAAATCATCX?^^ 

CCGGGCCAAOTGCCAAAGTCGCCGTCAAAG 

raTCGACGTTCTCCTTTGGra 

C^6GACC(K3C!IW;anCGC^ 

GAA(ftTGC(X3CCATCCG(XX£CTOTO 

TATGCCTMCCTCGAAGGCGTAGATGCCGA^^ 
AACGAAGAAACTGCAAAAGCCGTCATCCTGACCGCACTC 
SEQ ID 5148 
LEMFKMSRMLQLAEALASEKNVDAEVV^ 

QIII^RIRDAEREQ^MKFLAVKEDIVSGTVKRVERHGI IVEWAGKLDALI PRDQllIPRENFRSGDRIRALFLRVEEIGNTCRKQI ILSRTSGDPLVKLYANEVPKIADGMLKIRAVABD 
PGQRAKVAVKANDQRIDPQGTCIGVRGSRVNAVSNELSGERIDVVI^ 
NEETAKAVILTAREHWFTEDK

GAttraTITCCAMXGCATraa^^ 
CCWXXXX3AAGTCAMTTAAGCAGC(MTTO^ 

PAESQIKQRILVmORBSt 

CMGCJOT0CTC0CCQTCT<nT^ 
SEQ ID 5152 



CGCCGCAOU^QCGACAAAOHXXX^ 

CftW^AftCGGCTTGACCAATGCO^ 

GOTCTMGGGOCTITGGICGSXJC^^ 
TRTCTfGAA/^TCCtoGATTOTCGAAGAfiCAGCCCAAA 

S Sl!^^A(^RKAPDIMTSPKBSKPASII.VVPPI^ 
QIUBVTTVSAKmVDSRNGKEMSGSASIREfKHHSNSG I 

MTCCGGCGTGGITCTATOJ 

SEQ ID 5158 

MRIUIAStTAQNQATGFNIRTRFGFCILQNPHIjI^^ 

SEQ ID 5159 

irecAAGca^^ 

ATGGTAAATTA 
SEQ ID 5160 



S mOTM^OTATAACAAG(^T^^^ 

SEQ ID 5162

LblHYHKQYALTGKTSPNFIKKSTVPVFFEBH 

SEQ ID 5164

KanMYQCRLKKTGTVDPUlKFGBVLPVRAYCUi 

AGTTTGCCTATATCCTQCGCCTTTGCCTGACACCGAAAAGCA^ 
CATGTCCAAAG6ACWTACGATTCGATGCAGGATTTGAATGCTTC 
CTCGAT 
SEQ ID 5166 



ID 

S ktcaSSaaLitoaacacmc^ 

GCCAC^GTCCCTCGGCTCCGTCOT^ 

aCCtX^ATCTCGACGGACGCAAAAWCTG^ 
ACfmCXmTTTCTTOX^AGAAACCX^ 
gjQ(jgjailJBMC800B9MCOCCT 
GTACCamAACCCCTGGCGC^^ 






SEQ ID 5168

MKKNHLNTOFDLWHTIRE^ 

HRMLOHOTAY^^ 

VPAKFVARSLKTPSAffifflQNIQFAKIDFm 

SEQ ID 5169 

TTGMC^GAAAACATCATCGCCCAC^^ 

SEQ ID 5170 

LNQENIIAHTLPKSGIVQAHRLGNVIKSGG 

SEQ ID 5171 

TTCKXX&TTAAATGGTCGGMCGOT 
AACA(^TGCGCAGGAATTCATCOT^ 

SEQ ID 5172 

IAIKWSERLHEVLTSRWL^^ 

SEQ ID 5173 

GT6CGGAATGTCGAAGCCGAACACGGTCAGGATGCCTGOT 
ATGMTTCCTGCGCATCCTCTT^ 
G7PCCGACCATTTAATCGCCJA(KOT 
ATCCTGCATCGAATCGTAAAGTCOTT^^ 

SEQ ID 5174 

VRNVEAEHGQDACQAlX)mGKFQRSffl)RIIHAAm 
ILHRIVKSFGHEGTQHIQPHFG 

SEQ ID 5175 

TTGGCAGGCATCCTGACCGTGTTCGGCTTC^ 

ACAAmTGAAAACAAAAAACTmCGGCGACTA^ 

CCGCTTTAAAGCCAmTCCG^^ 

CGCCTOTTGCCGGACAAA^ 

CCCGACTGGAGCGTCTGTACGTCTTTTTGCP^ 

(^(XjGCGAAGTTTTATTGGACGG^ 

SEQ ID 5176 

IAGILTVFGFDIFHGIVWFVFIFMIIAIFIAMWIG^^ 
RLFAGQIKIGDIQQTVQAFARLQNALSFFRMFYNKFTA^ 

SEQ ID 5177

ATCTGATTGACAAATTG(&TA^
CCAAAAICGTCCTanCGAOJAA^ 

SEQ ID 5178 

MNTCRL6niIira3KTOGWQHKI£ 

SEQ ID 5179 

ATGCC(nrreAAMOX3CTTa^ 
GGCAGTTTTTGOT 

SEQ ID 5180 

MPSENGFRRHRSAAGSVADIEVHIAERVERTSVSDTDDAVGQFLL 

SEQ ID 5181 

GTGTCTTTCC^TTTGAAC^ 
CGATACTGATGATGCTGTCGGGCAGTTTTTGCTW 

SEQ ID 5182 

VSFQFERCBOTASDGIVQPQAA^ 

SEQ ID 5183 

TATTAAAAACGCACTOG(^AAGCATTTT(XXTO 
CCCAA 

SEQ ID 5184 

Y*OTPQSIFPAPVPSRQAQHVVKFELILPFBRGSKTAAEPQ 

SEQ ID 5185 

GTGAAACAGCTTGAACX^GAAAACATCATCGCCCACAC^^ 
TACOSCAAAACGGAGGCAAAATGCTCAAATCC^^ 
AGGCAACAAAGAAATOGCCGCCATCGTCCAC^^ 
ATCAGCCGCCGGATGCAGGGA 
SEQ ID 5186 



SEQ ID 5187 

GTC^CCAGOTATTTTOGACTT^^ 
TCTGGGGCGAAGAAGGCGCGGGCAAAAGCCATCTGTTGCAG^ 
GTTTGAGGCAGAATAOCTCGCCGTCGATC^ 
TCGGAATACAC(XXX£A(XAGCTC^ 

raaxaxxmccmAGTAA^ 

CATGGGCAAACGCATCAOCCTGCCGCTGC^^ 
SEYTPQQU

CCCC(»TOXX^CTGGAAAAAGM 

GGAATCTAGGACGTAAAATC 
SEQ ID 5192 



SEQ ID 5193 

GTGGGMTGACGAMTTTCAAAGTTATGGCGTTA 1 

SEQ ID 5194 

VGffl^SYGVIGEmJQSRRIYPKQPDFKKT^ 



A6ACCTGTCGGCACGGAAACTTATCGGGAAAAAMGGTTTOT 
SEQ ID 5196 



XAAACTCACGCCGAAACGGA1 



(XGATGCCX^ATCGGGTCGATGATOTAGGCA^^ 
TLYGGMLPKIASAVEAAVNGVKATHIIIX3RLPKAIJJ^ 


FNEIPVLUWAALYLWFKPP 

CTTOTATCCGACCCTTCC^^ 
^SaSIqPAGA^ 

GCMATTAMGMCMGCTT(MGCQGGCM.TGCCGTC 
SEQ ID 5204 



S ^^SILotA^MC^ 

SEQ ID 5208 

LLTISAYVNAKRMjKQSSDGVWHTFGQVEAIiRYGR 
AATCATAGGCCGCTGTC^^

CGAAGGGCTGCTCGGCAQGCT(X^^ 

CGTACCAACQCCGCCXCTGCGMCXnCAACCX^ 

ACATCCACGGC<^TGMGGTCTGCCCTTOT 

CCCCGAOTOmGACGMTACGOTACGAA^ 

TTCAAAGACMCGOyUMX^ 

SEQ ID 5208

LKPCDTOGmVYIWK^KDPPMra 
NHTPLSNSVYYAYFBPYSEE5QHLNIiLGDAQ6S6IiCRII© 
RmAGANLNREWENPTL^ 
FKDlimPDDDFGW^RSI^EAVLSAILNVAGDI^ 

SEQ ID 5209

ATGATTATCCATCACCMTTCGATO 

TCGCQCAAGGCTOTCCGTOITACCA^ 

CCTCGCCCATCCGCTTGATATmm 

AAACTGATGGACACtXSTGXXsO^ 

GOTCCCGCMGCGCATTACGMGATGOCGAAGCX 

TCAAGGC^TCTGCCTGTTCGC(%TO 

CGCCAACCCEACGACTATCKX^^ 

AC 

SEQ ID 5210

MIIHHQFDPVLISIGPIAVRWYALSHLGFILFTO^ 
KUfflTVAPLVPUSIASGM 

RQPDDTfLGIJ/n/SLSMGQ^ ' 

SEQ ID 5211 

ATGGAAATC(X3GTTTCAGACAGCATTTTTACGTT^ 

CCGGTTTTTTCCCCTTCCCTGCG^^ 

TGAATTGCAGGCGGATAGCGCCGOT 

MTGTCCAAACCGTAAAGATA(&CG 

GCX^TTGCCAAAACCAGCCGTKXaATTTCGGAACGT^ 

CCGTCATAAAACCTTGAACCTCGTCG^ 

SEQ ID 5212 

MEIRF^AFI^VQMKTNASIL^^^ 
NVQTVKIGWSVKIAAALAVVVEPQIGQ^ 

SEQ ID 5213 

TimAAGGAAAATGATGGATTCCCGAAAAm^ 

TTATGACGGOGCTGTTGAGCXX^ 

G6CMT6GCGATG6AAACAACC6CCGCGATGTCGGATAAAAAG^^ 
TTG(^CATTGTGCCGAC(^TTGGTTTGAAOT 
TGCAATTCACCGAAGGOSAGCT^^ 
AAAAACCGGCAGGAAC(^T(XCTGTOC(n^ 

SEQ ID 5214 

LKRKMDSRKFTEASKRRLGELLDAKS 
LDIVPTDWFEAADDEAFEE^ 

SEQ ID 5215 

TTGACGGAGCATCTTATGCCGATAGAMTCX^C 
MCGCACCATCCGCATCCCra^ 

gatggaaacgatgtgcccgtttaagatggaaaaacggcot 
gag?itoxgccgaaaatgccgatt™ 

CCGTC 
SEQ ID 5216 

LTEHLMPIEIIMFLIENDK^ 
ELPAENADFDRPDBIXJRB^ 

SEQ ID 5217 

ATGCAATTAGATATAGACCXJTCTGGTCGCCTATTTO 

6ACACAG6CCAA(X3GCGTTATTATT1TOjA^^ 

G(XX&CATCATCGAAGCGGGTTTTG(^^ 

AJMCCXjTCAGGCGGGTMGGCE^^ 

GGCGGCGGTCAAAGCGGTAAAAATCGCTOGTGA^ 

(XX3GGCXXX3ACCACCATCAATATTC 

(^CACTGCCACMCGATTTGGGCT^ 

T^GAMTCGTGATGGCGTTG^GGTGCGCCACX^ 

(XX5CAAACCGATT6A6CTTGG6CAAATTGT^^ 
AGAMTCGCCGACAAAAAAOGCGA 
ACam^GACTOXSa^^



CGACATOTGG(X5H^CCGCCAAAQOTAC^^ 

n _ --^•-•«»^*™^EITOEDIJB^VSDEMGMffl^ 



TCREPRADIVFGIKGEEKRASATGSGPVDAIFKAIESVAQSG^^ 

TTmCAAATTmATTGCCGfTTOTCCAAM 

SEQ ID 5220 

U>PCTAEFAIiFCBHCQQSRMCHKIIJK)CfRI^ 

GTGCGGAACACGCmGTCCGAACAAGHSAAAGOTTC ^ 



AAG^CCTQCTCAMTCGOTAAG^ 

SEQ ID 5224 

MSGIAGIEGHFSEHDOTVYSKATKQPI^^ 

S |£i£a^ 

3TGTACAA! iCCAAAACUW: aUAJjU h\j . ---~««^«»«»»"»^»"«i | (£(JNSJ^^ 

'ATCGAT 



GAAAAO 

TCTGCCGCCGTGTATAT 
AGCTGACGAGTTACAAATACATCGTACMGGCACGGGACAGGTCAGGGAA 



SEQ ID 5228 

LIFljQIEHPSEDC?R8HFYSGLTKIRTHERSRRQYK 
^ FTGGAAMTCmIsGTAAI^ 



TGCOTAATCGAC^CCGCCGCATCCCATTCGACCSj 
OGOTTCSCGGGAAACGG^^ 




GGATACGGAAGC«3AT(^GG<^(?F^^ 

GMGAACTGTTGGAAAACAGCATCACCATCGMGAAAMTTO 

TC(KKXn i GC^CGGCAATTCCGTTCT^ 

GMCCCGMACGCCAAATGATACGCCTGAGCGAAATT^^ 

GACTTGGGCAAAGCACCGCAIITiOT 

TTATGATCXM'CCGAAAAT^^ 

GGAAAAACGCTACCGCCTGATGAGC^ 

GACGATCCCGAACOTTGGAAAAACTGCCGTTTA 

CCCGC(XX3GCA£mTCCACmATCC 

CAGCCGCACGATTCTCGACCAAATGGGCXX^^ 

GAGGTGCACCGCGTGGTCGAATATCTGAAC^ 

LETONAAKIKGRGGYYTVGGGOT^^ 
LFHAVKTAVYWLFVGVVRFCRNYLAHESEro 
TRALSSAALRETKKRHBAFEKNGTAVPKVR^ 
VSOTAVRDAmVSVNLKEPmTVSAEm 
PPAGFEQAQISRIAETOHLAAl^ 
EEI£^SITIEEKLAEFKVKVKVVDSYS^ 
DLGKAPHLLVAGTTGSGKSVGVNMILS^^ 
DDPEPUEKLPFIVVVVDEFAD^ 
EVHRVVBYMQPGEPDYVDDIIiSGGGSBELPGIGRSGM 
SEQ ID 5231
ATGTTATTCAACGCAACGCAGGCCGAAGAGCTGCGCGTTO 
GCATCATTACXXGCATCGAGCCGTCGCTGGAAGCGTGTIT^ 
GCGCGCGCGCATraGGACGTGCTCAAAGMGGCATGGAAG 
CTGOTATTGATGCCGAACAA^^ 
TCGCGCGTACCGCCGrara 
OTTATGGAAAGCTCGTTGCra 
ATGC£GGGCMCGCAGGTCGTCTGAAACTCTAC£A^ 
GCGCGATCGTCATCGACCACACCGAAGCCCTCGTCT^ 
ACmXKXmCAAAT(KX3CCTGCGC(^ 

CCGGCGTGATCCGCGGCATCGAATCCACCGCC^^ 

CCTGCTGMCEAAAMCGCGCCGAGCTGTTTGCM 

GACGACGTAGAAGAAGACGGCGAAC(X3AGCTACAAGCGCGTTGCC^ 

AAGGC(n , GCGCCACAaymXXXXXXm 

CGCTGCCGAAACCTCCGAAAAACGCAGCACGGCAAACCGCAGCGGCA 

GCGGGCAAAACTGCCGGACAGAAAGCGCGCGCCGAC 

AAAGCCGCMCGTACAGCCTGCCGCACCCGTTGCAGAT^^ . 
AACCGCCGCCGTTGC(mACA<^ 
CGCGATGCCAAAGAACGCCGCGAACGCAACAATCAGCGCGAC^ 
ACAMGTCCGTT(XGCCGCCGCGCA(X3TTT^^ 
AAACGGCGACGCACTGATTTATGATGCGGCGQAAAAAATCCGC^ 
TTTACGGCTGCGGCGGAACA(XX!ACG^ 
GlTrGCGCCGTTTCGAACAGCCAAAACCGTCTGM 
SEQ ID 5232
MLFNATQAEELRVAIVIXK3^I^ 
LVUiPNNPRGGGVSRRIEG^ 
MPGNAGRUOiYEDHTPI^^ 
DmVQMGKLSRFGLI^RQRLKPA^ 

DDVEEDGEPSYKRVAEPBEDESAKPPGGEKAKAARPEPAVKGVRHTSPAPTAA^ 

AGRTAGQKARADKAETRMGMRRRN^ 

RIMERRERNNQRIMQNGKK^ 

FTAAAEQARETAQTGGLVLIETDPAALKAWAAQPEVQAGRGL^ 

SEQ ID 5233

ATGAAATGGWGAAACXKX^TGAC^^ 

AGTCGTCCAMTTGAAAAACGAACCGCCTGCOGT^^ 

CGATTACGCCGTCGAACTGGCAAAGCTGC^GACA^ 

GAAACCGAGCCCGTCCma^^ 

TGTCGCTGATGCGTTTCGGCGCGGC™ 

GCGOGAAGCCGO^TGCAGGACAACTCGGGCGCAATTT^^ 

ATATGCAOCCCGG(^TATTTTGGTTGCCGC^^ 
TTTCAACCGC6A3TACC6ACGTQTCGCCACCGC^^ 
AAACCGATTTCXSCAGATTTCCTT^ 
AAGGCTTGGGCCGCCAACTCG^ 
OTyv&uiwuuuAiiuunwiaujuvBirt.xY. — "~""iipsi>PEKIHU.VDBAREQQ!RI!AYIHLVKVQQR



QSUfLAAIAWUiLILtAK 



(^TTACCTCAAGGGATAWXS3CA&GGCAGATQCXOT 



OXSC^GCCCCMTimT^^ 
VNHKRHLVBRLPPNIPIFIFVIK3FIASAAra^ 

ATGGAAAGGGGGAQGATtnTCGAAAGCATGGOTTGTCTGOT 

MERGBMLESMVLSAYAV 

^ ATCAATAIOTOOT 
CTTPTTTGCATTTT 

SEQ ID 5244 

mFTOFSYWFt^miSHPSVHSSI^IDFGKRIKrVTPUff 

TACTGGCTGATGAAGGAT 
SEQ ID 5246 



YHUKD 

CGGGCGAGGATATGGCAGCGTffKX^ 
SEQ ID 5251

ATGACCGTCAGGCG?TTCAACCATTO 

CCTCCGGCG«ymC(OT^^ 
SEQ ID 5252

MIVRRFKHFIYKIPNTAFIGQSKNLTEHI^^ 
SEQ ID 5253 

GT(mACACAJAACGTATG6Cma^ 

AACGTCXXSAAIXXCTGCCCGTCAAACCC^^ 
TTGTGCGCCGGCGCGGAAGAAGCCGTCATO 
TTTTC(£AGAAATAGACCGGACGCATT^^ 
SEQ ID 5254 
VRTQNVWI&QVEOTAECHKMP^ 
LCAGAEEAVIMGGAQIYGQAMPIiATDLRITEVDIjSVEG 

SEQ ID 5255 

ATGTTCMCATACCGGCCGTC6CCGTTTCCTATTTM 
CGACCAATGTTTTACGCAGCGGCAAMAAAAGGCGGCCGOGCT^ 




CCTGCAACTGOTTGGTCTGCG(X3TT^ 
CGCATACTTCTTGGATTTTCGCM 
SEQ ID 5256 

MFNIPAVAVSYLIGSLSFAVIVSKYYGMDDPRTYGSGN^ 
PATALVCALIWLVMAFGFKVSSLAALVATTAAPLAAOT 
SEQ ID 5257 
ATGCTCAAACACTTCACCTTGT^ 
AGAAAGTCGGCGAATA(XGGCAAGGCATCCXXXTGCTCC^ 
CGGCTCTCCGTGGCTGGCGCAAIC^^ 
CGCGGAC^GTGTTGCGCGMGACGOTACAMGGGCTGAT^^ 

CGACAACTACCTTTACGCXXjAACTGCACGT^ 

AACCTCCCCGCCCT6ACG(XjQCTGCTTG^ jj 
SEQ ID 5258

MtjKHFTLFLVLQYFNRKDKFY^ 

RGQVLREIX^GIilSLLPPPPRRMVLIDPPYEEKQDYRRVTSM 
NLPALTRLLAQDEGARYLLDSKIR 

SEQ ID 5259 

ATGAAAAACTTGAAAMTATTTCCGTTGTTGTO 
GTOTAACCGTGCCGAAGOX^ 

SEQ ID 5260 

MKNLmSTOVCAVlJiAACASENSVANYAIQ 
SEQ ID 5261 

ATGCCGTCTGAACCCTTCAGACX3GCATC 

ACACCCACGGTACCAACCACACCGTTC^ra 

AAACATTTTTACGTTGACGGCGGTGTTGCGCCAGT^ 

ATTTTCAGAAGCGCAGGCAGCAAGCAAAACGGCACACACAG 
SEQ ID 5262 

MPSEPFRRHLPSNXQNRAIHTPYRPVLHCT 

IPRSAGSKQNGTHSNNGHIFQVFHTCPIISK 
SEQ ID 5263 

ATCAAAATGAAGAAACTGATTTTGTTGTC(^^ 

CCATTCAGATAAGCGAGCGTGCCGAGCCTKXX^^ 

CCGA 
SEQ ID 5264 

MKMnJLLSVAAMLWACmDRRFVTQESAM 

SEQ ID 5265 

atcttatcactata™^ 

ccacgcc&tatccggcggaac^^ 

acacctgcaacaaaacc^ctgcccgaagcactcg^^ 

gcgggcacgggcaatattcagggtgmgaccag^ 

acactttcgtaagtgccggtgaaaacggacot 

toacactcaatgccgaaaacgaatttgtgctga^ 

AATACtmX5ACGAGCTCTTGOX3GGCG 

GTATCTGCAAGACAAACGCGAOCCGTCCA^ 

A3OTCGATATACAGCC(XyiAAG^^ 
TTTACGGTTAAAAAGTCAAAT



GGGCAAICG(SX XX3C«^ 
SEQ ID 5272 MWinOT tm0TBftW ™ VHOMRRI^mGPCSIHD^^ 




CTGCAAAAAGCGGCGGQCAAA 




LQKAAGK 

C(XX^GAA3TATTTTTGGATi^GAT6CT 
laVBPQSSGIfiGfM^mOTAmTmXSRmAFmTPEIfl^ 



SEQ ID 5277 

GTTOJGGft^TGATGTMATTTGGGCGMC^ 



TGA r 



SEQ ID 5281

ATGGCGGGGCAG6AA6MTT6TCTTGGCAAGTGGTGTATC^ 
ATGTATTGCCTTTGATTAGCCTTGTGCTAM^ 

SEQ ID 5282

HAGQEELSWQVVYQRVMADKDVVGAGYLIDFAQTAENLPFTO 

SEQ ID 5283 

A3t3mTCAAAAAG(Xm?ACCCTCGCACAA 

GCGCCGTGATGGAGGCACAAGGTTCGCMTTGAC^ 

OTAAAAGAACTGTmSCGCAGCCTATGCCM 

GCGCA(XX£GGACACCTGAC(^ 

CGCTCGAACACAAACCCAAAATGATTCTQGCGGGCGCff 

GACMTACCCACGAAAMGCGCTGAACTCTTC^ 

AACAATA(X£AAAACAAGTGAAAATCAATGCT^^ 

GATGAAAATCACCGGCAAAGCCGCCGAAGCCGC^^ 

TCCGCCCXX^TGACCACTCGOSGCTO 

TGACTGCOTGTGCGACAAATACCCCGTra 

SEQ ID 5284 

MFSRSVTLAQYDPDIiAAAIAQM 
AHGGHLTHGASVNISGKLYNAVTYGLDIft^ 
IMEKAI^SSIFPSIiQGGPIiMHVIAmVAFKEAIjQ 
SAAfflTRGFNETDARVLSNLVA^ 

SEQ ID 5285

TTGTTTTCXX?rAACATTGCTATTTTTO 
TTTTCATTTTCAAACACC^GCATTTCA 

CTGATO3ACTTCCCT 

SEQ ID 5286 

I^SWLLFFLFQQNDQIiFQYCD^ 

LIDFP . i 

SEQ ID 5287 

ATGGCTTTCICACCAT^CGCCCATG^ 

ATCATCAGCTTGTCGGTTACCCCTGCC^^ 

OGGTATGGAC^TGGCGGAATGCCCGGCCTTGGTTACAGACAAG 

GACGGGGCAGGA(^GACC36CTTGCCAATTCGC6^ 

TCXK^AGAGGCAGGCATGCXXATATTATTCAGGGA 

(EAACACCmmTAAq^ 

TGGGCAATCGGCGGCMCAACTCTTTMC^^ 

ACCTTGCACAAACAAGTQCAACATATTACTACAAA 

SEQ ID 5288 

M&FHHHAHDIAAWQLVGNVLCHFHLPGVII^ 

KAGgTACQFAVDLAFGRPW^ 

RTRPDNDQQPPVPAWQIGKFLADAVNQ 

SEQ ID 5289 

AMSOTECCGCQCGC^ 

GGCAGCOTAGCGTGATGTTCACGCCC^^ 

ATGTGTCCGCCGGATACX36GCCGGCGAGGITPATC 

GAGCCCGGCGAmGGATTTTGGGATTTTO 

GGCAGGTGGGTCGTTACGAGGGCGAC(XX5CAGGO^^ 

OXXX^GGCGTmTGATGCCTTTG^ 

AT3^GCCACCCTGCC!im 

OTTCGGCGCGCGCGCGCAACAGG^ 

CGGCAAAAACAGGCTGTTTCATCGTGTTTGCTO 

SEQ ID 5290 

KAAARHUXBIiHGSYQAAm 

EPOT*DFGIPDAEm*VVl©NAaiAF^ 

I©PCLNBFGGRDVQHFQPAFSVRAA*FR^ 

SEQ ID 5291 

ATGATTGAGATGAGTAMGATTACCGAAACGATTT^ 
CGGMGGATTGATCXjAATCGCTCGATTCCAAAOT 
CCGTATATGGAGTTGGAAACGGTCGTGCCGCCGGAGGAGGGG^^ 
TAT(nTCGCAA(XCGAG0CX3CCCGG 

SEQ ID 5292 

MIEMSKDYRNDLyDWSyPPQVKWLIRECL^^ 






SEQ ID 5293



TTGAAAAAGTTGGTCATT^^ 
UOCLVILLEEKK 

GGGMCATOfiAfi^TCCTRCCiMATCAACGW 


GNIHVDPTKINEyaVRVGVKP 

SEQ ID 5300

MPSEKFQTVLGSATTSKTANQKACGFVSGAYRRD 1 



AAMCCACCTrCOTMGCm^ 




I^KEUTCLQLEIEA^ 



IYHHVSHELEHGSSEL 
,WITHDRRFLDHIATRIV^^ 
lAELEHASFAYDDKVIMIKFSAIIjQRGiaCIGIjIGPHGIG 
iERKRI^IAKLFTRPANILVLDEPTNDLDIDTQEL 
IPGIFKD 
^TOKSOACAGATC^^
CGTTCGCCGCCGGAGAS 
opQ iQ 5308 
LQCSCPmraSGI(OTVDPCIJX5FQGCT 
mQQIFKQFLGVIWVGRFVQN^ 

CTTTCCGTATAGTCC&TTTO 

ATCCGTACT(£AAACCCGAA^ 

AGGOTTOXJT^ 

SEQ ID 5308 

KPSERSMITQTSRSHGKRRSATACCFRIRIADDEI^ 

RAFVRHGV1P 

ATTTCATACGCAAGGAAAATGTTGAAAACAAAAAGAAC 

SEQ ID 5310 

LimPAIHSGKTARNIRFPVSTTmFTPRCR^ 

TOGCACCCGGTOTTTJ^^ 

SEQ ID 5312 

MFKRPEEIIVLIlAVLWIAGTYFtiAALFGA^ 

SEQ ID 5313

TTGCTGATttTTTCTCCTTTCCCftAAAGGCGGWTCGGACGGCGCG^^ 

SEQ ID 5314 



MGMOGCAAACM^^ 

GCCCCTATACCTACCTOTaMTCCGTTOTCOT 

AAACG(KGAAATC<?KX»ACMTAC<^CGGCMTC^ 

TACTTCGGTTACGAAACCGTCTACMTTTCGMCACTTCGCCtt 

TAATCGACAATTTGAG03GCAAAAIC(Sl(X!TCATC^ 

CGCCATCCCACTTTCCCTCGGQGCAAACAMCCCAA^ 

GTCGTCCCCAGOCAACGCATGAGCAT(XMTTTACCGAG&^ 

TCGT^TTCOTGCCCGAAATCCTCff^^ 

TTTOTMGCGACGCCAAAGAAATCGC03AACACGTTATGOT 

AMTACTOOCATGTGAT<K!ACATOGTTTCCARC^^ 



GATTi 

GTGCAGGAAGGGTTGGATAAA 
SEQ ID 5315 



HISKQEYQAQAAQGYBRIPLVQEIiliADIflTO^ 
YFGYETVYNFlOTAHRUa^^ 
VVPSORKSMEFTONPIALYRAIJlTUffSPYLF^ 
KSHVMHIVSHVEKn*^^ 



ATOCAATK»a^^ 

GTTTAAAOCGTOTGATTGGAW}GTTATTGAAGGAGATGTCOGCTTG^ 

AAAACARCTGGGAAA^^ 
GAAAATTATCGCAACCATATTACAGAACAATTTGCCAM 

CGTTGAAAAAa^ACACOU^^ 

TCAGGTTOTGGGATAGTGCGCCGCCTGAAGACTOTCGGGMTG^ 
CTATCTACCGCCAAATCGGCAAIXOTTTCC^ 

HJYRHHITEQPAKI/m^^ 
SGLHDSAPPEDFSGMPELTWBM^QGPPDDWFreKKTPM^ 

SEQ ID 5319 

ATGAATCCGCTTTTTACCCAAGAACGCAGGAOTTOCATAAGAAATTAT^^ 

TCAATATAGCCAAGGGGATTGCCGACTTACTGCATTCGGAAACGGl^^ 

TCAAAAATTGCAQCATATCAGACCCGGCGACTGGAATOTAAG^ 

GMGAAAACCCC(MTO3GCraX»XTTGGG^ 

AAAATATTCCCACATATGCCAGTTTGCGTGCGGGTAATGGC^^ 

ATTGAATTTGGTTAGAAACAGAAAAGGttGACTGCOGCAaTTGTOTC^ 
CATTTTGCOTATATGAGCTGGAAC^^
ATTTGGCATTT 

SEQ ID 5320 

I^LFTQBRRIFHKKLLDGKI 

EENPELAAAIGSDmTPDIIVTm^ 

HFALYELEQHiQSLNYEDAI^^ 

SEQ ID 5321 

ATGAACACACCOXX^TCCTTC^ 

A(XXXMTG06CCGGCGGCG6AATTTGCCGA0CGC^ 

TGOX3ACGCGATGCGCTCTCTG(^ 

GCGCCGTATCAGGCGGTTTGCAAGGCCGAAGACA^^ 

CttfTGGACGAACTCAAAGCCGCGTTTGOT 

GCAAACCTTCGACCCCGCCGAAAAC^^ 

GACGAATTGGATTATtn'CKaGCGTATTGGCGGTAG^ 

CCTTCGCCGttGACC^my^ 

TGGCGGCGAACCGGATTCGCTGCCGTTG^ 

SEQ ID 5322 

MOTPPILPPAMLGILGGGQIX3RMPAVAAKTMGYKVTVU)ro 
APyQAVCKAEDITBASAQFLPGILKTATLGYM 
DELDYVGVLAVEMPVVGIZniEL^ 
SDTAFQEAKKLHQSL 
SEQ ID 5323 
TTGMGATGTCTTTGCAGTOTXX^ 

SEQ ID 5324 

LKMSLQSCTVAGRNSVSRI^VCRRLQTASVPIiV^ 

SEQ ID 5325 

TCGTGTTTGCTCCAAACAAAACGCGATOT 
ACCGCCAT 

SEQ ID 5326 

SOiLQTKRDSNArGARRCKFF^ 

SEQ ID 5327 

AT6TCGGACGAAAGCCCPA!PTATTmACTG^ 

CGGGTTTCCAGTACGGATTTACPTT^ 

AATCGACTATACGGAAAGTCTGCAG(MTTCGCAGTTCXJ^ 

SEQ ID 5328 

SEQ ID 5329 

ATCCTCCTTTTTCAGATAQ6TACTCAGATAAAT 

AA(XAGTTGGCAATAC(XATAAAAATCAGAAMCCGACCX^ 

C^GGTGGCTGCmGCAGGAATAACmXX^A^ 

TCX^COCACTGCATCGTATCCAGCCTT^ 

GCAAAAAACAA(&TGACGGACAAM 

SEQ ID 5330 

ILLFQIGTQINTIMKEKKHQRSRTEHLIVNP 



SEQ ID 5331 

GTGCAAGACTGCAMGACATCTTCAAAGTCCACC^ 
TGCCGACOTTTCC<&T^^ 
GTTCCACAATCTCGGACTGGGCACTGCCC^^ 
AACCOTACGAATCGCTCG^^ 

SEQ ID 5332 

VQIXmFKVHLHSVQYTC 
NRYESLGSAVLQLDPSIKIKCELMRKHL 
SEQ ID 5333 

ATGCTTGTCCTGTCGGTTOT 
ATCAGTTC(XIATTTGATTTTGATGCTGGGG^ 
TCAAACOXam^^ 
GAAAAATCCCTCTAIIOTG^ 

SEQ ID 5334 

MLVLSVGRWQIFIRIjSALVEIMFDIjKI isfgykaglevfahqfafdfdagvk^ 
EKSLYVAFVFGNPKCFVGIGNGRQVAFCV 

SEQ ID 5335 

ATGAAMCCTACGCACA(XX^CTCTATA^ 

ACATTACCCTGCTCMTCXX^ 
(XACGGCGGCTTCAGCAMCAGGCAnATCX^ 

SEQ ID 5336 

MKTYAQALYTAALLTACSPAADSf^^ 
SEQ ID 5337

ATGGTTTTCCTTGTCC^^ 
GCGCATACGOTACGGTCTGAAACXX^AmCG^ 
OTCACACAACGCAACATCGGCGAOTAC^ 
GAAAGCCCaOTOXX^ 
TGCGTOCCGCMCCCCGAAGTATTGG 
CGGCGGCGTATTCGACCGCCTGACCGCACAAAAA 
SEQ ID 5338



SEQ ID 5339 

ATGACCATTATGCCX^TCTGMCCGCCTTTO 
CGCCAAAATGCCGTCTGAAGCCGTCGGCG^ 

SEQ ID 5340

OTIMPSEPPFRRHPTHQNRKPHPK^ 

SEQ ID 5341 

ATCTTCCGATACGAACCGCAAAATATTGGTTT^^ 
GA 

SEQ ID 5342

MFRYEPQNIGFVSKlQKHa^QKGWLRKVMRPFDBARPR 

SEQ ID 5343 

ATGTTTTTTGAAAGTGTGATTGTATATCAAAMG 

OTGCCTTGCftAAGCAGGTTAAT^ 
SEQ ID 5344 

MFFESVmQKMQPSAVIJAGHAGilSHRyARTO 
SEQ ID 5345

ATGAATACXXSAOTCCAAAAGATCCTCAAA!^ 

MGCAGTCXXSACMGTCTCCCCATTAAAAMCCGGCAGCMTAra 

AGGCAGCACATACAACGACOCACCCGCCAGCTTCTGC^ 

ACCCMGAAGAAGaCAAAAAGTCCTmCGAATTC^ 

AAMCATGACCCGmCTGGCTGATGCAACACCCC^ 
SEQ ID 5346

TQEE&QKVLNEFIAFTQKRGVCG& 

SEQ ID 5347 

TTGACOTTTCGCAATTTATGGGGTA^ 

SEQ ID 5348 

LSFRNLW(^W^SGITLKNILKIQKPSRIP 

SEQ ID 5349 

TPGCAGGCGTTTGACCAACAGATTTTCATTAA^ 

TCC(X)CTTGACGGAAATTACAGACAGGTTTTO 

CCGCAGCCGCCC(^TATCATATCTCG<X^OM 

CXX^TCAGCAGCCAATCX^TACTACACCCXTO 

TTCGATGCGATTTTTGCCTGCCT^ 
SEQ ID 5350 

LQAFDQQIFINPQHIQSV^^ 

PCQQPIMTPLQIiIOTPQCFRIiRQTAFI^ 
SEQ ID 5351 

GTGTCCGCCGTTACGGTAGCCGATATT(XX^ 

SEQ ID 5352 

VSAVTVADI PESNGKGEYKVFTAYVKGKATAGNLKEK 

SEQ ID 5353 

TACTTGGTTTATATGCTTTTGGGTA^ 
ACCCTCAC 

SEQ ID 5354 

YLVBflilXnXSIiFFTVTTGFVQ 

SEQ ID 5355 

TTGArrAAACTGTOGAerTT^^ 

SEQ ID 5356 

HKLSDFLGCSSASWTIRI^^ 
SEQ ID 5357 

ATGCCATCCGTTGAACTGCT(m^ 

TTTATCAACGATATAAGGGMGATACGGCTACCGGAG^ 

AGT(&TACG(XX3GCGCAAATAC^^ 

AAGTTCGATGTAGGCGGAGAAAAGATATACOT 

AAGGTGCGCCGGAGAAACCGGGGCCGTCTGAAAAGCCGATGC^ 

GTOCCGCAAGGGAAACTGCTTGGACAATGCGGCAATGGAAAGT^^ 

GAATATATCX!GTTACTACAACAACGATAGMTC^ 






KFDVGGEKIYIiSPIMJLFNGEWSYH^ 
EYIRYTfflNDRIKLKIiKGIiSPVQYRIQSIKAA 

MCCOAAGGAW^GCCCGTGAAM^ 
CCTAAAAAAGCTCGftlGCCTTQ^GAW^^ 



SEQ ID 5360

SEQ ID 5362

rCCMGGGGTGCGGCTGCCGOTATATCACGAMTCCAACAGG 
SKGCGCKYITKSHH 

TGCCGTC 
SEQ ID 5366 

mPLYHElQQVElPFMAAFGAFPVVGDVFKLCPtmVFHVa^ "j 
^ATCiAGGGGAGOTffltf^ 

CMWcaaaumoac^^ 

GTGTAGOTATAACCGGCATACACG(XAGCCRJOTOCCTATTGCOCCG1^ 
TATGjTGGCGTTCOTGlftCGAAAACCXftTGGACGGT^^ 

S ^,SafSa^SQITFN^^ 

AC^GAACTOmAC^^ 
TTTCGTGATA 

SEQ ID 5372 

HIOTASIXHHTKERRBMKKHGTHKETBO^^ 

S ^^gIL«XXOTATCT^^ 

(XOT^nWATTtTGCGQCTTCGATTOCTGCT 

S ^m5S^FPRGAAAAISBNPT(mLCP^ 
RIiFItiRLELIiLFLCRGV 

S !SaSgS5LcC^^ 
CAMOTGOGraGTCGGTATAGMGCCGMGGCGGCAftCGGOTAGGCTO 

AMTCGGCGGCGGCTACGTCATGTTCAICACAG^ 

rCTATmmxxacGAAGAa^^ 
C(XX&TTGGCGTGGGAC^

ecmT<xxx&GTCc<^^ 

SEQ ID 5376 

MNAQTLLANVAELVKTKI^ 
UGIEOQ^AECVEGGYm^ 
SLGimVNVVHAEVKSY^^ 
SEQ ID 5377 
TTGTTCCTTMGGTTTTTGGGGGGA^ 
GCGGTTTTtnTTGCTTAGGCGTATGCCT^^ 
(XXXXjTTGTAfiCCCGCTTCGAGGAAGG 
TIXCGCCAAAGCGCCHXXEGCATTG^ 
ATC^CTTCGACTTCGCCGGCG^ 
TGTTTTTGGGTACGGCCra 

SEQ ID 5378 

LFLKVFGGKSGARIilROTSRRLGG^ 
CRQSA(X3IEGLLGQEFFQGCEF^^ 
SEQ ID 5379 
AT6GCACGTTTM0CGTACACA<XXTCGAAACCG(^^ 
CCmOTITOGCGTmACCM 
<m(XKK^CACCAAACTCGCMCCCT6A^^ 
ACCCAAGCCGTMTG(X£AA^^ 
TGTGmCTACGCCAACAACCTCOXrAAAra 

SEQ ID 5380 

MARLTVHTLETAPEAAKPRVEAV^ 
TQAVHAKKGAVSDDEU&FliEAGYN^^ 

SEQ ID 5381 

ATGAACGAGAACTTTACCGAATGGCTGCACGGCTGGGT^^ 

TTGTCC^TTCCGCCTGTTOGGGCGaG^ 

GGGCGTGGGCAATATCGCGGGCGTGGCMTCGCCATCAAM 

GOGCAGCTCTTTAAAGTCQX^CTGCGACAAC^^ 

TCTGTTTCGGTTTTGT^^ 

TATCTTCGGC6GCATCAGG(X^ATATCCAAA6CG6CGGA 

GTGTPCG(HCAGATTmTCGQGCGC^ 

CGGGTATGGGTTCOGCGOCGAACGCCGCCGCCGCCGCC^^ 

CTTCATCATCTTGATTTACC^ 

ATGTTTGOTmCCACCOT 

TCGGCCCGGTTGCCAATGT^^ 

CGATTACACCGCCAAGCTGAAAATGGGCAAAGACCCC^^ 

SEQ ID 5382 

MNENFTEWLHGWVGAIND^ 
AQLFKVRDCDNHHFRGGPAYTTETO^ 
VFGQIFSGAFKFDmGGIdX&LISQTH^^ 
MFAFSTTOGmYAESNV^ 
SEQ ID 5383 
ATGCGTTTTMGGCAGAGTACGCCCGAAGCAAAAAC^ 
CCCAACCGGQCGGC^TO^ 
AGCAGATGGTGAMTCCGTCAAAACCTTG^ 

SEQ ID 5384 

KRPKAEYARSKTOADDKRGNAHIANGAIGKm 

SEQ ID 5385 

ATTCCGTTTTTGGGATAGCTMTGTAAMTA^ 

SEQ ID 5386 

LLVFFIFLSFFOQFGIKPMENLKNYVIiD^^ 

SEQ ID 5387 

TOTTC6GCAC6CTGTGG(KXTOTTTCAA^^ 

ACTCGGCGTTTAACGTATTGGGTTTGGCGGTTT^^ 

CGCCAACC(X!CAAGCCGCATTG(X^ 

SEQ ID 5388 

LFGTLWACFKRMWGLGLGMTGA 

SEQ ID 5389 

TTtmTCACQ(X?ETGGGG^ 
CGCGCGCCCGTGTGCAGACATACGGCTTGMCAT^^ 

SEQ ID 5390 

LFDGI^QG<3fiM)FVWFH^ 

SEQ ID 5391 

«IT«GCGGTGC7iramX5GGA^ 
CCGACGGCAACGAWACGACCGCG^ 
ccccamcsAA^^



VGHLYEAKPALKQFYRH£)PTNSFNPGI6KTSKKK1R?AE 

GGATATTTTGGTGCGGCAAGTTGCACGCTGCM^ 

SEQ ID 5394 

l^SPTWVBIJiIKCEAPITI^^ 

QIAANAKNIKLHYFFNQK 
SEQ ID 5398 

GGAAATGCTCGCC(MCGTCGTCTATGAA 



AAACACCGCC 

SEQ ID 5404 
GCTTGCCGAAATOTWU^

MCGTCCACACCMTATTCCGGTC^^ 

ACGGCATCGGCATC^CCAAGCTOMTTCC^ 

GGACTTACGCMCGCCTACACGCCGTCTTTC 

TGTAMCXXKSTCTGCTCCACCCAC^^ 

GCGTTTCCGTCAAACATTTTGMGAACTGATGGA^ 

CAACTACCTTCCCGATTTCGGC^ 

G6CTrCCCCGC(^GAACTTT^ 

ATTTCATCAACCGCCXACTGCCCA^ 

AGCCGTGTTCTACTTOCCCGGCTGCGGTO 

TGCGGCTATCCGCAAGACGC^^ 

TCGTCAGTTGCGGCACTTGTTACGACCAGCTTGAAAM 

CGTGAAAGGTCAGCAATACCOT^ 

GGCGMTCCGGTATGTTCGCCGTCAAACGCCC^ 

TGACCTCCT(XmX5CCT^^ 

GTTTGTAAAAAAAGCCAACAACGGCGGOGTAGAGAAAGTGTTGCT 
SEQ ID 5406 
UffllAFSNPAETDFSNGIQIKTAKHGHR^^ 
NEIRKRH3DNQQADW^ 
SAVIOTHCLDXHRCrVEYVE 
OTUDSIXjINIVKTERLEIPC^PRK^ 
GKGRPKMVLLADWSDDEAAVE^^ 
TDLPTAELI/jERGKHALAHVAAVKAR^^ 
NVHTNIPVHSDnAEML^^ 
CKPVCSTHVPRANIiLYSPRNKIIX^GI^ 
GFPAQNFAYKIGKLLPIGTKKQKAEPKATVGKAP^ 
CGYPQDAtmAKAEEMSTN^ 
GESGMFAVKRPDIATQVKFRKQEEIEKHLKELPQGEPV^ 

SEQ ID 5407

TTGAATCAAATAOTACXCGAATCATGATTGA 

CAOTATCGGCGGCTACGACGMTGCGTCAGCGMTTO 

C(mCG(OTGCACC(mX^^ 

ACCGTCAACAAACACAAAGGCGGCGCAA3TCT^ 

TAGGCTATGMGACAAAAGCCCTGOTTCG^ 

GGAATGGATAAGGAAAATCCACGATACCGTCAATATTC^^ 

GGTCGCGGTGCGGTCAT(X^(XXXiATTTGGC^ 

TGTGOTGACAAMGAGGCAAACAACAAATACCCGATCGC^^ 

AGATGCGGATGMGTGCGCCGAATCCTGAACGCATT^^ 
SEQ ID 5408 

UJQiraimXKJTTEPKQKraiPAPMRaVD 

TVNKPG(MLIiKEPELIPHmTLRGRLPAm 

GRtoVIRPDIARQIKQYENGGPVKD^ 
SEQ ID 5409 

ATGTTTTGGTATGTMTCGGCTTTTGCGCtT^ 

TGGG?TTGGGGGCGAAACTGAAAMCAAAMTACACXXjAMGCAGGGGAAAM 

SEQ ID 5410 

MFWYVIGFCAFVVAIiIiSIi^AGA 

SEQ ID 5411

TOTTATCTTTATTGCCGTTTTTCX^ 
GCGOmMTGACCCACAGCGA 
ACAAGCXXaUXSAGCAAATAAATOTCTGA 
SEQ ID 5412 
LLSLLPFFPCFRCIFVPQFRPQTQTFFIIMi^ 

SEQ ID 5413 

CCGGAGCCGCTTCCCCG(XMGCGGOTra^^ 

<^3AATGOCX3TCTGAAAM 
SEQ ID 5414 

PEPLPREAAFPmTVKTCM^ 
SEQ ID 5415 

TTGGTTtftAOCTAAGTCGA!^ 
SEQ ID 5416 

LVVPKSIPITPM^I^PAYFmVGTFCPDGIj 

SEQ ID 5417 

TOATGGAGCAAAGGTTCTACGAA^ 
TTOGGTTTGTGTGAAACAA 
SEQ ID 5418 
LMEQRLYEGWKATC(X!LVWSR^^ 
GCTGCGGC

SEQ ID 5420 

HSETAIGJCLNDRSENAEKPNBKVELPIVD 

SEQ ID 5422 



SEQ ID 5424 

MFNPNLKREAVCnPSEAQSGI^^ 

SEQ ID 5425 M *^«^xwnrc>^^ 

SAAWLKRNTGIQITPQINAASVGIjSHKP 

==^^ 



SEQ ID 5428 opmMMnr v, m cmn^sropsGRH^^ 

I*SKNBGKPSKTKLSASP^^ 

IPTPIXPIKEPKSI/rFVLKRLIjMiVmPV 




TATTCCAATA 



■SSSS-- -■— "-— "5= SSSS=SS3SS:^S5£ 

^^^^^^^^^ 



GCTTCTGCAAAGAAAGACGATGATGTCGTAGAIGCCGACTT^^ 
QGEVLSGGRSDVLLIiDVTPIiSI^

ITIQGSSGLSEEEIEHK\^ 

ASAKKDDDWDADFEEVKDDKK 

SEQ ID 5433 

A3XJPTT6ATGGAGCAAAGGTIOT 

SEQ ID 5434 



A!TCTTGCGGGAAAG<^CATTTTCAAAAAAOT 
TTCGGGGGGTAATTGQAGAC 
SEQ ID 5436 

MLRBSNIFKKVNLLFYIEILPF^ 
^ATOCTTC^^ 

CAAATOTCGGCGAAGACGMGCCGACATCAAACAGGG^^ 
GCGTACGCGAATACGATATTATCGMGTCCGGTATATT 
SEQ ID 5438 

QIVGEDBADIKQGKIYVGSPIARALIGKEBGDrAEVQAPGGVRBYDIIBVOT 

GCGCGGGCAATCGGAGAGCCGACAYAGATTTTGCCT 
CGMCACGA'ITITOCTTCGGCGTGGATTTCGGtt 
CTATKXKKOTmCGGACAA^ 
ACXMH^GGGGGATTTTTTGCATCGGTOT 

VPGVPGQIAVRSGFGNRFDHFGTRHAFELLQPIiP^ I 
SEQ ID 5441

GTCTGCCGGMTATGGAGAAAAAGA(X!GATGCAAAAAATC^ 

GCGATTG(EGAWX£AGATC(^ 

CCCACATCATCMTCCGGCCGAAATCCACGCCGM 

AGCC6AWTCAAACAGGGCAAAATCTATCTCX3GCTCT^ 

ATCGAAGTCCG(H , ATATTT6ATT06GOT6ATTTCGAT^ 

SEQ ID 5442 

VCRNMBKKTDftKNPPDRTRCGIAETC 
SRHQT<$NLCItf^ 

SEQ ID 5443 m ' %%hfMfWa ^^ 

TTCTA 

SEQ ID 5444 

LRRQAYRVPSCFVRTOIKIKIQAARnGGIiSVVEQRIiFyQIIi 

MTTTAAAAAAAATCMTTTTCJ6GCA 

SEQ ID 5446 

liGGYSTKITEIGNRAVPPANALCP^ 

SEQ ID 5447 

GTGGAAGGTTTGGGCGAGGC^ 
TGGCATTTTCGGTTTTAAAGTACGTGAATT^^ 

SEQ ID 5448 

VEGI5EAAGRRTDSVMAKRKYKDKNSLHY 

SEQ ID 5449

A1X£TC(^GCACTCX&TAAAC 

TGCCGGCGCCGATGTC6AATATCTCGGCAACGAACACT 
ACCtmTTTTGATGGCGTTGC^^ 
AGGmmTGAA(^ACCGCGTTTTCAAM 
GATTGCCGCCACGCOTTCCGCGTTACC^^ 
GMGICTCGCQCCCGI^ 
SEQ ID 5450 



TALUJMJLTCQAFEIKMVGBXIS^ 
EVSRPKGRAVQAFDLDAHHIPDA^^ 
SEQ ID 5451



ATOC^^mAOCCCAAAAC^^ 



CATCCWnTCCGCC^^ 
TCGAA<»CA^TCAACC^^ 

^^^^ 



S m^L 0 GGHAOrEAAl J ^ 
m^AVEOTTI^SOTTAMI^^ 

TIEGIinrAUUiNoi ^iiuwuiuu^/uxnvy! »y DHEEIDSI^IKTIiKLPDQIDYGRVKGIiSAEVQQKIiNCHKPEfrVGQA 



SRISGVTPAAVAIAMVHLKRGFKDAK 
SEQ ID 5454 



GAGEQHTEQYKHFFHGFSFKGCKQQTASCraaiAD 
GGAA 

SEQ ID 5458 

BSRPPRENEGKMPSEALFSGFEBBFSAIjFECFGQ 
CGCCAAAAACTACOTGGAfiCGCATAOTTTWCftCAGAA 

S 5JSpAAttmi^^ 

JOQFVyKNALNVKLPSTAEIBttAASRKIPDSRIiK^ 

l\.iiva31^ia\.owu«iwv.«v, ~^**^jcrcftCGM3IC^^ 



ATGCCGTKCCGCCACAAAAGACAAAGGCCTGACCGGCAGCTC 
fA 
SEQ ID 5462

HHTDPKIQAMPSETK^ 

LAAERCMWRLLM^ 

SAHNAmiRSGNIGKGI£AL<m©^ 

VmTNS^ATDVMVHSGYVKYRKPLLKAGm 

HNRLQWHDPATWCTYPNEPEAKW?KR^ 

SEQ ID 5463 

ATGASTlTCGATAftrAATPATCATTACAA 

(KflACAATCACCCTATGGAAAAAAACACCC^^ 

ACTGCCC<^TTGCC(X^ 

CGATGCTGGTTCCGATACCGCGTCTGACCAT^^ 

GGGCGGTTATCTGGTCGAATACGOTCGTGGCACTG^ 

GCTCTGGATTTCAAAra 

TGCTGTTT^CGCCOT^^ 

CGCX^TCAGCTC(^TTCCmTCT^ 

(^TTOCCTCTCGCTGGCG^ 
CAACCMCAGOT(X£ATCAG^ 

SEQ ID 5464 

HIFDNNYHyNAMPIRLLVNIQDADSAGIQTASENNNG^ 

GTKKVFFGSIAVFKLGSAI^AASGSLFELTLSEW^ 

ALDFRGYLTFS^AACLLLIAAE^ 

LIAPLMKRFGYRTVI^^ 

IGG ITIASS LVFKRLHVSDGANLTRGTRP 

SEQ ID 5465 

ATGAGC(X3^ACCATTCCCCCAAAAATCATCC 

TAATTCAGCAGCAGGGCATACGCGTCCATTTCC^ 

TTTCCCCGAAAACTQCXXX&TTGTC^^ 

CGTATCATTGAGOTACTCATTTCTTGCTT^ 

TTTCAGGCTGGCAGTTGATTGACGGTATGGAAMCGTO 

(^TGGTMCCGACAAAGTOCOSCGCTGAAAGAAA 

SEQ ID 5466 

MSATIPramDSNFTOW^ 
RIIEFTHFLLAIGYKPEDKGEPVKVAVHTSCAARKHdNVH^ 
RPKHIASFLLERTGGKV 
SEQ ID 5467 /wimmm ^^ 

CGATTGCACCGACCGCCGttCMGGCTATre 

CAOACTTTmCGGCAAAM 

CGCGACACTGATTGGAAATACAGCCGCTTCAATCAGG 

CGCAGGKX3T(MCGACGCATTGG^^ 

CCATGCC(&CAATTACTCmGCG 

ACGGCAGCC<m«?^^ 

CCGATACCTTATCTTACMGTTTGAC^^ 

TTTCCCOTCGCCTCATCACAAC^^ 

CAAAAATACAGCGTTTCCCTCGCGCATTTGGM 

SEQ ID 5468 

MKimTWALMTVFSVAPS^ 
RDTDWKYSRFNQALVLGFVPS 
TAAVSYRNDSQNGERNIliTAMCDFljN^ 
QKYSVSMHbmOTNTERFNSLMIVQNRMSQ 

SEQ ID 5469 

ATGAGCGCGCGCGAAAATATTTTCXSC^^ 

GTCTGAAACATTGGGCTGCCGCCATGCXX<^^ 

GCTGCCCTOtmACCGAAaC^ 

GATGCAGGCTTCAGCGGCGGXAATGCGGCAT^^ 

ATACGTCCAAGATGTACAACGAGTTTCACAACGCCG^ 

GAC(£TCGCTTACGGTGCGCACG(nXXX^^ 

SEQ ID 5470 

MSARENILAKLKKAGALPMEEPAVTO^ 
EAGFSGAQCGMTGTLffiiFSSPEEPRTLSLVPPVHFCIf 

SEQ ID 5471 

ATGACTACGCAAACCATCAAGTTCCACATGAAGCCGGAAACTO 
0CAAG0QCAAA!(XX3GTOTGACCGACGA 
AGCGMGAMTOaGCTGAAC^



AGCGAAGAAATCGAGCTGi 
^ACTMCGACTO 

CGTGATACOGGCTCAOTTOTGCGGCCGATGTCGC^ 
RDTGSFFAADVAVFNPWHD 

CTGCGGCTCQCGCACGGAATT 
LHLAHGI 

GCTCGCGCACXSGMTTTGATTGCGTTGTGQjTGGCCTCGGAT^ 
SEQ ID 5478 



ARARNLIALCVASDKFLITTEAIAFPS 
TACTGATAACGAGGTTTCTTTG 



CAGGCATCCGCftGQCGT 

SEQ ID 5482 

imSDFl^SRATVPYI^GR<nm0VKKRUlPPEFSGIRRR 

VTESEAMiHEYIRYYHHISIIKIXLKGIjSPVQVRIQSLKAA 
CAGAAACCTGCTCGOKAGCAGGCCTO5TTI 
SEQ ID 5487

ATGCCATCCGTTGAAC^ 

TTTATCAACGATATMGGGM^ 

AGTGATACGGCGGCGO^TACCGCTCGTTCAAAGG^ 

SEQ ID 5488 

MPSVELLLGIVGLH^^ 

SEQ ID 5489 

ATGGAMCAGCGTTGCA&MGTTCG^^ 
TTGTGATTGACCGGTATTCOTXGTGACGGATG^ 
CTGtmTTGGACGGCmTTGGTWAGA^ 
CTTCTTOAAGGCMCGAGCT 
SEQ ID 5490 
METALQBVRHJFAI^^ 
LLSRHEAFLGQPIiPL 

SEQ ID 5491 

ATGTCCCAAAAAGCTGCCGCAGACCMCTGAATCTG^ 
AGCCCGTGAAAAAACAGTATCCGCCtXAMCGAAAAAAGC^ 
TCCCTTGAAAGAAGTGCGGCAGAMGAACGCAA^ 
SEQ ID 5492 

HSQKAAADQIMJECSVLPQW^ 

SEQ ID 5493 

ATGCCGCTTTCACOGTGT^^ 
ATGCTACGGMCCGGTKX^^ 

SEQ ID 5494 

MPLSPCNLYLATHLRTNESEMERYS 

SEQ ID 5495 

ATGGCiWK^ATOCAAAATGTCTGCAT^ 
CCCGCAC(X»TTTTG^^ 
AGC(£GTTTGGTOX£ATTGCC^^ 
ACACGTITreAACCGCTTATTCGGGATTT^^ 

SEQ ID 5496

MAAITKCLHQVHETQ^SVKQIJARro 

TRFBPIiIRDFHQRUjYSELTKTGTALPRK^KGTIP 1 1 

SEQ ID 5497 

TTGTTTTTAATOACTATAT^ 
ACGGCATT 

SEQ ID 5498 

LFIJmLTASKGKMVSMIJ^ 

SEQ ID 5499 

TTGAACATCAAAIKXEGTC 
ATACTGGATTAAAAACAAAA 

SEQ ID 5500 

IMKCRLKPEI(^AFCI^FQKEAASTYSHYFCPUlQSEY^Lm 

SEQ ID 5501 

ATGAGCCTATTGATTMTAMGACACCAAAGTA^ 

CAGGCAAAGGCGGTCAAACCCACCTAGACCTGCCT^ 

TATCGTTGAAGCOTTGLAT^^ 

GTCX3GCCCTAACTGCCCGGCKX3T^ 

AAGCCGTGGCACAAACCACCAACCTGGGCTTGGGTCAGT^ 

TACCGATGCCATCATCATGAW^^ 

AAAGGCAAACC&TGGGCCAOG^^ 

QCACGACGATCCTGGAAGTGTTGAAAGCAAAAGGTT^^ 

SEQ ID 5502 

MSVLIKKDTKVIiVQ^ 
. VGPNCPGVITPGBCHGIH^^ 
KGKRMt^GAIISGGKGTAEEKFAAFE^ 

SEQ ID 5503 

ATGGGATTGGCOGCAGGGCGATTCOTAAAACOCACTO 

TTTTGGCGCACMC6GC6MGAA6CCGCTGCA6CTTA 

TAAAAGCCGCGAAAAAGCCAAAGMGTGGCTGAAAGCC^ 

CCTCTTCAAAC^GCT^ 

AAATCTTCAAACTAACOSTTGATCCG^ 

CGCCTACAAAGOGTTT(^ 

TACCGCCT(X£AAAAATC(X^ 

AOGGTGCXXjGTTOGCGATGGCCACCATGG^ 

GATTTTCGAAGACftAATCCGTTW 

TAAACGCC 
CTGTAOGGTGKKGGTOjQGWTGCGGMGAACGGOTG^

SEQ ID 5506 

HVIIFFHRKVGHCQTFFQOTG^ 



TOCGCCCGTCGTPi 



^TCGTaTCaCACGGGC^CTGGCTTCGTTG 
CGXCCAACTGGAATGCTICGACATCOTCGAAGTTGCCCGCACC^ 



WOTPRGVITFIiDrPGHEAFTAMRJIRGAKATDIVI 
V^IRQTCSVSKVGNIAO^ 

ess 



Sotcgtccgtgm^ 



(aCCCGACCGCCATCGC(SXOTTO}CXj(MCGCGGCAAC „.^,^ 1 ^»,-™rv-»rw^xi-iryv^'nrit('!il 

ESS 



GCCGMATAGMTGCCTffi^ 

UUAjAiftiiujKi i^v.i\iftiwni . — » ^ATGCGGTCAGCOB&TTG^^ 
TGAGTTTATTICrTTCTT^

GACAGCGAGTTCGCCCT(aTTGGCGACMCGAA 

CaTCCTGCGCGAACAGGGCGTAMCGG 

(XikCTTCAAMTGCTGG(X&OT 

TTI^ATCCGMCACGCTGAO^^ 

AGTTTGAAGCGCGCCTGMTATGGTTCAC^^ 

GOTCACGGCGGC^TATTTCCGCCGATT^^ 




mi 

TTGCC«XX^CGTAAAGCOTG(K3C 
SEQ ID 5510 



LIAAQSVEQTPEAREGLHLFLVTPRLGT^ 
KaiAI^EimVENTC^ 
HPTAIAPFAGAATGAGGEIRDBGAOTGSRPI^TC 
QOTHKDEIPEGALIJQLGGPGMLIGLGGGAAS^ 

HRDQMVGKYQTPV^^ 
KSBKKSWSPL^^ 

DSEFALIGDNERSALFADVKFDVNEDIAAP 

FADPmTLC^CX^VSNI^ 

ITNADGRVTIMMPHPERVYRAAQMSRKPEGWra 

SEQ ID 5511

TACAGGCAGAACCCTATCTTTGCACCTGCA^ 

SEQ ID 5512 

YRQNPIFAPAKPTNFVLINANMHRCP*KKMKICH 

TCGCCGCATTGTGGCAGATGTATGTGATGATGAC^^ 
TCTGCTCTGTCCC^TTCGOT 

ATQCCOTGCATTGGGCGGTAOT^ 

(XJGCGACGTAAMSTCGACGCGGCCAAAAT^ 

GOTTCCTTTGCCGCTAAAAATGGCGATGCT^ 

OGCAAGTCGCTATCGACAACGTGAACGTATTGGACAAC^^ 

ATCGAACCOX^TGCGGAAOT 

ATCGAATTGGGrcTCAmTCGGCGACATC^ 

GCCGTATTCCAMCACCAAAGGCCTGAACGCTC 

CGACGKXtfTCGC^ 

ACCGACCCTGAAATCGCTi^GTGGGTAAAACCGAAG 

CCAAAGCTACGGTTAJAGTGTTCG^ 

(^O^GCGAAGACATCGCC^ 

ASFAGKNGnAYQIEVOTGBKTOEAiaiVIVATGSV^ 
IELGVKIGDIKSEGKGVSVAYETMGEAKTEVFDKLIVAIGRIPNT^ 
TDPmWVGKTEEQmEGVEmGTSGFGANGRAL^ 



TTGAATGACTGCGTTTTGATGGTm 
ACTACATOCOTACATCTAAATAAAAATTCAGA^ 
SEQ ID 5518 

SEQ ID 5519 

ATGATTATTGATGTAAAAGTACCTATGCTGT^ 
CGGACAAAGTCGTmGGAM 

^CGTGAAOT 

(yjXXTGAAGMCGCGTAXAA^ 

GGACTTGCGTCO^GTACAMGAAA^ 

TCTGTOACGGCAAAGACATO^^ 

«KXaAACAAG(^ATTGTTGATTACGCAAAAAAA 

CCCGATCATCAAOXXK^^ 

CACOGTATCATTGACGGCCGCGAASCTGTAT^ 

0VNVIfl39GWX&VLKEOT 
SVDOTIVYHGYPDIGIMGSPRGLWPIIiRDM^
HRI IIXaiEAVLTLVAIKDALEDPVRIiLIJ)I> 

aTTOGTCTCTCCftftlGGOXriTOa^ 
HFVSPHMiSDGIiLCVFHSATOHSA 



eeMQCQCCAAACAWU^^ 



GCGTTA 



YI^GHNEGIHJPTLTQPbMYKKVSQHPGAKAIjI i EybiAiajv V V ^l^qegIjGEFLVIDSIIjKEEAVHAFTO 



GAQVTIDQFI£SGETKWGRUX3LTTIL: 
NDSVKRVVLCAGQVYYDLEAGRABEK1 

All 

wactaatoxxjAAcaocgcaaaatcgcgatttaocgcot^ 

\inft^vjvjuuu.iui./iiuv^* -~^*«^'^ w ATTATGTGC0GGCAGGCGftGA(5S 
SEQ ID 5526

VGYNLQRSDNMSKSIKLNVPG^ 
TMVHEQLTV^RGFRRDAHPMAMMVGVVGALSAFYQ 
NASTSTVRIAGSSGANPFACIAAGIAC^ 
ERKLYPNVDFYSGIVLSALGIPTHl^^ 

SEQ ID 5527 

TTGCAAGTTATMCCCACGTTACTGCATCC^^ 
AGCATCGGGATMGGTGCOTTTGIXXGT^^ 
CGAATTCTTTTTCCATA^^ 

SEQ ID 5528 

LQVITHraSAARSIIiSIEm 

SEQ ID 5529 

TTGCAGGCAA!TC£CCCIOT 
GCGGCACGCTTGTCGCGGACAAAGTCAAAGACCT^^ 
TACCGCEC&GGCCTACGCGCGTTACM^ 
GMGO^GTCCMGCCGACTACAMMCT 
GCATAGACAMCGCGAAGCCGCCAAAGGCTTTACCGTTGC^ 
C^TACATCAttXXXEAC^^ 
SEQ ID 5530 

LQAIPLLKTSfflNHRLTADSRNDYYNGYM 
EAAVQADYKNYFAKSYNIGAAT^^ 

SEQ ID 5531 

ATGCTTTCAGCGCTTATCTCTTCCGAACTrA 

TTCCAACGCCCACTGCAGATAGGGACCAAACT^^ 

GAGCCGACATCGAGGTGCCAAACTCCGC^^ 

CACTATGTCCTGCTTTCGCACCTGC^^ • 
6CTTTGG6AGGAGA(XX3CCCCAfiTCAAACTGOT 

SEQ ID 5532 

MLSALISSELSYPAMQIJttiQ^ 

HYVLLSHU?DLSVSQLSYLI^ [I 

SEQ ID 5533

ATGATGGTTTTTGACGACATTGCCAMCGGAAMTO 

AAGAGCTGTOCGAGTTTKXXSAAATCCTT^ 

GGCA 

SEQ ID 5534

KUALIHGHSETDK^IRniEKIRRA

SEQ ID 5535 

GT6TTTAGGACATACGTCTftC(XIAGT^ 
TAG(KX5TTCAGCAAGCCGGAC^ 

SEQ ID 5536 

mTmAVHNGVAAEQTVRIVQIIQTU^ISAVRm 

SEQ ID 5537 

TT^TGAGAAAAGGAACACTT^ 

PGAAACTTTTGGACGCTTTO 

GGCGTGTCTGACCGATTIACGCAGCTO^ 

GTCAAACCTTATGTTGTTAACGACAATCCGATTGATC 

CCGCCTGTCCGTCATTCTGC^^ 

GMtf^TCCCTACOTTTGTTC 

GOT 

SEQ ID 5538 

U4RKGTLLMEKMSFEIYRYNP^^ 

VKPYVVNDNPIDADKERIOTEERKEU^ 

V 

SEQ ID 5539 

TTGMTTGTGCCGTTTTGTCT 

ATOXHEAAAGGT^^ 

CGGCAAAATTTATCAGCXaTCCTO^^ 

CAAAACGTCCGTGOCAATACACAATTCTTT^ 

TTTTCCACGCCMGGCCGTGATGTTT^ 

GTTGGMGATATGGAATTCTGGCMTTCCACC^^ 

AIGGAACOTATC^^ 

AAATCGACCATATC6G!I!OCA6AAAAAATTATGGAAA 
CTATATGATGGGCGGQTTCCGACCMTTATCAC^^ 

ATGCO(XjTGAGTTGAO0^ 
TACCGATGAGATTCTGAGCAAAGGCOTCAAG 
HBSHHLGTBSIiIDLVVFQKAAGDSfflKFIKIKiSDWKP^
imiEVAKATWSAEARKESRGAHA^^ 

CEAAAGAATATCCGGCATGGtSGGCATTTITrAGTCAAGCTTGCOT 



CTATAT 

S k^SLdwvm^tavmjota^ 

gctaaagccxhotttgcttctgcatt^^ 
akavfasalvmwlgalu! 

TTCCACGCGTGTTTOICAAM 

SEQ ID 5546 

MPPEIFPBEHFlF(a)GVQGiB!UICBBGSGKOT£Fffi^ 

S !SSISLlCAATTAACC^^ 
ACCCTCCGGGGOTATOGAGCMGTCCTTAACOTAGAGGGCATACC^^ 

CTGCCCACCTOTGTCGG'rc'TGCGGTAC^ 
AGAAAACCCGGAITTCCC 

RKPGFA 

SEQ ID 5550 

KKraFflSPA«2lPIEPVSV?PljHF!l*IJllFH 

S 5E2ScU^^ 

CAACGGAGAAGAGGTTATGCGGCGOTCAAATC^^ 

CAKAOaiACCGTCCAAAl^ 

TOiAOGACGACGTraTCTCGTGGGATAOGCKXXSAAQ 

CGOTTGGGCGGAATCATCGQCGGCATTTCCGAOCCCGTTGW^ 

GACGGCAATCTAGAAOTCCATTGGGAGCAGGATCGCGAAAAAGGCGTATW^^ 

mTC aCAAAGAC«X*GCG^^ 

TACGCGCAGGTTGCC(XX3CMTACCAAGCCGACGAATTTGTCAT^ 

AAAMGC(^CCGGCCGCTraATCGGC(XX3GAMTCCTW 

GITCTACCATCCTGTTAT0GAGGAAGGTCTGCGTACCGCGTTGOGOGATQ(^^ 

S ^aADrmam<aGAP^^ 



IXan/EVHV^DGEKCWVAEYKtiAAVG 
ttCVMQIQADBTOGEVSFKlOGRS^^ 

S ATCOTnGMTCGGMGATATTTAMTGCTGTCTGAAACCGATA 

SEQ ID 5554 

KFLIGKIFKCCLKPICSVSAMFQTKTEGQRIi 

^raamCACTTOSAAAG^^ 
mCGAATOGTO(MC«SnTCAAAGAAAA^ 

MTCCTGACGGCAACGGCGAAl^ 
•AAATGTTCATCKAACCTGAAGAACCGGGCGATOTTOA^ 
AGGTT(mATTCTGTGCCAAAGTCAAACM
ACTGCXECTCMGTCTT^ 

SEQ ID 5556

LHGVECAAIHNYFSNQSIKELKMA^ 

IPDGKGEFTEGimVGKEDLGFGKRSWRYSMLVNI^^ 

TAPQVFIGGKYIG6SH)LEAYLAKN 

SEQ ID 5557

TOXXXXJAAAATCTOTC^^ 
TATACAAATCATTTTGCATGGGG 

SEQ ID 5558

LPENLFIMCFSIiIRLYMWGTGKS^ 

SEQ ID 5559 

ATGAACAAATGCCGTCTGAAAGCCTTTCAGAC^^ 
TGGGGAACAGGCAMTCGGAGTTGTATTTGATAGTTTTAAA 
SEQ ID 5560 

MNKCRLKAFQTAFCPELRRRGAVAGK^ 

SEQ ID 5561 

TTGAA(XGGTCOTACAAAATCCMCTGGAGAGTTCAA(^ 

MGAGATCAGTGMCMGAGCTGCTCGA(X5TTGCCAM 

CATCCTCGACCTGCAAGTTGCCACCGGC^ 

GmTGACCAAATGGTTC(^CACCAACTACC^ 

GCTTGAAAGCCAAACCGACCGTTGTCGGT^^ 

AATCCTGACTGCTITOTTGAAGCCffi 

AAAGTMGCGCCAAAATCOTTTGGGCACTTAOT 

TGGACOX3TTC(KXXOTA 

CGAGCGTTTGTGGATTTCCAGCTCT^^ 

AAAACCCMGAATTGCGTGTTCTGAAAGCCGCATT^^ 

GTGCAGAC^TTGOJUiACGC^^ 

GAAGAGCAAGAAAAACTGGAOTGGACGTACTGGTGCAC 

AAAGCTACGGCTCACGCTGCGTTAAACCACCTAra 

TATGTTGACCGGTOTGTAACCA^^ 

GCCGGCATTAAAGTCATCCAAATTGAC^^ 

GTTGCGAAGACAGCACTCAAATCCACACTCATATGTGCT 

GGAACTCTTGGCTG(X3TTCGGCGAATTCA^ 

ATTGAGGTTGTTCCGGT1X&GCGTCTGTGGGTO 

CCGAATTGGCGAAA 

SEQ ID 5562 

LNRSLQNPTGEFNMTTLHFSGFP 

EMTKWFOTWYHYLVPEFHAiyPEFKANAKHW^ 

KVSAKILUSTyFGSVAmAIiIiKSLPVIW 

KTQELEVLKAAI^Hm 

EEQEKLDLDVLVHGEAERNIB^^ 

AGimQIDEPMREGLPLKRADmYimGESP^ 

IEVVPVERLWVNPO^LmGWK^ 

SEQ ID 5563 

ATGAATTATGCAAAAGAAATCAATGtXHTAAAC^^ 

ATTOATCCACCGTCTGCAAACCCTGCAT^ 

GGAAGCAGCTTCGCACCTGACCGGCATCGATGC^ 

GGTOATGAGAAAAAACCGTTTTAOGCOGAAGAOTGOT 

CC^CCTGAWAATCTGAAGOSCAAMTCG^ 

ftJTCGAAATCGTTCTCGGCATCC^^ 

C^CAA(XXmCGCAACCTCGT^^ 

CCATCTGCCATATTOAGGCGCGCGCCCT 

SEQ ID 5564 

MNYAKEINALNNSLSDL^ 
GTCEKKPFYAEDLVKIiIiR^ 
a^TmVMSIAIEWKVLSREGVKDFHm 

SEQ ID 5565 

ATGAAftKGAATATTCATCCPGACAATTA^ 
AOGGCAAAGMTATCTGCTCTTTTCCTT^ 
AITOGTCATGTCATCGTTTAGAAAGGACAAA 
SEQ ID 5566 

SEQ ID 5567 

ATGGATATGAAMGACGCGATTTCTTAAAMTGACCGCCGCGC^^ 

aaGCGTTAC(»!ITO^ 

AGTGATGATGAGTCCGAGC^TGTCGGCGTAAACCTGCGC^ 






TCGOmGGATITOmXSGCAAAA^ 

ACGCTGCCACACCGCO^^ 

CGTGCACGGTTTTGA?PTGGTGAAAG^ 

ATCTTCCATCAAGATTO^^ 

GCGCACGGCTGACGGTAAGCAAAGGa 

CTTGGCP 

SEQ ID 5568 

MEMKRRDFLKMTAALAAAGVSP^ 

SPQDLVGKKILWFKKDMPDIVLQAIiIiKKIiKID 

iraQDLKNALNWILANRQNAAH 

SEQ ID 5569 

TTGCCTAAGTCTTCCACCTACCCGCT 

CTACGCATTTCTGCCTOX^ 

TC«^CTTCTGATACCTCCA(^ 

TCTTCCGCGCAGGACGACTCGftCCAGTGAGCT 

SEQ ID 5570 

LPKSSTYMiKQMPTACQPNIi^^ 
SSAQDDSTSELUlFIi 

SEQ ID 5571 

AAAATGACTGAAACTCAAAAAAACCG(&TTCCOT 

CGGGAATGACGATATTTCAGTTTTCTGTT^^ 

SEQ ID 5572 

KMTETQK^PAKTGIRSLKVOTVre 

SEQ ID 5573 

GC 

SEQ ID 5574 

MQVLSSLKTAKQRHRIX^IVRRIOVmCKSmFKSRQR 

SEQ ID 5575 

TTGAAAAGGGOTGCGGTAGGAGGGGCTTTACMC^^ 
TGACGATTTGGCAOTCGCGGTGGCGTTGTTTGGCGG^ 

SEQ ID 5576 

LKRACGRRGFTTEPGRA^ 

SEQ ID 5577 

ATGATAGGCAGACTTTTCCGTATTTTTTTO^ 
TACTGCTCGGTGCAGC(XKXK5CGACGCTG^ 

SEQ ID 5578 

MIGRLmFFFFALAAIJINRLFSRRQKRVLREVAEI^^ 

SEQ ID 5579 

CTGTTTGAAATACAGCATAAACAGA^ 

C66TTGATMTCAGCGCGGCAA6TGCGAAAMGAAAAAAATACKWAAM 

TC^GGCTGCTTCT^ 

AAAAATTCCGTT 

SEQ ID 5580 

VFEIQHKQIPKQRRGGCTEQYPC^ 

SEQ ID 5581 

ACAAT(£GGTTTTAATCTGCCA£X£^ 
TACGGMCGTTTTTTCGTCGTCGGGTTGAACMIt^^ 
CTGAAGGACTGCXSTAAAATCACA^^ 
CCGAACGTGTGCGCCACGCCTTGGTGATTGCGT^^ 

mrcGCAaxmACACCOTra^ 

GGCTATCMTACGGCTATAGCGTTCMTTTGAAGTAGMTC 
ATTTCGATCCGGCGG(X3GATTTCGGCAACA^ 
ACTGGTCAAGGCACTGATTGCOGOCCAAAAAl^^ 
ATGAACAATGTTGACAA6GATT6CCGATTA 

SEQ ID 5582 

VGIVTPQKIPFEMPLVIjENGKT^^ 
VKDWVKSQAAIJK^ 



MNNVDKDCRL 

SEQ ID 5583 

TlGGOGGA&CACftAARAMGCftX^ 
TCAACGATCAAAGTTTTGATGTGATTGT^^ 
mCGOTACTGGCGC^UmmCC^ 
GftTTTGTOKSCGC^^ 
SEQ ID 5584

LAEHKKCSGYGIEIDTDNVIAAM^ 
DLLCAKNNIRVIjERAVMTGNRQ^^ 

SEQ ID 5585 

ATGWAAACAAAACGTCATCACTTCCOT 

<MCGGC1TCTT(^^ 
SEQ ID 5586 

MKNKTSSLPLWIAAIMIiAARSPSKEDKTOTGA 
SEQ ID 5587 

TTTCCCCGACACCCTCGCCGCCAGTTO 

AATCGGGATTGACCGCCTGACGGAOT 
MGCGTCMCAGCGTCAGOGAAGGCGGACAGTCGATCJUK^ 
TCCGCOEGGCAAGCGCGCra^ 
CCGCCTOntt&TGCCGCTK 
CACACCGCCCTTCTGATTQG^ 
SEQ ID 5588 

PPRHPRRQFWQGKIIUiGTUMWSHHRffraW 
KRQQRQRRRTVIXJDKVVIFRHCliERWQTHFPGD 
HTALLIGDGKDVRTH 

SEQ ID 5589 

TTGTATCCGACA£KXXCGTOAG 
TTTTCA6ACGGCCTCTO00OT 
SEQ ID 5590 

LYPTAPFSRNVGFKNPTHAADNALIGRLKNDKSGIHNHPTLFR^ 

SEQ ID 5591

TTGTATCGACTTAATCCTGAAACA^ 
SEQ ID 5592 

LYRIJiPETQKAGLRHNKAVSFIKVGISSIiI/r 

SEQ ID 5593 

TTGCXXXMTCATGCTCGCCGCGCGTO 
CGCATCCQCCCCTGATiAOGTCAAGCAGGCAGAAAGC^^ 
GACTGTCTGTTTGCCCTTTCXXM 
ATCAGAAAATCATGCTOXWCGAAGA^ 
SEQ ID 5594 
LPQSCWPKVPRAKHKRKKTAHPPIjItfiPRHRIiL 
IRKSCWQSKTCVTERAPSAGRHKMP 

SEQ ID 5595 

AT(mCAGGAACGCCGC^^ 

CXTPATTTGAACC^CMCAACGGM 

CCGTGMCGCAATCCCGACAAACCGTTTTTAGACATC^ 

GCGACGAAGCCGAAGACAGTACAGATAGTACGGCAAGG03AGGCAA 
SEQ ID 5596 

MEQERRLf^TLMLW^ 

ATKPKTVQIWQGEATLYWFRFNPLYFNPNTGRI 
SEQ ID 5597 

ATGAAAACACXACAAGAACTGCGCGCCGGCM 

TGAAAMCCT(X!TGACCGGOGCTGCTTCCGAM 

CTTTATGGACGMGAATTCAACCAATACX3AAATCGM 

ATTOPGTCGAACTGCCCACCAm 

AAGTGATGTCTTACATCGAAAACGGCGACA^ 

SEQ ID 5598 

MKTAQELRAGNVFWGMDPMVVQKTEyiKGGRSSAKVSMKLKNl^ 
ISVEIiPTIIVREVEYTEPAVKGDTSGKVMKT 
SEQ ID 5599 
TTGGATTTCGGT(KXXXmCCAGA(X5Ca 

AGACGTACATCGGGTCGGCAAMTAGCTGTAOT 

CA6TTTCATGCTSACTTT66CGGAA6A(X}GG^ 

ATTGTGATTTTCCMTCi^ATC^ 

SEQ ID 5600 

LDFGAADQTRGFWAGSIAFTCRimDFAHDIXK^ 
QFHADFGGRAAAFDVFGF^^ 

SEQ ID 5601 

ATGAATACATACGCTTTTCCTOT 
ATTTGTGGACGC&CGACGTGTCCGCOT 
C6C6C(XXa i lt^GATG(XGTTATCGA^ 
AGCMTGAAAGGCTGCACCTGATGOTTCGCC^ 
Cl TO OQQC TA y Ot^

GGCAGGCAGCCTGA3^COTACTGC^^ 

GTCCGCCTTGTCAAAATCXXXSTTO^ 

CCTTTTTTTGGaCATCTACCCG^ 

GGACGACCTCAACXjGCGGAGAGGOT^ 

(XmGCCTTCCGCATCCGAAAAAC^ 

SEQ ID 5602 

MNTCAFPVCfflFCKVIDNFGDI 
SNERLHUlPSPQEGVQKYFHPKGFSEKSGGIiI 
VRLVHPFVPQQDFDKLLHLAD^ 
GQPSASEKLAAFVSKHQKIR 

SEQ ID 5603 

ATGGCGTTTTGCTTTTTTGAA^ 
AACCCCAGCATCTGATCAACGG^^ 
CATGTTTCTGTTTATTCCGATTGTO 
SEQ ID 5604 

SEQ ID 5605 

GTGGAACmTACAC^CCAGCAAAAC^TCGGMTAA^ 

AACGTCGCTTCGCAGGCAAGGATGATGCOT^ 

CMGCAGGTITO^GACATOKIAAGGCTITC 

GCCGCACCACCGAACCTT 

SEQ ID 5606 

VEQIHNQQNNmQKHGLTFQOT^ 
AAPPNL 

SEQ ID 5607 

ATGMTGAAGATAAATATAGGGAAAATATGACATGGGATGAAATT^ 
AGAATAGGTTGGCT 

SEQ ID 5608 

MNEDKYRENMTWDEIENVTO 

SEQ ID 5609 

TTOTTCTTAATCCTCmTTO 
AAAACAAACTCATTGTCTTTGTTTGT 

SEQ ID 5610 

LCIiNPAFCVSGLSRYNHHOT^ 

SEQ ID 5611 

ACGCTTGCT(AGGATGACGATATGG(X3Tra 

TTCATCGCCGCCCAGTAGCOjCCGTC^^ 

CGOXXXXEACACCACGCGC^^ 

CGCAGGGCGGTCAGATTGTAGGTTTTGWC^ 
GTTCTTTACGTTCCAT 

SEQ ID 5612 

TTjAQDDDMAFRIQARHVEPLDFFDAI^ 
RmGIAAARTDIQHRLCAFD^^ 
SEQ ID 5613 

ATGCGTCGGGOTTCAATA!^^ 

SEQ ID 5614 

MRRAFNMPAPDAALSAGIGGSVRFLG 

SEQ ID 5615 

ATGTTTCAGGTTTGGAAAMTGCGGATTATAGCGG 
AAACCGATGCGTCGGGCmmTATGCCXX^^ 

SEQ ID 5616 

MFQVWKNADYSGKRAALIFCFAEARRR^ 

SEQ ID 5617 

GTGCG(X3ACAAAG«:GCGTTCGGAG^ 
(m(KXXX3GTTTTGCO^ 

SEQ ID 5618 

VRDKGAFGGGRFRRADVHTGIEQ<^EAD^ 

SEQ ID 5619 

ATGATGTATTT0(X3CGCCCTGAC(X^ 
TGCAAAAAGTCGACCCTGAAACAGOTGCC^^ 

GAAGTAGAAAACCTGCGCCXXXGCTATCCCGGCCTGAOC^^ 

TCGAAAAAGGCATTGOCGCCAOGCGCCAACTTO 

TGAGGCGGCAAAACGGTTTTTCGGCGAA 
SEQ ID 5620

MMYFRALTQGLNDI^EADACIiEADLDEQRQ 
EVENLRRRYPGLTADSPAI^ 

SEQ ID 5621 

ATGCTCGCCACGCCGCCGCTCG^ 
CXmTimCCCTCACGAAM(X^ 

GcaxncGCCGccrrrccGCca^ 

GGCAAMTCATCACATCGGC^CACAACACC^^ 
(mCATCTJUaTCACCCTC^ 

CAAOTGTTTCC^ . 

SEQ ID 5622 

MliATPPLAPKTVAALHRLGIR^ 
OTITSAHNTCIAKWSRH^ 

SEQ ID 5623 

A5t»CCGCCCTACTC<n^ 

CCG(XX5GCTATCMATCTACGGCGCAGGCATCTO 

ACTCGCCGTTCGAGGTGCATTGGCCGGCAGCAT^ 

TT 

SEQ ID 5624 

MTAUtVILAIiALIAVCTA^ 

I^AGKAGUSTLLGLVVGTAFKIGCAVSILFILLVKYIAOT 
SEQ ID 5625
GTGTCCCGTCTGATGTTGAAAAAAAGAAAAACAGTGOGG^ 
TCCGCATCCAAGCCCGGCACGTCGAGCCTTTGC^ 
TCAACAGGTGCGCCGTCCACGACACAAAATCC^^ 
CAACCCCMCTCGATAGCCGCCTGCCGCAAAAA^ 
GAACCGACATCCAGCATCGTCTO 
TCCACTTTTTCAAC^TCa 
TCCATATTCAGGCCTAATOTAGO^AAAM 

G(M0GTCAG6CGTTT0GGCA60GTCAQCA , 
SEQ ID 5626

VSRimO(RKTVRNF17IRTKKVFM^ 

QPQLDSRLPQKSRFLRIRIQQGYU^ 

SIFRWWSGKTCRLKPRIGFQTAFPFRPIFSCG^ 
SEQ ID 5627 

TTGGTAGAGCACCTGCTTTGCAAGCAGG 

GMTAACGACGCATCGATCTTTMCAAATRK3AMGOT 

SEQ ID 5628 

LVEHLLCKQOTGSIPFASTKT^^ 

SEQ ID 5629 

TTGACAAACACCAfTATGGCAAACCAAAGA(X)CATCT 
CAGX2ACGCGCGAAGTGCTGGAAAAGGCGGCAAACGAAM 
CGATGOTCCAAAAACCACGTCCACCT^^ 
(XXXaCGCAATOTtXCTACACGCC^^ 
AOTCGCCCGCACCCTGCGCGAA 1 ^^ 
CAACGMGGCGACGGTATGCGCCGCCTCACGCGCGW^ 
GAAAC0CGCCGGCM0GOGTATTGAAAAACAAAAAAG0G 
SEQ ID 5630 
LTOTIMA^RPIYGFHAVNARLWQNP^ 
AIMGVHAVIAPKDKSAGLNAWSKVACGAAETP^ 
ETRRQRVLKNKKA 

SEQ ID 5631 

GTGGCAAMCCCCAAATCGAraCCGAACOT 

GACCGCCTCAACGCCATCAGCAAAGGCGCGCTC 

TGCT(&TACTC<ftC(^K^ 

CGTCA(X^AAAGTC(X)CTGCGGCGCGK 

GGCGACGCCGACCTTTACXIATTGCAATC^^ 

TGTTCGGCACGGTCGAAAGCATGAACGTC^ 

SEQ ID 5632 

VAKPQIDYRTLYPRRQIRRTHARSAGKGGKRKHPRIFC^ 
RQQSRLRRGGOTLHHRNQPRPHPARIERIR^^ 

SEQ ID 5633 

ATGAAAGAGGGG3GTTK&GTGAGGGGG(^ 
0<Xm0KnTAG<XK3CGra 

TTGGGCAAGGTGGATTTTTCCGCACTGC^^ 
TOGGCGTGTTT(^GGOGGTCGGCGATTTGACGGCA^ 
GGTGATTGCGACGGCTTTGGGT^^
CTGTTQGGrcTGTTC(XXX5TT6^ 

CGG(XH3CGGCATTAGGGCA(^ 
SEQ ID 5634 
HKEGGI5BGAMVSTLLGVSPA6AFLVCFSAWLL 
I£KVDFSALQNLPP^TLP^FRYGF 
LLGLFPVVGRAFTTIPSPVIXKSAMVDirc 

SEQ ID 5635 

ATGGCTGAAACAATCAAAAAACAGGCGGAITCGCC^^ 
CGATGATTACGCCCGCGCTGATTGTGGGC^^ 

SEQ ID 5636 

MAETMKKQADSPDLVYC&EDRP^ 

SEQ ID 5637 

ATGACGGTAAC6CT(^TG6ACAGCATCCCC6AGCCGAC(^^ 
ACCGGCAATTCtttXXXXXX^^ 
OTACACCAMTCAGGCGAATCXXXXTG^ 
ATCAATTCAAATCCTCATGT(K!AAAAA^ 
SEQ ID 5638 
fflVTLDGQHPlUflRPEAVD^ 
INSNPHVQKHHRLSBFKFL 

SEQ ID 5639 

ATCTTACAAGGTAOTTG^^ 

CTGCCGGCAC£ACAGGCGMTC(XXJCACC(^^ 

C^CCGTCmAGCCATCGCCCTTTCXX^ 

AAGCAAGCGGCAACGTCGGCAGCAACATCGAACTCATC^ 
TATTACCGTAGCCGCCAACGCCGCCC^^ 
ATGTTCTGCGAACCCAGTCCCGCCGC^ 
CCGCCCTGAAAGCCTCAGGACAACTT 
SEQ ID 5640 

MLQGSLVALITPMNQIX3SIHYDQLRQLIDWHI 
VAEAAAIPMH,YNVP(BaW^ 

}ffCEPSPAAPKWAVSALGRCGPHVRLPLVPLTEGGQAKVRAALKASGQL 

SEQ ID 5641 

at6ccgtct6aaccgttcggac66cataacat6acaa^ 
cct<x:tcx^gcagcaaaaccgaaca(^ 
6cotccggttc6gga(xxxn^g(x^ 
ggcagccaacgctggcttgtcgttgacggc^ 
tggaaaccgagtgggcggaaaaccgtgccaamtccc^^ 
tatcgaacagggcaaaaacggcgtttccgacatct^^ 
cttgaggccgctttcctgacgc^^ 
gcaaaagcctgattgtctttggcgactacg^ 
aaaagccccgaacgaaagcaatgcagttaccgaacaaa^ 
otgtcgccgacggttcgcgcatot 
SEQ ID 5642 

MPSEPFGRHNMTNTLISITQDDTifFHI^^ 

GSQRWLWIX^AEIWPLLKAFV^ 

LEAAPLTRFMQYIXMXmQAENALAKKCT 

SEQ ID 5643 

ATGATGTATGAAATAAAACAGCCTTTC 

GGCCGGGCECGGGGGCTTCGCCTGAATGTC^^ 

TACGACTTCWGATTKKmSG^ 

TCGCGCCGATTGCTGAAAATCGGCGGA^^ 
TCTGATTCX3TTTCGAGCCGGAGGGAGTGGA^ 
AACMTATCGGQUUATACGGCATATCCCGACCGTT^ 
TGGTC«GGCGGGGCATTGTGC^^ 

SEQ ID 5644 

MKTEIKQPFHSGYLQVSEIHQIYW^^ 

PERVKGLVUIGIFLCRPSETAWLNEAGGVSRIYPEQWQKF^ 

NNIGKII^PTVIVQCmYDLOT 

SEQ ID 5645 

GTGTTGGATTTGAGGGCAGGGGCTTGCGCCATT^^ 
TATGGGTTTTCTTTPCAATGATATATTT^^ 
(XXXH'GTCGGGATGTTCGGATOTCAGAAGC^ 
SEQ ID 5646

VLDLRAQACAIL^ALPRRLEGRGLTVRK©^ 

SEQ ID 5647 

ATG(XX^TGGTGGCGTTGTGGGCGTGGGGCA^^ 

TCGGGATTTCGCTGTGGCGGTCGGTAGTGOT 

G0CTGTGATTA(X3ATTTTGTTAGCAA 

ACGTTTXXX^TGCGGCGGTCGGMTGGCGAGC^ 

GCTATGTGAmCCA(XX3^ 

G 

SEQ ID 5648 

MAMVALWAWGSAVPGEFMLPAPVEVFQra 

SEQ ID 5649 

ATGGGCGCAAGCCTGATCCAGCCATGCCGCGTG^^ 

AA 

SEQ ID 5650 

^ASLIQPCRVSEEGLRWKDFCOGRKGRCQYRRPOTPEE 

SEQ ID 5651 

GTGCAGGCGQATWAGCXTACGCCGCCG^^ 
ATTCCGTCCACCCCAGGGTGTCGGTCGGCT 
AGAAAACAATGGCAGCGGGAAAAAACTQACGCAAGACX^PGAM 
TCCCGCTTCAAACOTATGCAGGC^ 
CTACAACTTATAATOTMTC(^CAGACGCAAA^ 
(^(^CC«X^ACCGCTACCACAACTG(M 
SEQ ID 5652 

VQADLAYAAERITHDYPEFK3AKKDKKI 
SRFKPYAGVRVSYGHVRHSIDSTKKTTD^ 
SEQ ID 5653 

AATGCCGTCTGAAGGTTCAQAC^
SEQ ID 5654

MTGFmVLSIXjFRDYGIVGNIXSSGIIi^

SEQ ID 5655 

TTGTTTATTGTTGAGGAAAATATGAGATCTTC^^ 

AGGCG(^GATACAGGTTTTGGAAGATGTGCACGTCMGGCGMGC^ 

AAACCTCGACAACATCGTACGCAGCATACCCGGTGCGTTT 

GACGGCATCACGCAGACCTTTTATO 

GCTTCAGCGGCTCGGCAGGCA7CAACAGCCTTTO 

CACCAATTC^AOCAAAGOTMTGCGATGGCGGCGA 

GTGGGCGGCGGCGGGCAGCAC^TCGGAAATTTTGGTGMGAAT^ 

GGGATTTGCAAAGGCAATACTGGAAAACAAAGTGGTATA^ 

C^CATWCCCCC^TCGATCCGTCCGG(^^ 

ATCGGCAGCC(mAATCATCM(XX^ 

TITGGGCTTCAAmmCCACAACGAATAC^ 

GGCGATAAAGGGCTGTTGCCTCAAAAATCMCCA^ 

CCAATGCMTCAACTACOGTTTCGGCGGCGAATA^ 

CGGGCTTTATGAACCCGTATTGAAAAAATACGGCAAAAAGCXSCGC^ 

CGTATGCCCAACATCCAAGAAATGTATTTTTC^ 

TGTTAAAACAAGATGATATATTAGGATTGAAACT^^ 

CX3GCAGCACXX3(mTOX^ACACC^ 

GCCTATCAAAAAAGCACGmCCGACCMTTTC^ 

GAGATTACGGACGTTTGGAAGTCGGTACGC^^ 

CACCAACGGGGGAAATACCAGCAATGTCCGGCAACTGGG^ 

AACCTTATTTTCCGCGCCGAAGTCAAAMCCTC 

ACGAAGACGTAACGTGTAATGCTGATAAAACGTTGTGCAAC^ 

GOT 

SEQ ID 5656 

LFIVEENHRSSFRIiKPICFYUfGVMLYHHSTAEDAGRAGS 

WITQmSTSTDAGRAGGSSQFGASVDSNFIAGI^ 

VGGGGQHIGKFGEEYLERRKQQYPVQBGGLKFNAGSGKWERDI^ 

IGSmiNmQFNmSLNPYTNIjNLTAAYNSGRQ 

GDKGLLPQKSTIVQPAGSQYFMTYFDAAIjKKDIYRLMYff 

AYQKSTQPOTSDASESPNNASKEDQIiKQGYGLSRVS^ 
ffljIFRAEVKTOjFDRRHDPIi^^
SEQ ID 5657

O\6G6AAGCX^66T , ITTTGATTT0GOT 

SEQ ID 5658 

MPSESPSIXjICFPKRIILPQAyANPFYRIGNSKKNPFHSH^ 

SEQ ID 5659 

A«XXX»OCXrmGT(OT 
CCTATTACATTGTTAAAACCGGGCAGOOT^ 
SEQ ID 5660 

MPAIiCATOFNAFPKLINN^ 

SEQ ID 5661

MTmCTATATATGACAAATTGAAA 

SEQ ID 5662 

MFHGLKKTrALPIiSGSAVLSAARTC^ 

SEQ ID 5663 

ATCCGCGCGAGCTG6KX3TCGTOT 
CAAAACCGATACCOEC^ 

AGTGCGCCTGCGCGTACTGAGCAGOTGC^ 
GCXCGCATCMCGTGCGCGGTGTGCCCGATMGCCC^ 
CMaX3AlTOTOTOACCGTACC^ 
CTCCOT^GTCGGT^ 

TTGATTGi^GAAAAATACATGGAACTGGCMCCAGGGTA 

SEQ ID 5664 

VAKKVAKARAEGHDIWWSAHSGETNRLVA^ 
GNISTIX^SOTSAVALAAALKADE^ 
ARINVRGVPDKPGVAYQILGAVAIMIEVDMIIQNVGSEGTTD 
LIDEKfflELATRVIJIKAFDIiQ 

SEQ ID 5665

ATGGCTTATGTGGTCTTGGTCATGGTATTTGT^ 

SEQ ID 5666 

MAYVVLVMVFVSLFEYLIPSLCKPCLCRGGDDDLS 

SEQ ID 5667 

GTCCGAGGCCCAAGGCACTCACACTTATC® 

ATACTTTTCAGTTCGTCCGCCGCT^ 

A 

SEQ ID 5668 

VRGPRHSHIiSVICimimilOT 

SEQ ID 5669 

mTO«?rcTGGMCACCACAGG^^ 
CCCAAOXX^CTGCCm^ 
«KX?TCAGGCCCGCCGA^ 
CGCCCGTCCGAACCX^^TGC^ 



SEQ ID 5670 

PFFVEHHRLHRRARQAFRQRQHIft^ 
RPSERTCI^IMHRQRKRQRQKHQPRKQGRIRIFLVHKHRR^ 

SEQ ID 5671 

ATGCGCTCGG<KX3AGC^^ 
CTAWTOTACATGMGTTCAAG^ 

SEQ ID 5672 

MRSGEPPDVPPYOTINftMFHAFI£GIQ 

SEQ ID 5673 

GTGTG(^mGTCGGGCAG(^^ 
CCTGTTCTTTCAGAT^^ 
SEQ ID 5674 

SEQ ID 5675 

ATNXX^TGOTATATCOGCA^ 

TTTGTACCGCTTCCGTGGACGAAMTGCCC^ 

TTCGGCGGGCAMCAGTCX5GGACraCACM 

AT^GffiXEACGGCGGCAOGOT 

THXXBCCGTATO^ 

MTACAGGGM(WXX3AAGGCGCG(X^TTCAGTTTGGACG
SEQ ID 5676

HPDYIOTSRAADSLRDIKITPHFI^ 
IQADGOTTASITGAFVALQIAVGRLVSDGILSENPILEAVMV 
SEQ ID 5677 
ATGTACAAAttTCTGGTCGTTGCCGTT^ 
CTGCCGAATATATGOm^ 
GGGAAATACCGTCAAACACCTGCTG(?^^ 
ACXXjGGCTGATGCACCTTTTGATGC^^ 

SEQ ID 5678 

MYKHLVVAVIXSSETSINALKHAAELAGVNGARLTLVH^^ 
TGLMHmffiSFAETVMRQSHIiPIiLIIRSKAEEA 
SEQ ID 5679 
GTGGOTTGGCGCATTACCGTTTCGGCG^ 
TCCTGCGCCATTTCGCGCGCACCC^ 
CGGCGGCCTCGTAACTITCGTGTTGCAAAAA 
TTTCAGGGCATTGATGGACGTTTCACro 
CTGCGCC^TTTCATTATAGACCC^ 

SEQ ID 5680

VAIJmYRPGETAHQKVHQPGAPmQHQQAGTVFra 
FQGIIXiRFTAVNGNDQMFVHIWSFCIVSRCRM 

SEQ ID 5681

ATCXXEOCGAWXXTOX^^ 
SEQ ID 5682 
HPPKRREEVKALSIMVESCKIASHQKVKESRQKM 

SEQ ID 5683 

TTGACCTCTTCGCGACGCTTCGGCGGCATCGC^ 
AGGCTTPQGCGOSamK 
CGCX^TTGCGCGAACGTATTACACAAATCMTOT 
GCGATCGACCAAGTGCGCGTCAAAGCGGCAATGG 

AAATATCATGCCGCCGCMTCGGGCGCAGCGTGTT(^^ 

SEQ ID 5684

LTSSRRFGGIARLYGDSALTHFSQARVCVVGVGGVG^ 
AIDQVE^KAAMAAYPVERKQPFVLSGGAGGQKOT 
QAAVHOAGKR 
SEQ ID 5685 

TTGCAGGTTC^AACrGAAATCGCGCACGATGGGGTCG 
GTATTCAAACAGCGACACAAATACCA 
SEQ ID 5686 

LQVQTBIAHDGVAraFKm/FKTIHHLI^ 

SEQ ID 5687 

ATGCCX3TCTGAAACAGGATTCAGAC 
TTTGCATT 

SEQ ID 5688 

MPSETGFRRHLNKRG^VGSEKPTSARLKPYSQSTPFQNI^ 

SEQ ID 5689 

maTTOAAAGCmT^ 

GcascaaTccxroxx^ 
SEQ ID 5690

LHIJCAVIMPSEHLPDHIMRECRQIV^^ 
SEQ ID 5691 

TTGTTTATCCC6CCCGA06CGCT6AAAGTCH3 

TTACCGGGCAGTATCTGCACTATATTGOC^^ 

TACCGMGC(X3TCGMGACGMGAGGCC£ACCCG^ 

GAT?TCGCGTGGGCTTA(XTGCCGCTGGAAATT^ 

GCCACGAAGTMTCCMGAMCCCTTra 

TTACGTGAT(X3TCAACTTCATCGCCC^^ 

GACGGC^TTTTC(mCACGGGGCGa 
SEQ ID 5692 

LFIPPDMKVVIjGSF^ 

DFAWAYLPLEIAAETKLPEV^ 

DGIFGTRGGRDVF 
SEQ ID 5693 

ATGAGGQGGCGCATTCAGACGGCATTTl^^

^mACACTGACCTCTAC^ 

CATCAGGGARGACTTGGAAGCCGAACTCXjACGCGOT 

CTCTTCCAGCTCOUm^
SEQ ID 5694



YKGSFKjGreiBniWHDHSBI 



(YK3GS3CCTATTAGMCACATCGCGOTGCCCCGTGTGC^ 



AftGCGCCX ^^ 



CAAACCGTCGTCA' 



GGCTATCTGCOTAAGATGCAAAAGCCGAGTTTATC^^ 



lTGATGGGCAAAGACGGCAAACCGOTCAAAACGC^ 



GCTGAAACAAGGCiro&TTTGCTGG^ 



SEQ ID 5700
w ™ M ™vEmm^^^
HACRQPGRLQQNTCRNQQRQAYIMTFGAEYGQNAPNQRTHGQ^
RPPHPHRHRHQPRTGSPRRTPPLPMAGLPtiAQ^ 
SEQ ID 5701 
GTGAAAGCC1TGTTCG6G(XXjAGCGGCTGTG6CAA^ 
^^GGAAAACCGCCTCCCX&AAAAOT 
TTTGMCAAATATQX^CCGAATTGTCCGGCGGCATC 
GATOGCGCGATATTTTGGT^ 
TTTCCGCTAAGGGCATGAACETGCAAAAC^^ 
0 

SEQ ID 5702 

VKALFGPSGCGKTTVLRIiIAGLETPKSGTIR^ 

DLRDILVAMLVEKIERQGMACILVTHDRFEAARIiSHEIM^ 
SEQ ID 5703 

ATGCCGACAGTTTTCATCTTTTT^ 

GCGCGGCATTGACCGT(XAC^^ 

TGCC(X£AGCTTTTTTTATCCG^ 

TTTCOTTTATGCTGCATATOm 

AGCTCGCAGGATTCCTGCCGTC^^ 

ATTCGGCAAACCCATGACATTTACXXIATAAAACCG^ 

TGGACCATCATC6GATTTGTCA6CCTTATGATTGOT 
SEQ ID 5704 

MPTVPIPLTAraGLGAFAifflCQQQGRGRnyPWKm 

WTIIGFVSLMIAYKGSKFVLBIIIiKR 

SEQ ID 5705 

TTGCCTATGCCTGATGACGAAATTTACAGACTG 
SEQ ID 5706 

LPHPDDBIYRLYAVKPYSAANPI^IHIiAKTTDKSN 

SEQ ID 5707 

ATGATACAGAACCXX3TTTCGTTGCCGTCTGAAOT 

ATTTCGTTGTAGCCTTTGAGCATCAGACCGCACTC 

toGGCGGATATGGGMTCGCGTTTGACCACGCC^ 

CTTTTCGGGAGAAAGCATGCCGCTCATCGC^ 

GCACGCACGTTAAAGCXK^TMTGAACGOGCCCGAAGCGATGGCAA 

mrcAOOTxmm 

CTGTTTGGCMGGCGCACGTCGCGGTATTTGCC^^ 
AGGATTTCGAOGGGGATGGACGGGCCGGCTTCGCT^ 
CGTTTTC^CCAGCAATGTGGCAACCGC^ 
TACGGCTTCGAGCAGCGCGTCGATGTTOGT^^ 

CAAGAATCAOGATAT(X&TTGOT 

ATTGGTOGATGGTAACCATCATGCCCT^^ 

TACGACGGGTTCXXnft3GTGCTTGGAAGGCGTC 

CCGCCTTTTGCACCTTTGCCTT^^ 

CCT8XX3TTTCTCGGCA<^^ 

CTCTTTCAACAGGGCTTCCTGGTGGGCXS^^ 

QCOGGTTTOTGGCATCTTT^^ 

AOTTTACGCCGTCAACGGTGCTGACTT^ 

OTOTTTGTCTTCGCTAAAC^ 

GGGTGCTAATGCCTGCTTCGGCAAGGCT 

GGCATOKXjGGCAOTGTTGCGGAGCATATO^ 
SEQ ID 5708 

MIQNPFRCRLNASIXjIIiFAKEIjQGAGOT^ 

LFGRKHAAHRRIiIVVIX3IVDDVVVADFHVFirc 

LFGKAmWALAm&FARFF^^ 

YGFEQRVDVRSFFRGNVDELN^ 

VUUftARFALYHFGAADWQ^^ 

YEJGFGRCLEGVLVRIiEFBFFAPLAAAYALVMVAGm 

LF^FLVGAQCGGAARIiFVALRLPFRAra 
RVFCRLLRLNRCGMSLGGFYI^

LYLSSVNQCSRAVRM'AFAVSSFTraSISS^
SEQ ID 5709

ATGAGGCAAACGTGCCTTO 
ATGCGGGTGTAGATAGATTGGCAGCCC^OT 
TCTTTTC<XX}GCAATCX&OT 
SEQ ID 5710 



SEQ ID 5711 

^QTACTGCC!AAGTC6GAAATAAATGTOT 

GCATGTGGAAGATGTCGGACGGTCA(Km^ 

TTA OT T GT rTCCCTQCTGA 

SEQ ID 5712 

MYCQVGNKCLEKHRAENLYFSLW 

SEQ ID 5713 

GTGAAAGGTGAMGCGGAGTTGATATCGAAAATTGGAAAAACAAATO 

AAGCGGftAGTWfflCTCTQCCT^^ 
TTTAAGTGCTCCAATCGAG(XAGTTAA^ 
OTCTATTAACGAATAACAAAACGCAQGCGGMCAAAAm 
TATCT(X!TAGTA!TTTTT^ 

SEQ ID 5714 

VKGESGVDIENWKNKLPEKEREFV^ 

LtLTNNKTQAEQNL^ 

SEQ ID 5715 

TTG<X^CAA(MCCCTC^ 
CAAGCAAGCAAAAAim 
SEQ ID 5716

LRTTALQSTGGPSSSESEKSGAIHRIRRPSQNAKQTHARQASKQKL 

SEQ ID 5717 

(jrecGCGCCCCRKXX^CCGTGC^
CX^CATATAAAGT^CCGCCOTATGTGTCGTCC

SEQ ID 5718

VRAPAAIKWAPGRHRSMRFSVSNTPVSERNADV 

SEQ ID 5719 

ATGTCGTCCTGTGTGTTGAMCATCTGC^ 
GCGCACGCGTGlT^GGGAAGGAAAACAGGGAAAfiGAGGOGGGC 

SEQ ID 5720 

MSSCVIoKHIiHFFI^QGYIjKQRTGWRNDAVRAO 

SEQ ID 5721 

CTGAMGCCC6CCCTCGGATACCAGCTGAAGCGGTO 

TCGTCGGGC(XXX^«X3CAGTG(^Ca 

GGTCTPTTTOMCCAG^ 

GGCGGCACGCTGCTGATTGCCCAAGG^ 

TTTGC(^mAGATAaTGGCCATCXmCCCGGTO 

CGGCTTTTTGCMCACAAACGGCATCGC^ 

CATAAMGGGAAGAGTTGGATCACCATCATCAGOCC^^ 

AGGTAGTCCAAATTGACAATCXSGGCGGGCGGT^ 

GCAQCGTCCATTOTTOGGMAGCA^^ 

SEQ ID 5722 

LKARPRIPAEAWKRTTRAVHRKGGOT 

GGTIiLIAQGE0X3VDQA<XaN^ 
HKREELDHHHQPVKQNRRPKRQYAGNFDETPRQGDQQ 

SEQ ID 5723 

ATGTCCGAGTOSCGGGGTTTmCCGAAAAG 

TOCTGGCCtXX^VTGCGGTGTT^^ 

;&TTGCCGGCGTATTGGCGTTTTGGOT 

GCCCCCGCCCOT^^ 

TTGTGG(X3GCAGCCGOTATTTCACCGGC^ 

CAGC(^GAATGCCGACTTTATTAC(XXXXKX^ 

GflX£CGAATCTTGGGGGCTGC^^ 
CGACGGTCGMGGCGAAATG(XXX^TTGTGCGCCTA 

T^ACGCCACCTTTC^GATGCACGGCGCGGGT 

CCATTTTG3GCGGOGTGTGC6ACAGC^^ 

SEQ ID 5724 

MSmGmKGm^IHALLSEQWTIiPPPLP 

VAESWGLP^ELQNATFAKIJAQKDRPSVWEK 
PFSAAOTASCSAKCI^SKNTTRXFTG
SEQ ID 5725

GTGTGCGACAGCGAGCTGTOXK^ 
ACyUSGCTCAAATGCACCGAATACGGOT 
CACGGMGTCATCATCX5TCGGCGACCAK^^ 
SEQ ID 5726 

VCDSBLFGEVSAFPKKHDKGLFYWMTLTSHAD^ 

SEQ ID 5727

TTGTTGTTATTTGATTTTGMGTGCAGCCAG^ 

TGAAAATGTCGGAOTCGGGATAGTCGGCG 
GCCGAAAATGGCXK^GGTTTTTTACCGAT^ 

SEQ ID 5728 

LLLFDFEVQPGDVSLFEVA^ 
ABfGAGFLPIRFSAVLIF 

SEQ ID 5729

ATGQCGTCTGAACGCACCAACAGCCTTCAGACXK^^ 

ACAGGMOTATCAAAAAAACAGJ^^ 

CCCGCCTGCGCAGGAA 

SEQ ID 5730 

MPSERTNSLQTAFCRQfM 

SEQ ID 5731

ATGTCGGGGGTTCGATTCCCTTCTCCGGCAC^ 

AACGCX!TTTCC^GCCGCTGGCTCAAC^^ 

CCAACACG(K^GMTGGATAGGGATGACCGTCTTOT 

Otf&CTGCTGCCGCTGAAATCCACACTGATC 

GAGCAGAATATGGTCAAAATGCGCCAAATCAACGCAO^^ 

(XXIACCXJCAAAATCGTCAACACCACCX^^ 

raCCTGC^CAAACOSCCGCCCraTCM 

TTCCTCTGGCTCAGCACCAATATGCGT^^ 

AAACACGGGAACACGGC

SEQ ID 5732 

MSGVRFPSPAHPSTDPPSGSLCFFTFMjQtfAS^^ 
GLGVLWLNQHYFHGMLLFYLTIGTi^^ 
EQNMVmQIMARMVKSRSHLAATSG^ 
FLWLSTNMRQEISALVILLQRT^^ 

SEQ ID 5733 

TTGGAGTCATATCMCAAAAAGCTGCCCGGGTATOAOT 

SEQ ID 5734 

l^QQKAARVFBm^FVAKPRFKTOPXAV 

SEQ ID 5735 

TIOT3GCAAAATTTAAATTCAAAGA(^^ 
SEQ ID 5736 

I^NLNSKRKRPMPSESLSDGICQPHARGNPSCLVSIFFG 

SEQ ID 5737 

ATGTTTCTOACGGCGCACTGACGAOOT 

CGATGCGCGTGCTGGCGTCKXXXJTTGCGTA^ 

CCATCCTTCAGACGGCATTTTGGTCAOT 

ATTTTCXX3CGCGGGCATCCAGTCTATAGA 

TGCCCXXITTTGGCGAGCGAGTTCATCACGCT^^ 

AGAACCGCTTTACACCGCCGCCC 1 !^^ 

SEQ ID 5738 

MFLYGALTTLGLTVVATAGGSVLGLLI^ 
IFRAGIQSIDKC^MEAACSLGLTYPQAM^^ 

SEQ ID 5739 

ATGGCGGATTATTTTGCMCGGCGGCAGTTCCCXX^ 

SEQ ID 5740 

SEQ ID 5741 

TTGCAAAATMTCCGCCATTTTACCGTAAAAACCGC^^ 

TTCMGGCCGTGCGGTGTCTTTACCAAATGOT 

TAGCTTTCCACAATCX^TTTGOT 

CTGCACCTGCATCAGGTTWAATCGGTTT^^ 

TTGTCGCGCGAACGTGrAWGTAAGOCTGTTT^ 

K^CACCGTG(XrW3^ 

CGCCTCACTGTCAUTCCCCTA^
SEQ ID 5742

LHLHQVQIGFAGG^IWIFAIiIAVGFWnTO 
RI/TOSRITQRAQKRQGIXJKP 

SEQ ID 5743 

GTGCGAAAGTCATCTM'CCCTGOa^ 

SEQ ID 5744 

VRRSSI PAKGNPDVKMPSEIRFRDVQTASAFEKYAD 

SEQ ID 5745 

ACGTACTCGG^CTGCAGGGGACCTGGGTTTC^ 
CGTCATCGACGGCGACGGCGCGAMCATAAAATTCX^ 
AAAGTCAGTGTACGTGTGTTTGA 
OTATGCTAAAGAACAGCAGGATMGGCXM^ 

SEQ ID 5746 

LKNMQIKKIMKffljPVAL^^ 
KVSVRWETDRYQREVAQVSAGKTOUM 

SEQ ID 5747 

TTGGGCAAAAGCCETAAA^ 

ATATCGACGGTGCGGACGAAGTGMCCAf^^ 

AAGCAAATATGTTTCCCGGCTGGGCAAATTOT 

TACACCACGTmACGGCMCCAAATCGT(^ 

(XXIATGCCGCCGATGTGTTGATTO 

SEQ ID 5748 

IXRSGmKGAVSTSKKSGELLAKYDI^ 
YTTFYGNQIVDVHGIMDQPLTMED^^ 

SEQ ID 5749 

TTGAAAAAGCCGTGGTGCCyAGGGGGCAGGGGATTCATO 
TMTATTTTTTGCTAGAGCAAATACTAGAAAAGMTATCGAGATATTT 
TCCGGAAAAATGGGTGPTGGGTTTAlTAGGTTrAATA 
TTGTCAAAAraGTOGGTATTGTTGCra 
TTCTATTTTTAATATTGATTTTTGTTO 
TOAGQGTTATTTATTATTAT^ 
SEQ ID 5750 

LKKPWCQGGRGFIMRCFMFSTVITAAVLYM 
LSKLVGIVALVTVASCGADNIGLFVPYFVM 

SEQ ID 5751 

TTGTGTGCCGGACAGCATATCGGCAAATCCGTCM 

SEQ ID 5752 

LCAGQHIGKSVKACPKRGraSNVKSSNIS 

SEQ ID 5753 

ATGCCGTCTGAAATTCAATTC^AAA 

AAATCCCATTTGAAAAAGGGCTGCPCGGGC^ 

CGCXXX&GAGTTCAAAGA!FGCGGACAGtt 

CTCGCACCGCACATOCQCAAATQCCB'GCC^ 

GCATAGAGGCGCAGGCGGCGGTATTGCTCGTACGGAiTT 

SEQ ID 5754 

MPSEIQFKTSGTQMTORI($GYDVHQLT 

LAPHI PQMRANIAADLG IDI SCVNIKGKTNEKLGYLGRMEG I EAQAAVLLVRI 

SEQ ID 5755 

ATGAAACGCAAAAATATCGCGCTGATTOCGCCGCCGGC^ 

TTTTTCAACGGCATGAGGCCGTCGATTTGACCGTC 

CCGCGCCGAAACTGTCC(mCG^ 

CGGTTGATAGAACAGGCGGGCMCGCCGCCGAAGGD^^ 

GGCAGGCGCAAACGOm(X^TTTTCM 

ACfGATACAGGGCGACXXJGOGCAATTOAAA 

SEQ ID 5756 

MRKNIALIPAAGIGVRFGADKPKQYVBIGSKTVLBHVIiG^ 
RIJEQAGNAAEGGILAVFVADTLKRABS^ 

SEQ ID 5757 

TTGCCACACOTTGCGGACAGOT 

GCXMGAATCAGOGCX^TATTTTTGCGTTT^ 

CAGTTCGGCTraTCCGCCTT^ 

CGCGTC&TCGCCAGA'TAGACTO 

CCGGAAACATTTCGCGCGCCATCXK)CAGGGTATC^ 

GTGGATAATCAGTTCOGCACCGCGCAGAAAATCGGCA^
TaymcGGTMm^

SEQ ID 5758 

tPHRCGQFRRGSVHRFSTPEPV^ 

QFGnRLNHFQFGTBFHFGRCNRFGCRFFI^ 

VDNQFmQKIGNIiP^^ 

SEQ ID 5759 

ATGATGACACGCCWATCATC 

TGCCGATTTTCra^^ 

ACCGATACCCTG6CGATGGCGCGCGAAATGTTTO 

(XXSMCTTTTGGGCGAAGTCTATC^ 

CAAACTCAAAGTGATTAAGGCGGATGAAGCCGAAC^^ 

AACGCA 

SEQ ID 5760 

MtEreQIILDTETTGLYMX^MiW 

HA 

SEQ ID 5761

AAACCGCXX^TGTTTGTCATGATCGGGTOT 

QCTCTCGGGGCAGT^ 

CTCTTGCTCTGCCTGTTTIK^ 

ccOTACtmrGTomra 

GTTTATGCCGIXXX^ACCCGCCA^ 

GGCATCTCGTGGTTTTGGTTTGTCGGCGCGG^ 

GTATTGCCXXXGGTTCGGTACTGTGTGOCAAGTTC^^ 

CGGACACWTTTTGAAGGGCTGAACGGC^ 

TATA(XTGGCTGCAAACCGCCAGCAGCGAGACTTTO 

TOTK&CACXAT^ 

SEQ ID 5762

LLLCLFCMGAQSTLFGPUCyMLPDYLOT 

(ftSWFWFVGAVOTQLPTFTQIHI^ 

YTWLQTASSETFRARAVAANNIVra 

SEQ ID 5763 

flTCCTGTOOGGCACRCAJ^ 
SEQ ID 5764 

MLSGTQAASNYPDnLTK^DGASYSYSDQGP^ 
SEQ ID 5765 

SEQ ID 5766 

MKMLSGMDLLGGQGCIPEQRGKPAACLPSP 

SEQ ID 5767

GCATTTIK^TGGATGATGATTCGACMGTAAACTG^ 

SEQ ID 5768 

VMESMFILmSIIIAmGWFPWWSGK^F^ 

SEQ ID 5769 

TTGGAAGATGGGACXaXXAACCATG^ 

TGCGTAACGAAIATCT^^ 

SEQ ID 5770 

LEDGT(MVLSPFGQTEIPQYWRVVKDRIEIiFFD 

SEQ ID 5771

ATGGTTGCCCXTKXXIATCTTCCAAm 
CTGAAATMTGCATAAATTCCC^^ 

SEQ ID 5772 

M\^UIPIFQFNGKTPALQPCFRFVVGKKDEKAYMPSRLTO 

^ gtcaaS^ 

CCTGTTTT 
SEQ ID 5774 

VKMAEVGYFDGKK6DMRFACVFGMGREHFFAFPPSKLVHSPCP 

SEQ ID 5775 

ATGTAOCAGTTTGGACGtffiAAAM 
AC(KXXX3ATTAAGCCAAGCMTTGCAA^ 

SEQ ID 5776 

MYQFGKEKCKEILPPPAENTGKPHIPLFS^^
S ^!maSg^^

SEQ ID 5778

LIKEAGYSSKKSIiQIiIfiLIGRLRENGGSEVP 

S ^^Ki™AGAAT^^ 

(»TTCaAJGCCGTCTGAAA^ 
SEQ ID 5780



ATCKXXAAAAATTGATCTTOnT 
AI^M^ACJflTCWAGTAAACGGC^ 

TTCCGCAGGMTTCTATGAAAAATACGGCCTGCSCCGCTACGGCGK 

GQCaTTGAAGMGAAGCCSXX3MGGGGCATAAAGGCGCGAMTTCGCXT^ 
ACTGGTCOTPACCGGCGGTATCGGCGAAAACra^ACA^ 




IDPSVFGF 

NAGVITTADSKAVAVVIPrNEEIjMIAHITrARLSGL 
SEQ ID 5783 



TK3GAAGAGJiAAGACaTOTCTATAATTGTCQGCGKST



WEEKDIIYIOER 
SEQ ID 5785



CTTAGTCGMCTCATOKXaAGGAAMCAAAACACTGGAOT 

^CTGCTGAAAAACiaTCAGAAGAAAAAAGTTTOSGTGCT 

CCGATACCAACCM^ 



ScSamSo^^ 

CGATGGGCX3CG<K3CACGACCGTGCAG<3MGTCAACAAA7I^^ 
AGraTQMGGGMTAAAGGGAATGTTTCCCGGTOT 

NGVVimtMDGDSR^^ 

QIPBBTAEKAMGKVEMIHSBrPKERAlWAIiI^RKERIAH^ 

AGACCGAGGAAGCCCAACTAG^^^ 

AmaGOUUSOGGIAGA^^ 

TTTTOCTTCAGCATGTOjGTGATPTCGGC^ 

S^TACTCOTOTO^ 

TTTCGroSGCAACGAAOSGI^^ 

MWMjpTOHCGTCAACGAGG&TGQATTCGI^^ 

ncQCMraaxjsraiccM^^ 

CACGaTACTAATCGGAWC(?KXX^3AATCCU 

(XHEYDDVGQAFENALRGTQAQEW^ 
FAMVIQORAFBSRAMVQNQDQFIiGHFLTPE^^
SEQ ID 5789

ATCTTTTTCCACAAAAGATA^ 
CAC(X3TOTCACATCTTTGGA^ 
SEQ ID 5790 
MFFHKRYTHRQTPAWIFSYAYFIJIVFI^^ 

SEQ ID 5791 

TTGTCAGATAGGAAAACGGCAATATCCGC^ 
AA 

SEQ ID 5792 

I^DmAISAIHTEAVQRCmVYKS^mKLKDKRmrKK 

SEQ ID 5793 

TTGTTCATACCACTCGATAMCTTCT^^ 
AACGCCTGATGTTCGGGCGTGTTO 
ACCAGTCGGCATAAAAATCCAA^ 

TAATAGGGCAAATGCGGOGTGGCGAGGTAAA 
CGGCGACCSGCAAGCAGGATAAAGCCGAMGCGTAm^ 
CGCCAAAGTGTAAAGTGCCAAACCGCCCAAAACCGCA^^ 

SEQ ID 5794 

IiFIPLDKLVCFDKTQQRLAAMVGAD^ 

OXjRMEGQSdVQRGRFAVCALPAEPRAAYQQYAAQCBCHSFRTA^ 

RQSVKCQTAGNRIARLTDIAQG^ 

SEQ ID 5795 

ATGC«yOAA3TATTOTCTGAAACG^ 

(33GAAAAGGCAIITCGTGCCGGAGOT 

OTTGTTGGGACAGCCTTCTTTCAOT 

Q}GTATAAATT66TTTTGACCTATCA6GGCTGTGCCGA^ 

CATCCGCCAAAGACCGTTTTT^ 

TACCGCTTGTATOTATC^^ 
ATCnTCAGGCHTTCGCTCTGACTTATACGOT 
CGGCATCGGCTTTGATGGTCGTCTTGGCGCTG^ 
CGTTTCCGTCTTTKPTATGGGTft 
G(»CTTTACACTTTCGC<n^ 
TCCTGCTTGCCGTCGCCGTTTACCTC^^ 
GAMCGCCXSTCCGAAAGCTGTGGaTT^ 
AATCCGCCGGCCGAAGCAGGCAAATTTTCGGAAC^^ 
ATITTTATCCCGACWOTGa 
GCCCGMCATCAGGCGTTGTTGAAAG^ 
ATCGAGTGGTATGAACAAAACCGC 

SEQ ID 5796 

MRTNYLSETDSIMKKMCLFAVFl^^ 

PYKLVLTYQGCAEAGVCYPPVIff^^ 

FVLSVVYVQGIiAIiTmVG 

ALYTLAI/jTGVPLIAIGTFGGHILPKAGDWMNAVKYArc 

NPPAEAGKFSEHGKMFADTAALKAAMDTALKE^ 

IEWYEQNR 

SEQ ID 5797 

ATGTACGATGTCAATAGCCU^GATGTCCGCCG^ 
CCGMTAtt^CCGTTATCTCGAACGCATCG^ 
GGGCATAGACCAGCCGCACGGCATACGCGCCATCCAC^^ 
CGCTACGGCACCGGTTTGGATGTCM 
SEQ ID 5798 

MYDVNTHBVRRFFARVWQQ^ 
RYGTGLDVNFYffllUiRKUGIiGAEDQARIjSHlSIA 

SEQ ID 5799 

TTGGCAATGGTTTACGGAATCGACTACTCCGCC^ 
TGGGCAGGGTCAACCGCATTGATCCGCT^ 
mCMTTTCATGCmmTCAACGMGACAM 
AACCCGGCGCX3ATTAAGAAATGCAAMCTCATCACCT 

SEQ ID 5800 

IAMVYGIDYOUJAECttiKIKN^ 
NPARLRNAKLITYHL 

SEQ ID 5801 

TTGCTTGATTTTGCAGGTGTGAAATT^ 

SEQ ID 5802 

IiLDFAGVKLSHBQACPSYSBIRGRVHQPVLK
SEQ ID 5803

TTGGGGGAAMTtaftTG^^ 

SEQ ID 5804 

LGENECRIjCGVVPLQTAFCRRSVWSGDFMHV 

SEQ ID 5805 

ATCTCGGTTGGGCATOT(&G 

GCAGTATGCOGTCTGJAGCCAGGTTGTOGG^^ 

TGTGCCGTTCGGTTTGGAITrGCAGCC^^ 

AACCCTGTCGGACGCATTATGCT(XjC^ 

ATTGGCAAAAGGGCAQSGTTTATATCCXX^ 

(rGCXXHXXttTTCCG^^ 

GCGTGCCGATACGATGTGTTTGCACMCGCC^ 

SEQ ID 5806 

NITOIIMIALYGKTDAVCVAQS^ 
ACRYDVFAQRPVLIMDWLIMLKRAI^ 

SEQ ID 5807 

ATGCCEAAOCXaJ^TACCTO 

SEQ ID 5808 

SEQ ID 5809 

ATGCCGTCFGAAACGGAACAMCCCTTT^ 

CG(XX3(nTO^G(^TCGTCGGCTTCAAA(^ 
AAGCGCCACATCGACCGTATCGCCGGCAT^^ 
TGATTGTTGQCATCATACATCTCATMT^ 
SEQ ID 5810 

IO*HIDRIAGILTVQRLFHQRENATO 

SEQ ID 5811

GTGTCAGAATGCGOTCTMTCAATCTQ^ 

AAACCGAAGAACCGATTGCGTATCTGCACGGCGGTO^ 

TGCGTOXXSCGAGCAGGATTCGGAGTTGGTCrc 

fATCGTOTAACCGATCTO^^ 

TCGCACACGGTCATGTCCACGGTTCGCACGGTC^ 

SEQ ID 5812 

VLECACNQSAVPEimOiAIAKNSWSI^^ 
YTlVTDVADGRAVVDGNHPLiAGMKIRFKATVESVRD 

SEQ ID 5813 

ATGGTnTTGAGAGTGTTOTAAAAAATTTACA^ 
TTAGAATGCGCCTG 

SEQ ID 5814 

MVFESVVMjHCNKITKIYIEICAVQFM^ 

SEQ ID 5815 

TTGCAGTGTAAATTTTTTACAACACTCT^ 
CCTATCGGATimAACAGCAAAACCCnTrT 
TAATTACGCAAATTCAGOGATGAATTTOCAAC^^ 

SEQ ID 5816 

LQCKFFmsmpmPTSPicmsYKiur^ 

SEQ ID 5817 

TTOTGimTGGOT 
AAOTCCTSCTGCTGAACXA^ 

SEQ ID 5818 

LliSLAIiAACGGEKAAEAPAAEASSTBAPAAEAPAAEAPAAEA 

SEQ ID 5819 

TTGCAAGCGGAAAATGOTATTTTGCAGCT^^ 

AGCCAAAGACAACAAAGCAGCGGCAAACAGGGATT^^ 

SEQ ID 5820 

LQAmYFAASAMJASAAGASVAAASAAGASAAGASAA^ 

SEQ ID 5821

TCAmtHTO^^
ACGATAAAACCATTTCXXSCAACCTTCAAG^ 
OCCACOCTGCCAGTCTTG^ 
GAAAACGTCAGO^TGCTGGCAAAC^ 
TAGCACA(MAAC6CAAAAGGGTCT^ 
TTGGGATTTGCAGGACACAGCCGTA(X^ 
GTTAAAGCGTOGGATCAACAGGGTTTTAOC^^
AAGAAACACTATATCGt^^
(XXXXnCMCCTTGOTACG^ 

SEQ ID 5822 

MQHSSPTTYNPMHISLFSGCGGLD^^ 



VKAVfflEQGFTVQASGRQCQLHPQAPKMI^GANDYI^M 

SEQ ID 5823 

GTGCSGCAGTACTTGQ^^ 
CTACCXXTmCTTG^^
TTTCGTCGCXXjCCTATTGGCTGGTC 
GCCTATGOOGTCAGO^^ 
GCCGTCTGAAAGACCCX^ATTCATCC^ 
TGCCGTCGGCTTOTCCTGCTCGCC^ 

TCTGACCGCOXXETGTTCA^ 
SEQ ID 5824 

WQYLPYSVRWCFFINPGMILHRQIHiE^ 

AYAVSGDLNIiLRAQVHLNMAAVH^ 

TGAAKLQNLPASAPLHLITLG^^ 

SEQ ID 5825 

ACCTGCCGCCGAAGCTGCTGCTGC^^ 
GCAAAATAAGCATTTO^ 

SEQ ID 5826 

LPLLCCMMiPAAKKKPLKLIi 

SEQ ID 5827 

ATGCCTGACCTOATACCGAATTra 

AGCCGCGCAGCGTTGAAAGCGTGCAMCCATTArcCGTO 

ATTGCTCAACCTTOCAAACT^^ 

CTGTTCCCACTCAGTCTCGCCAGCGAAGGCT^^ 

AAGTCGTCCTCCCCAACGGCGMCTO 

TGCAGTTTTGAACTCATCX5GCCGTOT 
6CAATCTTGAIP6ATCGGCTTGTC6AATTTCTTT^^ 

TGCTT(XX»CATCR3GGCG^ 
CCCGCMCGGCAC(&TTGC^ 
ATATAACATTATGA4TCCTGGCAMCTGCTTCCG 

SEQ ID 5828 

MPDLHOTSRLLPMffilAE^ 

IiFPLSIASEGS(^IGGlfEACNAGGIiN™ 

CSFELIGRFAAKLSSEFSKLPLPTHS^^ 

SEQ ID 5829 

ATGGATTTAGATTTTTTCCATCGC^ 
GTK^AAAAATTGGTGGAAATTAaCGA^ 
SEQ ID 5830 

SEQ ID 5831 

ATGACAAGGAATACACTGAAGAmOTTG^ 

SEQ ID 5832 

HTRNTUCITIiIASPEVSGUCR^ 
SEQ ID 5833 

toggagcattagaattaacaaattctctatg^ 

mgmgattatgctggcggagctcgttttcgt^^ 

gggaaaagcotg^ 

tctttacaacaacgcccaga(x^ 

acagtgatattttcgaccgcactttacxiacgc^ 

gcaactcggcggcgaqgt(n?tcacatggcaaaac6aaa^ 

acactgaatatgcta(^gaacgttttaotcaa 
acaaccaca0gc6caatfgacttatttg6gg6t 
gcgcaatttgcotcac^tggcgt^ 
cggtaattgaaactca^^ 

GTGGACGTTT
SEQ ID 5834

SI^RPEGFFTKVQERDDIAIYAQQAQAAl^^ 
TGNVKGFGAGVYATOQLQIKQTGAyVDSWMQfYQRFRM
AQFAFTNGVTFQPFVAVNSHQQKPFGVEIIX3^^ 

SEQ ID 5835 

ATGCGGGAATGTITGGATAAGTKXKXCGTT 

TGGTTTTGTCG 

SEQ ID 5836 

MRECLDKFARYFLCDKFIiL^^ 

SEQ ID 5837 

TOTTCAG!OTTATGCTAAAATCC^^ 
TTTCTAffTCTT 

SEQ ID 5838 

IJSFlGiKSTPraiiLDKGKPM 

SEQ ID 5839 

GTG6AOTTAGCATAAA&CT6AACAAAAAATGTC&OT 
TGTATAATTGCOTCGTTATTTA^ 

SEQ ID 5840 

VDFSIlOMKCHLVQIFSIYFLPIKSKMiI^ 

SEQ ID 5841 

ATGOXSTCI^CGGOT^ 

AAGTACTGAAAGAAAATCT^^ 

(XHTITGGACGCGCGCACCATCGGCGTGAAACC 

CGCGTGCCGATGCCCATGCTGACCGAGGAACG^ 

TCAAAAAACTCCTCAMGACAAAGAAATTO 

AGAAGACCPGATGGCGGIT 

SEQ ID 5842 

MPSERLGARSNGCPraSYWOJNMIiroiQKTAEGK^ 
RVPMPMLTEERRKDIJKWRGEAEEXnW^ 

SEQ ID 5843 

GTGTGTACXOTATGCCGGACCTGCATGTTGAAACATAAAGGAAAAAATC^^ 

CACGGACACGCCGAlCTAOCACCATCATa^ 

GC^TCCCTCCTCACT^ 

(Xa&CGCGGCAAMGTGTGCCTCGAAGCGTGCAA^ 

SEQ ID 5844 

VCT1£RTGIIjKHKGK1^ 
RRGKSVPBSVQTVCQSL 

SEQ ID 5845 

ATGTTTATGGCTATTGCGAACMTAAAAAAGCAOT 

AGCTTAAAGAGAGTOTATCTATrGGA^ 

GmTTAAAACAGTCCX^ 

AAAGGTAAGAAACAGCACGATAAACGACAAAGTATGAAGGAAGTCGACK^ 

SEQ ID 5846 

MFMAIANKKKAFHDFPIBDMEAGLVLEGWEVKAIRA 



SEQ ID 5847 

AGTGGTCGATGGCGGTGTTCGAG(X3TATGCCGGM 

(EGATACGACAGGGTTATCGTGTTGC^ 

ATACGCAGGCTGGATAAGGAACGTCTGCCTPTGATGff 

GGCGGCGGGAAATTTCTGTCGAAACTTTOGGTTO 

OmAACACTATTCGGAGGCGGGTTGa 

ACCGATTTGTtX^AGGCAATGGATTTGC*^^ 

CCAGCCCC^CGCATACGCCGOTTTGACX: 

GTATCCCGAGAAGATOTGCAGGCGGTTGAAGW 

SEQ ID 5848 

MSIKILIISPSWIGDCVMTQPLFRR^ 

IRRI£KERLPU4VDRY7ALAHQS^ 

TOI^ANDLLSIAMW^ 

SEQ ID 5849 

nGCCGCAACCAGACCCTCAAGGTTTC^ 
TTACCOTXOTCCTOAACM 
GCAGTTCGAGCGGGATTTTTGGGACMTA1CGG 
GCCGCTATTGOCAOGCOTTGGGC(M 
TGCGCGGACGGTTTGTGTTTCGGGAAGAC(X^ 
(^TO^GCAGGCAJK^
GGGCAGAACMGGTGTCTGTCATGATTCXffl^
AAGTTCAGACGGCATTGTTT 

SEQ ID 5850 

LPQPDPQGFRRSLFLYERAQ^IGDFAVKFMIGIRSKR^ 

AAIATPLGm»AVFGVK6IiCIiIiEFV^ 

GQNKVSVMI PCHRVIGSIX5KLTGyAGGIMKQFIiLAliffiGEVQTAIiF 

SEQ ID 5851

ATGWkTCCACTTATTCAT^^ 
ACCCGTCCGAAG&CGGCGGCTTCACGOAAA 

SEQ ID 5852 

MNPLIHQAKBSSMQTRIIiSAVL^ 

SEQ ID 5853 

fl^GGGTOTCTCAAACC^^ 

SEQ ID 5854 

AKPAFERALRLRLFGRQCF 

SEQ ID 5855 

ATGAACCGTCGTCAATTTTTGGGCAGCGCCGCTGC^^ 

CATCCCmACACCGCaJFCOX^^ 

TGCOTTGCGCCAGATGCTTGCCTTATGCX^ 

TGTAAAGAACACTCCGCXXACCATGCGGAATGCAAAG 

SEQ ID 5856 

HNRRQFUSAMVSLASAASF^ 
CKEHSAHHAECKACYBSCLDC IKECEKLAA 

SEQ ID 5857 

TTGCCTGCCTCTA^TCTACaTCGCATC^ 

ATAAAACTGTTGTCGGGATCGTGTGGGAAACG^ 

TTTGTTGGCGTTTACGTOGCGTTAT^ 

GCTTTGAACGAA6CGGGCAGGGCGCAAACGCCGCCGOT 

CGCAGGCGGCGAAATTGATTGAGGATTGGGCGGAGCAGGGC^ 

TACC«m6CAGAA(»CTTC 

GTGTTGGCGCAGGGGOSflCAGGTGTTGTTTC^^ 

TGGCGGC^GGCAGGCQCACGCAGGATTATCTGCGCGCGATt^^ 

TOK5AGAGTTGGCACAAGGCGCAAAGCGGCG<^ 

ATAACGGCTTCTCACCGCAAGCCTTGC^ 

(XATACCTTCGGCTGCCOGMC^^ 

AACCAAGACCTGACCGCCGTCGGACACGGTACT 

GGGCAGAlTiOTACCGCOGCATCGCCAflCGAO^ 

CAGCCTGTACAGCGCGGATTTCCGTGCGOT 

(^CACCCTGTCTTCGCC^ 

AT(XX5CCGCGTGTTGCCGATCCGA 

C(^CGCGMCGTGCGCMGTTTTCCTC^ 

GATGTCGATCCGCAGGAGGCT 

SEQ ID 5858 

LPACHIYHRIAVNWLSIKIjLTySHSEPLPPGTRVLVPFRNKW 

AUJEAGRAQTPPPARFHKKAAMI^^ 

VUQGRQVLFLLPEINLT^^ 

UiSWHKAQSGAYRLLQLTl^^ 



EHPVFMVKAQDYAVPAQIBIiNERQMPAMPPFGP^AVRAI^ 



SEQ ID 5859 

TTGCGTACCGTCGGTG<?ICCGCTCGGMOT 
AACCXyiDCAGCACXjGGTGTGGATTCGGCXjGT^ 

SEQ ID 5860 

LRTVGGPWSSGAGGISAHGAGQIU^STVNFASPKLFAL^ 

SEQ ID 5861

SEQ ID 5862 

VRRYAAGSASBRTTDGTQPNSSLKRALKRDGQS 

SEQ ID 5863 

GTGTTTCCTGCCTTCGGGAGGACA^ 
CGGACATCGGCGGGAATCTKX^
TOCGGGCGMGCATTGCCGCCCGAACAGC^^ 
AAAGGCAGCGGT 

SEQ ID 5864 

WPAFGRTRMVARRAEVRAGLQRAFGRPASLCGNA^ 



SEQ ID 5865 

TTGAGGAATAIAM 

T«5TTATCGTGCGGCm^TGCMGAAGCTGOT 

TTCAGACGGCCTCTATCAGGCGCM 

CGTATGGQCGCGGCGTTTGATTATG^ 

TTGCCGTGTACGGAGGCGTCMCCTGATGCGG 

GCGGCGG(XGTTGG(XXXX3CAGGACACGA& 

GTTGGCTATGACTGCCCGTCCGOT 

TCGGTGTTCGACCG 

SEQ ID 5866 

LRNITVPSNYFHTEERLGAI^ 

RMGAAFDYAAAVLADEEVPHNAAEAALCGVLAWLQH^ 

VGYTCPSRFSARFEIiFGCVPS^ 

SEQ ID 5867 

ATGOXXSTCTTATACACCMTCCMCCCGGAG^ 

MGCGTTGATTTCCCGCCXXjTCCGTTACCX^^ 

(^TCTGGTTGCGACGCXSGCACGAAAGCT^ 

GGAAGATOTACGGGCGCGGCGC^ 

CTT(XGACGAAGAGGGCGATGCACTGGACGGCACGAC^ 

CGATATGATTAAAAACGG(XXjGCGCGGCTCG^ 

TTGTTAGAGCTGACGCAGGAMTTTGGGATGAAGGCA^ 

ACGTCAAATTTAATTTCCGCTTCTCCACAGAGTO 

G{XCTTCCTGACCCACGCGGGCAAACTGACCG 

GCCATTC05AAAGAACW^TCX^™ 

GGTTGTTGGCTGAAAAGGCOGTC
SEQ ID 5868 

MPVLYTNPTRRSSENQI^DISDD^^ 
GRLYGRGAAEMKTSIACFWACERFVAEH^ 
liLEI^BIWDEGNEYFPPTSFQISNB 
AIAKKLIELGPSNATIHQINENViaiDDIPKLSA 

SEQ ID 5869 

ATGCTGACGCCCAAAAGCTGCGATTTGTK^ 

TATGGCAGCAGTTGATACAGCAGGTCGCCOCAGATTTGAGCGAACC^^ 

OGCOCAATftCAAAAMTCOQC^ 

AACXX&TGGCTGGATGATTTCGATAGX^ 

TGCAAMCGACGMGACTTTTTCTGCATC^^ 

CTGTCACGGCAAA 

SEQ ID 5870 

MLTPRSCDLFNIPFFQFAQLKKYQPES^ 
NR}TCJ)DFOTEKYAAFIJKWH^ 

SEQ ID 5871 

ATGAATCXX3TTTGAAACCCAAA 

AAAACGGCTGCAATCA6CTTGTTTTGCAAA 
CGCGCCGCAAGCCCGTGAAGGCGTGGACGAGCTGTO 
GATGCGGMGCGTTCAAACGCTTTGTGGAGGGATATGATG^ 
TCATGGCGGCGCGCCGGTGCAAA 

SEQ ID 5872 

M^i^SVTPAEPimYACHGKVWlFCGQIA^ 

DAEAFKRFVBCmJVHIiAIEEPIiPEM 

SEQ ID 5873 

TTGAGACGATTGTTTCAGACEGCATOTT^ 

GCX^GATGCAC^TCATATCCCTCCACAM^ 

CGATATGTTGGCGCAACAGCTCGTC^CGCCTTCA 

CCGGGAGATTTQGCGGATGGTTTGCAAAAGAAGffl^ 

ATTTCiATCGGTTCXmAAOT 

TGTCTTT 

SEQ ID 5874 

LRRLFQTALFAPAREHDFADTOLLrcDKR^ 

PGDLAIXSljQNKLIAAWGDIARQHm^ 

SEQ ID 5875 

TTGAACGCCmiraCGAMGGCAGATA^ 

GTTTTCTTCCGCGTATMTGGGACACAAACGC^ 

GCGTCGCCCOSAAGACGAAGT^^
TOMTCGGCA^mCTGC^OTATCGAAGAGGC^
ACAAACT(&TTGC^ 
CAATTTCCTa^TCTGGCm 
TTCGGACCXyOTCOGA 
SEQ ID 5876 
LNAEPRKAIMSSTQTILEFffAIPRHIAVIMDGNGB^ 
FNRQIL(£IEEAEMTANNTGLTI£ 



SEQ ID 5877 

GCTGT 

SEQ ID 5878 

LINLWISOTERVGKVALYQTSVAPALRRGSimKIASGC 

SEQ ID 5879 

TTGCACTTTGATTTCCGTAAAACCGM^ 
CCGC^CCTGCTGCCGAGGCMC^^ 
CGAATCCMCGACMTATGaGTTCMCAOm 
CTTCTGATTGCCAMGCTGMGACATGGACGGCGT 

&XXEGAAGAGTCTOCCTGAOT . 
SEQ ID 5880 
UJFDFRETEYYFIDWRF^ 

SEQ ID 5881 

GTGACOCGGGAAGGTGCAGGCAAATTTCTAGTCGTO 

TCAOGTTTGACATACnCGGTATCGGCAGCACCTACOCCG^ 

GTTTCAGACTGATGGTGAACTCTTTACA^ 

A^TCTGCAGGAGCAGCTTC^^ 

GCAGAAATCAGAG(£AGATMGCTTTCATOT 

SEQ ID 5882 

VTREGAGKFWAVSQFGRIQSQGR^ 

SICRSSFGGRRFGSGCFTGRSCLGSRRGRFLRAGSQTDNGG^^

SEQ ID 5883

GTGTTGCACAAACAGATfnTTCGGGO 
TCPTTAATATTAGGGCAGCOGTCXXXmr 
GGGATGCCGTTCTTCCAACTTTGCT^ 
GACCG!EATCGAAGCATC^ 
CCGTC 
SEQ ID 5884 

VliD^QMFRGGIUjQTSFEIKACLQVYNRLPKKRAIL 
DRIEASSPASCGNTOGAIIFKRDAAIIjPAAKRTSLPSSIHAV 

SEQ ID 5885 

ATGMTTTGGAACAGTTGGGCAC&C^ 
ACTTTCAGACGGCATGTATCGMGACGC^GG 
TTCGATACGGTCTGTTOGGTCAGGCTGGTTC^^ 
GAACGGCATCCCGAATTGGCGG^ 

SEQ ID 5886

MNLEQLGRRDAUiSGIiliK^ 
ERHPELAEATiRKT.VRKYGA 

SEQ ID 5887 

A!W3CTGACAAACATTGCCAA^ 

GCGCCACTTCGACGTGCAGCTTATCGGCGGTATGGTO^ 

GCCGGCAAAGGCGTGCACGTCGTTA(XGTCMCGAOT 

AGCCGTTCXSACCGTCAAMCGCCTACG^^ 

ATTGMTTTTGCCGTTGTCGATGMGTGGACTCX^ 

O^OCCC^mCGTCCtmAGAim 

aaatggggctgc^regcagaaaacgactcgct^ 
ccaagacggcgaaato^^ 
mccaaaccctcxkttccatcaccttcx^ 
ccgto^tcosac^^ 

Ga^CCGTCXmTCGGTACGACCAGC^ 

ATTCyttXKXmGCCCm^ 

AAACCTTGaGOaCGAftGAQAAAtt 

AAGCCGCCGTATO3ACAACCA(n , TGC^ 

GCCCmTCCimCCGC^ 

AGGTOT^TACGADGAOn'TGCCAW 

CAGCGGACCTCGTC^TAOTATATGCCGC^ 

GAAGGCGGACAATGGGAT
SEQ ID 5888

MLTNIAKKIFGSROTRLLKQYRX^ 
AGKGVHVVTV^YLASRDAGIMEPLYHFLGLTVOT 
PPHLVRQETEEGEGDYWVDEKAHQVILS^^ 
NQTIASITPGNYFRLYmSt^^ 



AHLNRIAPERGVAIBHNLLTRQIEGA 
EGGQCD 
SEQ ID 5889 

OCTGGCTGAAGGCGGACAATGCGATTGACGGTCAAGA(^ 

CGAGCGCAAOTK^TGTTGCAGGCCATCGW 

TATAAACGCGAAGCCTTTACCATGTTCC^ 

CCGTCCmACATCCMett^ 

AGCCTTCGAAGCGCGGGGACAMTCGTCC^^ 

SEQ ID 5890 

MPSADLVPITHPPDSMEE^ 
YKREAFO^QDIA«NGIKFHIASLLTSVQIBQNPV^ 

SEQ ID 5891 

TTGCCTGAACCGCCACATCO^^ 

TGGCCAAAACCGATATTGTCG(X!ATCATC^ 

(^COVAGCAGTTTTATCAmiT^ 

ATGATTGTGCCTAAAGTGCGCGGGCAGMC^ 

TGAAATTCAATCCGGCT(XX3AAAGOT 

ACCOTATCCGAWACCGCGTTGGTGGA^ 

GGTTTCGGCGCOGGGTGCTGGACGACTCGAAAOT 

AGGCAGAACGGATTTOX?K&TCGAAGGCT^^ 

GATGCGGCAGGCX^CAGTATTrATTTCTGTTTC^ 

TTATTCCTGCCGGAAGAACAC^ 

GCTGGTOmTCGACCC^ 
ACATT^rrcCAACGOmTCC^ 

AGGCGGCGGCGMGMGATTGCGAAAACTTCCAMTCG(^ 
AGCGAGAAAAAACTTTTGCTGTCGCTGCTGW 
SEQ ID 5892 

LPEPPHPTAIPKNPDFVPSVPKTDI PS PHIHIPSDFIDELIiAK'rDIVGI IDEQVPLKKGGANYMACC PFHKEKTPSFSVSPTKQB7HCFSCGAHGSAIGFVMEHQGI*SF PEAVQFLADRVG 

MIVPKVRGQNDNPEVRAERKKKQQTLEETTAAAADFYAQQLKFK 

GFGGRVLDDSKPKY^^ 

liFLPEEHDPDSHRAYGKAQFE^^ 

SEKKLLLSLLTAKQN 

SEQ ID 5893 

ATGCTGAAACMCGGGTAATAA(XXX^ATG^ 
GGGACTATGCCCGTATGGCCGGTOGTGCAAAAC^^ 
TGTTGTTTTGGCATTTTGGOTGCCG 
CTCGTATCCCT(XXXXCG(^TCCCG 

acc(xx^ra(Xx:ccG(^(^anK3GGAA« 

TACCGTGTTMTC^TTTGGTGTTGACCGTTG 

GTGTTC^CCGTACCGACAGCCTGATTGOT 
SEQ ID 5894 

MLKQRVITMfln^^ 

LVSLRPHPDDALPLLAVMGLVWAD^ 

VFDR.TDSLIAVISVYAAIMSVLH 
SEQ ID 5895 

ATGAAAACCMGTTAAMAAATCTO 

lATCCGTTGCCGCCTOTP^ 

GCAGATTATCTACACCGATGCCGMGGCGGCTATATGTOGTC 

TCCX^GCCTTTGGACAMGCC^ 

CCGACGTGACGGTTTACAGCTTTATGAT^XCATT^^ 

CAAATXXSGTTACAGCCCGATG^ 

SEQ ID 5896

MKmiKMPrni^ACG^ 
SbPI^KAIKmGNGKLmWSDPKPFCmJ^
SEQ ID 5897

GCTGCTCGCCCAAAACCAAATCTGCACTT^ 

CAGCGCAAACTCAGCCGTCMACCXjGCGC^ 

GCCX^TGTGCTGCTCMGGCAAACCCAAATGTTTC^^ 

CGGAACAACCCmCXXXmCG^ 

AAAACCTGAAGCAAAM 

SEQ ID 5898

1£GXAVIAVSSAQAVRDPVLVFRIGAGAQ^^ 

ADVLLKANPNVSMGRLHAGSVLffi 

VPAAKTAASETAAESAPPRSRRFCHRHADRKHR 

SEQ ID 5899 

atgtcttaaaaccaaaatcacgaagmtaccaagac^ 

actccgaaatcaacgacgcactgcccgacgat^ 

catattgttaagcgacaatgccgcc<n^ccgacgatga 

c«:gaaatgggacaggtcgacctgctc 

aaatcttagmctcatcgaaaaaatctgcam^ 

CACAraCCOSAGAAACCTTCCM 

GGCaCTTTCTCCAMTCGAAAAAGACT^ 

TCCGTTTCGCCACCCGGCAMTCGACAGCCT^ 

CGACTACTTCATCCAAMOTKX^TG<X^ 

AAACAAACCGMOTGO^ 

TTCAGGCAAACTTGCGCCTCGTCATTTCCATTGC^ 

ATAOCGCCGGGQCTACAAMTCTCCACCTA^^ 

AACMGATGAACCtfATCTO^^ 

CCAAAGAGCCGATTTCGATGGAM 

CGAACTAACCAAAGAMTCCTCGAAAGOT 

GTMCOXmACGCATCOTCAAATC^

SEQ ID 5900

MSKNQNHEEYQDDARPLTIEEQRARI^ 

REMGQVDLLTREDmiAm 

G^AQIEKEYKKmGaEKKHSRHKDYLAYRIM 

KQTELADMEKBTRISIEEa^ 

NKMNRISRQHLQETGEEPDS^^ 

VTRMIRQimALmRHraSDllLRSFLI)SEDSKL 

SEQ ID 5901 

TTGAAAATTTTTTGGC^CATTTTTATGC^^ 

TATTGTG 
SEQ ID 5902 

LHFVfitf FMRQISLTDYFCKGLGLRA^^ 

SEQ ID 5903 

ATGACCGCACTTTTAACCGAAACCCAAAGGGAAAATO 
CGTGGGCGGA6AACTATTTGAC(^C^ 
OTGGGCATCAGCGGCATATTGGATTT^^ 
GCCCAAGOTTGTGCTTGGAAGAMTG& 

SEQ ID 5904 

MTAIjLTETQRB^DTRLIPLSALQ^ 
AQGIiCLEQfrcQTVSEGALWW 
SEQ ID 5905 

ATGAAATATTGGCTAGTTGGAGCCTCTTGGGG^^ 

MGGAGAAAAMTCCMGTAGGT<^TCGMTTGCAATCAA 

AAI^TCTGTGTCGTTGATTGGATAGTTA 

TTG 

SEQ ID 5906 
L 

SEQ ID 5907 

GTGATGACCACCX5AAGTCAACAGCGGACAAGTGCG^ 

AAAAAGACTTGGAAAAAGAACGCACCATGGGGCGCAAATACATW 

CTTAGCCAAACTCTGCX^AGCCCT^ 

CCTGCACATAAACTGTTTGACGCC^^ 

GTGTGGAAGAGTATTTG 

SEQ ID 5908 

VMTTEVNSGQVRGPVQLAFAQSIDPIVPPEVSITRMAVTNOT 
P AHKLFDAVKVERVKGBSGTP ASGPGDTOSWSIXil^SVEEYL
SEQ ID 5909

ATGAAAAAGGCTTGGAACCA!^ 

SEQ ID 5910 

MmKNHVPSIiFQ«PSEACRGSFRRHI£SmiNPAPMHLSD 

SEQ ID 5911 

mXIGACCATGGCCGTTGCGTCm 

AAAAGGTOTTGAAOTGT^^ 

SEQ ID 5912 

IiPmVASHVSLDMS^^ 
ANLQHYNPLPDVMAKAWn 
SEQ ID 5913 

TOAtOTTGW^^ 

TCCGTMTTTTGTCGAAATCAGC^ 

AGCCAAAAACTGCCGMGGACGAAOT 

SEQ ID 5914 

LSLVRGKSRRLFCMSCCKLPCWAKAKTa^ 
SEQ ID 5915
GT6TTOCGCCGTC6GATGW6AACAA6AM 

TGCGTGGAAMCGCCGCCGTCTATTTGGCGGCT^^ 
SEQ ID 5916 
VFTPSDDEQESmFKVLEQIGKGRPIiQEIAPEl^ 

SEQ ID 5917 

ATGATTCTTGCGTCCCTTGrcCOT 

TGGATAAAGMGGTCGTCTGAAAACCGTTC^ 

TTTGTGGGATAAAATCGCCTACXjCGCTTG^ 

GATTTACTGCAAAACAGCGAAGACGAA(^TTTACAAGCCra 

CATTTTCTCTTGAAAAACC(M 

CGCGCCGATOX^CGGCTGCATCCXK^ 

AATGCGCXTOITTCCGM 

CGGATGATATAGTQ6AT 

SEQ ID 5918 

MILASLVRYYRRIATETDETGNPKVPSYGFSE^^ 
DI^NSEDBGLQALCRF^^ 
NAPVSBQSAFAYTTAIiNYLLRREN^ 
SEQ ID 5919 
TTGGGCTGGATGCTGCAOGATATTGATTTl^ 

SEQ ID 5920

SEQ ID 5921 

TTCTACGCX^XXXXX^ 
GATTTTCCGJCTGCCAACGC6TTOT 
CCGCCGTCGGCGTTATCGGAAAGCGAGGCAAA 
SEQ ID 5922 

LYGSRMjPCAQTDAIX&QKKTMYKGQD^ 

SEQ ID 5923 

ATGCTTTCACCTOTCGGCCT 

ATC(XX3(HTRKXmOT 

CTOTTTCATCCGGACAAACGCGTTGGC^ 

ATCT(TrTTG6GCGCATGGCAAACX3GCG^^ 

CGGCATPTAGGCCGOTAACT 
SEQ ID 5924 

MI^PPRIUCTAAHQSSRLSFACGKNAAC^ 

IOX&W^AAAVQSKO^ 
SEQ ID 5925 

AT66CATACTCT6CGGACTTAAGAAACAAAGCOT 

ATCACCTTAGGGAAIKXjTnXCTO 

TCAAGAAACACGCTTTA(X?rGCG<^TTCGOT 

SEQ ID 5926 

mSADLRNKALKHSGLTKIRTRW^ 

SEQ ID 5927 

A!K£CCTCTGAAAA^ 
CCCGCCATCCCGAAAAATTCCGOTATT^ 

ATTTATCTGGCAAACAAAGAAACXXSXXn^
GC^CGGCAGTGTTGCCCGTCGACAGCG^

CXXOTmMCAACCGAT^^ 

GGCTT6GAACTGATT6AAGCXX1ATT66CTG? 

CGCAACTGGGCMTCCCGATATGCGAACGC^ 

GCCCGACTTCGGCCGCTTCC^^ 

CAGATTAAGTTTACCGACA3^^ 

GGGCATTTATCGGCACACTGCGC 

SEQ ID 5928

IttSENASIXjErfKVIblOTPQVLTIL^ 
GAACSjPSALAAAQKOTIYLANK^ 
GLEJEAHWLFNCPPDKLBVVIHPQS^^ 
QIKFTDIAKTVAHCIxAQDFSNGJOIEGIiIiAQ 

SEQ ID 5929 

TTGGACATATCCAAGCTGACATTTGACGC^ 
GATATCAGAATCTGCATATCTGCAT^ 

AGGCTTTCACGGATGACCGCAGCATACGCGCAAATCAAAAGC^ 

CCATGAATGAACAGCTTGMGTTITAAAG^ 

CGTATOTTTCATCATCTMCATOT 

ACOUtfmTCGCAACAGGAA^ 

TCATttTC(XX«aT1^^ 

SEQ ID 5930 

LDISKLTmTAMVGKTE^ 

KLSKKTAAYAQIKSECAAMKNRHHAAK^ 

•TKFGMRHJU^VLFliPAMVAYRIBAFPDnKRIiBEK^ 

SEQ ID 5931

ATGACTGATTTTGCCGACCGCGCATTCGAACGCGAA 
ACCOGATACCXKjAAGCAAAGCGAAAAGCCGTC^^ 

SEQ ID 5932

MTDFADRASEREAEFbVEAlAKHQPPS^AGPSHCEDCGDPIPE 

SEQ ID 5933

ATGGCTGMGCCGGCGGTGTTTTCAGACGG(XXj 

CGTGTGCCAGTTCTGCCA(XXX?im 
TATTCGACCAGCAGTCCGGMGGAACGGGCGGAATTT^^ 

SEQ ID 5934 

MAEA®3VFRRRLOTQGFHQKFGFAFGCAVGKISHV5D6IjSCAS 

SEQ ID 5935 

GTGMGCGGCAGACAMGACGGCAACCGTTTTAACGGO^^ 

TTCCITCCGGACTGCTGG^ 

TGAGGGCTGGCAGAACTCGCACACGAAAGGOT 

SEQ ID 5936 

VKRQTKTATOTALARTACTSTTVPS^ 

SEQ ID 5937 

ATCATOAATTTGTCCGAGCXAAAA^ 

(XXSCCTATCGGCATTCTTCTATGGTTGCGGCG^ 

(XXXX^TCAGCGMTTGGACAAAACGCG^ 

SEQ ID 5938 

MIEFVRAKKRLLWAFVLIM 

SEQ ID 5939 

TTGGCTCGGACAAATTCMTCAT^^ 

SEQ ID 5940 

LARTOSI IAAPPLICGPFDFRQLRNCRYAAFD 

SEQ ID 5941 

GTGOGTTCTTTACTCCAACATGCTTATG^^ 

CTACAGATGACTTATTTAGTCGTACTCTGGC^ 

TGATAAAAAAA(mTATGATTTTTCTAGAAACA 

GAAMTGCTGAAGAAATCTATTTTT^^ 

ACCTGAAATTCATTTCTACTGATTO 

SEQ ID 5942 

VRSLLQHAYAELVffifl^ 
ENAEEMYFFSQPAALLVYtfLIEKVGADBVWKKWPLP^ 

SEQ ID 5943 

ATGAGTOATATTGAIX^TTTCAAMGCTAW 

CGGTTGCCMTTTOA^ 

CGTACGTTTTGTGGCATTGCTTGCGGAACAT 

AAAGCAATT 
AAAGTGAGCAACGGA
KVSHG 

OGWK^GTTTGCAGATAGGGAATAGGCTTSftAAJW 
SEQ ID 5952 
ffllTOMlJOTSRCmAVAIRIIW 

S ^S™rSmKaCGC(XXTCCOTAATCOra 

nac 

SEQ ID 5954 

VFWHQAHPVIGISNKELSKLQLTPSQVSRDIDDLVSSGLVIKL 

SEQ ID 5956 

IftTOI/SHTVWTAHHAIHSfflVMDKFSGGE^^ 



SEQ ID 5958 



ABADKLWPJGL 



jkSKBGNKPGBLSGGQQQRVSIMAU4WKEI 




SWSFYYDQFVEAnffiS^ 
ASLYGVGEQYFDVRGLKLE^ 
IAELLKARHGTEDFPMNNSDSIRQMVESTO 
SAASVIGAVACSTGIGLAFGFMPAMKAAKUIPIDMAQD 

SEQ ID 5959 

AT(&GCCGTCAATCCCTCm^ 

CAOTTTTTCGT^ 
SEQ ID 5960 

HSRQSI4QAAESRRSVY5UW 
SEQ ID 5961 

ATGOTGAGCCGGGACGCGCGCTGAITCA^ 

SEQ ID 5962 

MFEPGRAHIIKBHTHiGRPAEIAEKELfiSDGGHPKD 
SEQ ID 5963 

ATGGGCGCnTOCCGAAATCGCTGAAAAAGAGTTC^ 
SEQ ID 5964 

SEQ ID 5965 

TTGGAAGTCGGTATCCGCTCGCAAATCAGC^^ 
TTATTTTGGACTGCGAGCAACTGCGCGAAAGCT^ 
SEQ ID 5966 

LEVGIRSQISRLRDHFE^ 

SEQ ID 5967 

ATGAAACAAGCGCAGCCTAAGC^ 

TTGACGATGTCX&CCGTATGACTATTAAGGAGCTGC^^ 

GGAAAAGCTGTCGAAAAAGCAGACC^TGTCAAAGAGOT 

GCCGATTGO^ATATGTTCX3AGCA(^TGGCGGC^ 

AAOTATGCOCTACO&tt^ 

GGAAGCAATGGGGCGCGOCGCACGCTCTGCATATC^ 

CGGCGCAGCGATGCGGGCAAGACGGAGOTTAAACCGGM 

AMTGCTGCGCGCCAAC<X3GAAAATCGAGGCGGCG(XiGA 

aaarmrGCAGc^^ 

CTOGGGTGATGAAGGAAGAGGAGTTTTATAAAAACAAGOC^^ 

GTTATTACTTCGGCGGCGAAACCAGCGCGAACCTC^^ 

OX^CA(mWTACTTC(XXXJGC^ 

ATTGTGGAAACGGCATTTGAGAGCGGGTTGCGCTT^ 

ACGGCATGACCCGCTATCAGGCGTCGAACAA 

GGATTTGGAAATCCGCTTO&CGGACGGGW 

MCGGGGCGCGGGTCGCCACTTATGACGCGGAGGCT 

CAAAATCGACCCCTACAAACATCAAG^ 
AAGGTCGAMTCGCCAAACXX^PTAAAACCCCGCGTC^ 
AGGCGTTTGAACGCACCCGGACAAGATGCCG^ 
SEQ ID 5968 
MRQAQPKlfflliGKSKtLEliLVH)DDT^ 
ADCAICSSRVmRTARRMDLTTARRWSARSIKL™ 
RRSDA(KTELKPEEAKLISAVLAEAMRRNGK^ 
LRVMKEEEFYKNKPKNVVKI^ 
IVETAFESGliRFTEVHDI^^ 
NGAmTYDAEQIEVWSVFEVVP^ 

SEQ ID 5969 

TTGCCGACGCGGTTTGCGTGCCG^ 
CEATOOTACTGTCTGCGGCTTCG^ 
SEQ ID 5970 
LPTRFACRVGLPVRPEASEX^ 

SEQ ID 5971 

ATGCCTCGGGACGGAC(KK3CAMCCGACAC^ 
CX^AGACCGATCAGGCMTCAAJEACGMC^ 

SEQ ID 5972 

MPRIX3RANRHGTQTASAKMPSEKIQTAS^ 

SEQ ID 5973 

ATCGGCGGCGCATTGGAMTC^TGACGGGCTTO 

SEQ ID 5974 

SEQ ID 5975 

TO&TTGGTGCMTACGGAGGAAGTTC^ 
ATCTGOTAAAACATOACGACO^ 
TAAAAAGGCGGCGAGACGGGCTTTGTOGTCGTGGOT

GGCCGATGC(^TAAGGCGTGOGGCCGTTCTG^ 

CGGGGGAAAAGTCGTOIt^ 

AnMGGAMTCGGTrAMTCGATACX^TTGGTGMAGCTO 

SEQ ID 5976 

IJGMRRKSVFFPALGSLHAYHKAELVGRYHStO^ 
GRCTJKACGRSDEQVLAFAHKI^^ 
SEQ ID 5977 

TTTTCGTCAGCCTGCACGAATTCGGACT^ 
ATGGTGCCTCGCCC(X&TTCCGTTGGGOGOT 

TTGCO3CCXm0C«3CTT^^ 
CGTCXjGCGTTCAGACGGCATCGGGCGCGC^ 
ACAACCGTTGCCGGCGGCGTGGAAAAAG(£AGCrc 
TGACCCGCCAAAGCCCGGGCAAAAAMTCA^^ 
CGGCCTCCGTCCGCAGCCG(^(m^^ 
CPCmTTmCGGCAAACTAATCAGCGGCA^ 
TTTTAGCGTTGGTCAGCATCAGCCTC^ 
TGTCCAAAAGATCGGTlTCftCTTC 
SEQ ID 5978 

MPGHAHKRGHI1SMCADTVRTPI1SKBNHI1QTLLAF 
AIVAAGPLTmAUmyGLSPSPGVTELRmG^ 
TWAGGVEKGSPAERAGLKKDRLTAADGKPI^ 
LKFFGKLISGNASVSHISGPLTIADIAGQSAE^^ 

SEQ ID 5979 

ATGQCAAAAATGATGAAATGGGCGGCTGTTGCGGC^ 

GCGATATCAGCC(^CGGTTT(X:(XX^CGGGCGAGA 

AMGG(£GATTTGATTG<XGAAAT^ 

GCGGAGAAGAAATATAAGCGTCAGG(XX£GTTGT^^ 

CTTTMTCAGACA(^GCAmTTTCCATCM 

GAACGCGGCXKAGTCTAOKmCGATaX^^ 

ACGATTTTGOra^CC^ATACG(XX^TT^ 

ATTATTATGCCCGTTCGTTTGTGCCGMTC^ 

GAAAAATCGCGGCXJGCAAGGCGTTCGTACXjCGTGTTGGGT^^ 

SEQ ID 5980 

MAKMMKJ^VAAVAAAAVWGGWSYLKPEPQAAnTBTVRR 
AEKKYKRQAALWKDDATSKED^ 
TIIiJEPDTPIKAKIiDSVDPGLT 
EGDKWISEITAAEQQESGERALGGPPRR 

SEQ ID 5981 

TTGTTGTTTGAATmAAGAGMTAAAATGATOT 

ATCTGTAT^GACAGGGTAACCGM 

GGGGATTTCAGACGGCATGGTTGCCGACGCGC^ 

TTAAAGCATGAGTTTCATCGTGCGGGTATCGGATTTTO 

GGTOGGGATTGTTGTGGAAGACAGGCATCGTGC^ 

TTTGACGAATCCGAAAOTTTTCCTG^^ 

ATCGGTACGTTTGMCGGGCATATGGCGAGGTTGCGGCT^ 

SEQ ID 5982 

LLFEFKENKMIWSRWPIjIiE^ 

UHEFHRAGIGFSSPWtfS^^ 
IGTFERAYGEVAALUKGKAPV 

SEQ ID 5983 

TTGCCTCGCCTTGCCGTACTATTO 

AAAAGTTTGAATATGATTCGATTACGTCT^ 

GGCACGCCCACGGGGAATCGAACCCCGGTTAC^^ 

SEQ ID 5984 

LPRIAVLFVLSGASSHCPDU^^ c 

SEQ ID 5985 

ATGAAAAATGGTTTTMTTGTTTT^^ 
TCGGACAAAATATAGTGGAT 

SEQ ID 5986 

HKNGFKFRTITQIWYYTL^ 

SEQ ID 5987 

ATGCAGAATATTTTTGACCCOT^TTA 

GCATCGGAACGATTGCCAGTGTAGATTTGCGCCACOT 

AATCCAAAAAG(XAAMGTGCTTCAGAGGGAAAA^ 
TOATGGGTGCCGGOTXmm
TGAAAAAAG<£ATATTGrc 
TGAGGAAACGTTTGAAGTGTTTAAAMT^^ 
TC(X£CGTTCAAATCGCTACG(OT 

(^GCTGCGGTCTGCGCGACGGGCTI^^ 
CCCTTCGGTMTGCCA^ 

SEQ ID 5988

MQNIFDPIJIRGKSIJPIVQG{MGVW^ 
VKGAGLPLDLPEMTEGYHKBVAIJ^ 
SAVQIGTAPAVTEEGDAHLNFKKTIiASAM 
PFGNAIRTVRETIQYLLTG SEFVATLGR 

SEQ ID 5989

ATGCCGTCOJAAjGG^^ 

SEQ ID 5990

SEQ ID 5991

ITCCAAACAAAATGCCGTCXKMGGGCTTTCAGAC^^ 
TTATGTCGGACATACAACACAMTGCCGTC^^ 

SEQ ID 5992 

LQTKCRPKGFQTAFFRLC^ 

SEQ ID 5993

ATGCaSTCTGAAGCCm^ 
SEQ ID 5994 

MPSEAFQTAFPRLPFGMMLKNKRDEIHLFSSDKSDRHAE 
SEQ ID 5995

TTCTTATATGCAATGTC^ 
SEQ ID 5996

LLYAMSGRHCMIRIUK^CRTKINVSHPVTOTSFRKAA 

SEQ ID 5997 

GTGTCTCAAMCAATCCAATCAAGCCTTC^ 

CGACAGATTTGGTTACGGACGATTATTATM^ . 

(^TOGGCAAMCTGTTTGT^ 

GCGCAGAACGGCAGGGCX£MTATGAGGC<X^ 

GCCAGGGCMTGCGGTCGATTT(^C^ 

SEQ ID 5998 

VSQIWPIKPVMOIVWPWVI^^ 
AQNGRAEYB&VFEniPPAtfflWYTO^ 
SEQ ID 5999 
TOTACAATAGCGGGAATATATTT^^ 
CaTCAATCAAAAATCTMGGMTCCACMmCCA 
TCCGCGTGCCAGCGTGATTCAMTCCATCC^ 
GTCATK£GTGGTTCAACTG^ 

AGGO&TAGAAACAAAC^C^ 

GCCGOTGGTTTGTCCXCAT^^ 

TGCGCGAAAAAGTGTGCCTGMAIPGTGTCOT 

GAAAACGGTCMCMGGMGAGGCGGGTTTGGGCGACTGCATCAAC^^ 

GCCGCCTCTATCGACGCXJPGCGA^^ 

GGCTGCTCAGACCGCGC(7rGGC^ 

AATGGTGOGCGAAMCGCCAAAGGTTGOTXjGAAMCG^ 

ACCGGGCTGCCTOAGGCOT^ 

MTAOTGGAAAGCGGTGCATCAGACGGCAAGCC^^ 
SEQ ID 6000 

LYKSGNIFWEKRIFSKLETQCLKKTRKOT 

VIPWFHWSGRQAVVPHIPE^ 

AGWFVPIRQFVPDLFTGAAGGGAMFAAAFYGFMTOT 

AACIDACDEIMDKMGYPSGIJRYTT 

TGLPEGGIKVAPRfJTITLPVQTO 
SEQ ID 6001 

TTGTACAACmATCCGO^^ 
SEQ ID 6002 

LYKPHRRPDVLGEnSHIflQ^^ 

SEQ ID 6003 

ATGCTTTCAAGAAAAGGGCGGACGCGGGtXMG^ 
rcTGTGCCGGATTGTTGATAGCGTTTGOT 

SEQ ID 6004 

MLSRKGRTRGKAAAGGGIWGRIFDLRAKPAIIPFEIKRYE^ 
ATGrcTCAACTCmMCOTMTCCG^
TOTK2MTCACMTCCCGCCAATCCCAM 

GQCATGGCAmraGAAAAAA^ 

TGAAAOTACX3GCTGGCACGTCGTTCC(^^ 

TTAATCGGCAAAGGCAGCGCCAACAAAGAAGGCAGC^ 

TCCCGCAAGAAAm^ 

GCGCC(XATGGATAAMAGCTGCCGGAAAAOTr^ 

ATCTTGGCAAAAGAGCTGCCCGAACTGCjTGGGCGGTTC^^ 

ACGGCGTGCGCGAGTTCGGTATGGGTGCXaATTAI^^ 

GGC(mCTGATGAAAATCAACCa 

ATG<aCGTAK^ 

GAGCGAGCAA^^ 

^GT^GCCG(^CACCMCGGCTGOT 
TCA(XXTPGGACMTGTG^^ 

QJMAEniAAEFH^ 

LIGKGSANKEGSHKTHGAPLGM)^^ 

ILAKELPEL\TCGSADLTPSm J TDWSNSVSVrmGGM 



AVEAGHTNGWYKYVGLNGAWGI^^ 

SEQ ID 6007 

GTGATTCXXIAAATATTATMTGGAGATTGGCGGAAT^ 

SEQ ID 6008 



SEQ ID 6009 

ATGTTTTCAMfTCTCTGTA 
TTATACAATCT(XTmXCGAA^ 

SEQ ID 6010 

MFSILCMRQSHKPEADST^^ 

GCTGCTTGCCGGTATTTTAGAMTTGTCGGC^^ 
TOmAATGCTITOTOC^ 
AGCGTAAAGCC 
SEQ ID 6012

SGNAIiFPIANG^VLFC^ 

SEQ ID 6013 

ATGGGt^TGTCGAAGTCC^ 
TGAAAAAACXXX3CCGCOT 
AGTGGmGGCAGACCGO^mSOTAC^ 
' (^TCTGAACCAC(XXX^ 
ACARA&AAGCCCJAQCCTTCGCXCX 
A(K3CO?Krimra 

ACGM^GCTCGCTGQC^ 

GAC6ATGGTGTGCTG(X^GTGTTCGGCAAC6ACGTO 

CGCCTCTTGGGCGACGCGTGCMCAGCTT^^ 

GCAAAATCGGmOAAAAC^^ 

CGTCCCTGCCGATATGGTOATCCGCGC 



LSGLPFVSAPNKFEMjGGRDMVAA^ 
RHiGDACNSFNENCAAGIEPVPmDYFLHHSI^VT 

OOTCtmAAlCSTO 

GACCGGATCACKKXXX^C^ 

TGT6TGCGGCAG6CAGTATGGCCTGGGTGTGTO 
TG(^GGCTTCCAAA6TCAGCGCGGTAACA^
OTKPCH^KXmCTGGTCG 

SEQ ID 6016 

MENQRPLLGFAIAIJjAA^ 

D^AAQKIGLVUiLVGLI^^ 

WIASKVSAVTTI&FVFTVIFSL^^ 

SEQ ID 6017

TTGACGACGACATCCCtSTCTGAA^ 
TTGTT^CQCCCCCTGCGTTGGGCGTAACGT^^ 
TTTTCAAATAGTGCGAAGCMTAGTCXjGGCTTC^ 
CAATACGGTTGraAAAAATAGGGAAATC^^ 
SEQ ID 6018 

LVAAPCVGRWAIAPFGRPEASDA^ 

ffiGRS 
SEQ ID 6019

ATCTCMTCAACMAG^ 

GCAACQGCCAGCGCOTAGAGCGCACCGAATGGC^ 

AAGCCGCAAATACCAAGGCAAAGACGGCATCGMCGCACCGC^ 

TACGGTCAGAGTCAGGAGGOTA(XAACGCCC^^ 

CCGCCCCGGTCGAGGAI^TTGACGAC^^ 

SEQ ID 6020 

KSLNKVILIGRLC^PEVRYM^^ 

YGQSQEAYQRPA^RQPAPDAPSHPQEAPAAPRRQPVPAAAPVEDIDDDIPP 

SEQ ID 6021 

TTGCAACTGTTGATGCCGTCTGAAAMm 

GTAT(nTOTCC^TGCCCGTTTOGmT 

ACTGCCTTTGGGTATCGCTTCCGACAA^ 

GCCGCGCGCGCCATACAGGGCGCGGGGGCGGTCAGTCCTGCCGTAACC^^ 

TGACGTITrcGGTCAGCCTGGTGGTTGCGCCG 

TOTGATCCCGAAGTTTCCAAOT(^ 

GCGCAAATGGCATTGTTTACCGCGCTGCCTTTO 

TGATTATCGTCGGCGAAACGCGCAACAAGCTCAMCAGGOT 

TTATCTGGTTGTTTACTTTATCGGT^ 

TCGCTOGGACTGTTTGCCGGCGGOGCGGCAGGAGGC^^ 

CGOXGCGCCCAA(mX3TCAAAAACC^ 

TTTCGACGGGCAGACCGTCTAf^ 

SEQ ID 6022 

LQLIHPSEKLFMHl^ 

AARAIQGAGAVSAAVTAUiADLTfilX^VRTRAMAMIGLS 

AQMAOTALPFAMTRLGLEKIQH^^ 

SLGLFAGGAAGGIiLFQKCGFAtm^ 

SEQ ID 6023

ATGAAAAAACCGA(XX^TA(XICTACCCGTCMTCTGCAAC^ 
CCGACGATGTGGOTCCGTGATGAGGAGTTCTGTCGGCAG 
ATTCX3TOXm03AG«X3^ 
CAGTATGC(^TAA(XX3GGGTC(X5(X3CGro 
GCACaffCCTGamTTATCGAAAC^ 
CCGCTGGCAGT6GCCT 
SEQ ID 6024 

QYAISGVGARGLMQWFWKNYIGKPAHNLFDIRTNLRYGOT 

SEQ ID 6025 

ATGAAACTGAAACAGATTGCCTCXGCACTG^ 

TATTCAACTACCTGCCCGTCA^ 

GCAGCTTCTGCTGACCGTTATCGMCGOCC^ 

CAATACmAATCAGGCGACACTO^CCAGGa 

TCGACATCGACATCACGATTC&CGA 

CATTTGGACATGGCTGACACGMGCGACCGGTT^^ 

GACATCCAAACCAACGAAGACAAMraGGCAGAC^ 

MCTGGAAAMCIOTGACCATGM 

CGTACAGCCGCTGCCGAACGC(XK5AACC^^ 

CGCTOXXXXffACCXX^^ 

ATCGCAGGACAACCTGTTCGGTA(XX5GC^^ 

CTGGOTACGATATOACGGAAAAGCCTPC^^ 

ACC«^raATTTC(X3GCT^ 

CAAAGGCCTGCTGTACAAAGGC^OCGTO 

CTGCAATACTACTCCOX^^ 
TATGACGAATACGGCGAAAAMTCAGCTACGCKXjGCAACAAAM

GAAAACGOTAACMCAAATCGGTTTACTC 
TCAGCTACGCCTACCCGCTGAAGAAAAMCO^ 

SEQ ID 6026 

MKLKQIASAimGISPl^MJraQDIRVEGLQRTEPST™ 

QYFNQATLNQAVAGLKEEYLGRGKM 

DIQTNEDKTE)(^in!IVHEGGRFRWGKVS 

VVRRELRQMESAPYDTSKLQRSK^^ 

IiGYMYGKAFDPRKASTSVKQYX^ 

LQyySATHNQTWFFPLSKTPTUDiGGBVGIAGGYCK 

ENGNNKSVYSENAHXSTFTM^ 

SEQ ID 6027 

AmTTGCTTCCAACGCXaAA^ 
ACACCGCCC(MjGCATTTCCCGTTT^ 

CCX^ACGGCGTTGCCTC(XXITTGCC^ • 

SEQ ID 6028 

MIASHAKNRAQNPSAAYRPALFGQTL^ 

SEQ ID 6029 

TTGTCGAAGACCCGTTATGTGGTCGAACAAAG^^ 

SEQ ID 6030 

LSKTRYVVEQSFGTLHRKFRYARAAYFGLIKVSAQ SHLKAMCLNLLKAANRLSVPAAA 

SEQ ID 6031 

TTGCGOTXXXnCAGC6G(KX5GTTGCGGCGGGCTTT^^ 
ATAGAAAAGTCCGCATTGCC^CCACCCCA^ 
TGAAATGCAGATMTCAAAATGGCAGGCAAAAK^^ 
TGCCGCCAGCGTTCCGAA 

SEQ ID 6032 

UlLRQRAVMGFAHNAVQQiWQFSVGRILESDITSIVFNRK^ 



SEQ ID 6033 

TTGCimCCGTCATTCCCACSGAA(^ 
TCGAGATTGCGGTGTTGTCGGACGGGTTTCGAGATO 

SEQ ID 6034 

Id^NRHSHGSGNIiGRGWATVI^ 

SEQ ID 6035 

ATGCCGTCTGAACCTTCAGACGGCATTra 
(MX3CAATCTCGAAATTCGTO 
TGACGGTTCAGCAAGCGTAGGCCGG^ 
ATTCCGAGCATAATACTA 

SEQ ID 6036

MPSBPSDGIFNRPPFTGAGRAWKYPl^R^^ 

mm 

SEQ ID 6037 

GTGCWJGCGGATCTGGOTACGCCTACGAGC^ 
ATTTCATCCACCCCAGGGTGTCGGTCG^^ 
AAACAAGAATQUjAATAAGAGAGACCTGAAGACGGAAAAT^ 
CCCTATATCGGTGCGOGCGTCGCCPACGGAC^^ 
ACGGAAGTACXmGACGCCCATCAAGAAAGCGACA^ 
CXmACTGGGGACGOTGGAAAAC^^ 
SEQ ID 6038 
VQADLAYAYEHITRDYPDAAGANQ^ 
mGAmyGHVRHSlDSTKRTimTAA 

SEQ ID 6039 

ATGAAAAACGTTCAAAAAGGOTTACG^ 
CGGJUyGTCAAATCCACCCTGCTGA^ 
CTTAATCAAAim(XXXXKACCAOT^ 
AACCMTOTGCCCCGGCTTCGAC 

SEQ ID 6040 

MKNVQKGFTLLEIfflAVAILGI^ 



SEQ ID 6041 

TTGTTCATCCGCCATATTGTGTTGA^ 

SEQ ID 6042 

IiFIRHIVLKHRPEPDIIRPSTSVKIPFI/TG 
SEQ ID 6043

ATGtfmTCAACAAMAC^ « 

SEQ ID 6044 

MADEQKPVRVAPPRLKGNGSI^^ 

SEQ ID 6045 

TTGOGCTiUiXJGCTTGGGCGTGTCGCGCGAAATC 
GCCTGTCGTTCGGCCATCAAAACCGTTCGGTM 
CTCCCGACGCTTGGGCGMCGCTGGCXJGC^ 
CTGATGTA7TCCGCGCCGAAAGA(XGGCTGC^ 



GTTGCGGCACGACAAAATCTCTT(^^ 
GTGGAAAMGOTTCAAA 

SEQ ID 6046 

I&YGI£VSKEINA(XmH?LYGGIS 
I^SAPKDRLLYGGADWPHOT 
VEKDPK 
SEQ ID 6047 

TT6CGCTTATPTGMATCCTTTTCCACGCTCA 
TTCCAAGAGATTTTGTC(HXXXX3CAAC^ 
CCCO^GCCCCMGCCGCCGTCXSMCG 
OCGGTC?rTrCXjCGCGG^ 

MCCAGCGTTCGCCCAA6CGTCGGGAGAAATCGGCATTGM 
MCGGTOTGATGGCCC^CGACAGGCGCAGGCTCTO 
GCGCGACACQCCCAAGCCGTAGCGCAAACCXn^ 
GATTAACAAAAATCAGGACMGGCGGCGGGCCGCAGGCAGTACGGATC^ 
TACCGGTOTTCT^TCCGCCKPATTOOT 
SEQ ID 6048 
IMiFEILFHMENLRIFACVKTOT 
PVFRRGIHQHIARQRHMAAVSGG(^^ 

ARHAQAVAQTVCLLRQAVGGFRPASARKFMRPFGKSVVYSG P 

SEQ ID 6049

ATGOCGTCCGAACGTra^^ 

SEQ ID 6050 

MPSERPSMlPCCRKHTVGNIREQNQKQPHHIiLSTPAPSE 

SEQ ID 6051 

TTGGAACCCAAATC^CGGAGGCGAGGAWMTGCGGGATT^ 

SEQ ID 6052 

I^KSTEARMFAGLVVMADTG<& 

SEQ ID 6053 

TO(»TTCCAM^3TTTCC(^ 

GCACATCCTO3CAGGMQGGCTTTGGACTO 

CGCAGATTTCCTTCCCAAGGCX^GAAGCGGC^ 

AAAAT6CTGGTTATCGACATCXXXX3GCGGTW^ 

AAAACAAMTCATCGCCAAAGACTTCCMGCOGCC^ 

GGCAAAATCCATCCGCGACGTGCTTGCCGCCX^^ 

TTTGAAAACCTGAAACCGGAACGCATCGAAGTTT^^ 

GCGTGTTTTACC^TTTGATCGGGCGCGGCTTA 

GCAAACCTPTATGGACAGCCTCTGTCATGCCA^ 

ACCGGCTATCACMGCATTCCGCCTACATOT 

TGGGC6GCATTATC66CG6CAACGAAATGTTGT^ 

O^TACG<^GCTCCG(»?rTC^ 

AAAGTTGAAGCCGTC 

SEQ ID 6054 

LGSNSFRLQICESMGQIJWIDSFKQMVRPAAG^ 

KMLVIDIGC&STEFVIGSTLNFDI^ 

FENLRPERIEVFAGGIAVMMAAFEEMKLDRMTVTO 

TCmHSAnLEHADMPGFSRKEQTXIJ^ 

KVEAV 

SEQ ID 6055 

ATGCCGTCTGAAGCCm^ 

ACGCCGAO^TOCAAAmTCGGCmAT^^ 

GCAAAAACTGCAACGCGAAGGCTTC^TTTGGAA^^^ 

AAACAOKJGCAGTTTGAAGAAGACTim 

TOAGGACGTCMTTACGTCAM^^ 

SEQ ID 6056 

MPSEftLQTAFRGN 

KQAQFEEDYNLRRNEEFASLQQNAJ^^ 
ITCAAAAGGAGCGCMTCATGAAAATGCAGGCAGTTGTTGTGAATAA^^
TCGAMMTGCGGCCTGTGCCATACOGA^^ 

CraCATCAftCGACGACAiOT^ 
<mOTAMXmOTAICPaa<XXX^ 
TTTTCGACGGCATCMAAGTGGTCGGCTCTTTGGT^^ 
AGCACCCGCCATTTTCCAAGAMTGCSXAAGG^ 

S SsA^AVWHKNVAGDVEVVERmPLKGEALV^ 
TAIXX^THCIVSADYAVKVPBGIiDPAQASSITCAGVTTYKAIKVSGVRP 
VVTAVSAAAraSATOCVRAKHVTOVGIiPPESMDM 

MAACACCGTCOaG^^ 

AAAAGCCGCCGGGG(M3ACGGAAAM(XGCAGGGGCTGCTGCGCGTM 

(SOTTGGAICTGAWraGGACAACC©^^ 
^ipTOXXXTGCTTGCCTCGCCCGACTTCCT^^ 

OCAGTrTCATCGATTTTCTGAACGAAAAAATC^ 
SEQ ID 6060
MBOTSLKVFRCVVQSGGFraAADAIiM 

CAGGCMCGGTA^GAAACCGATAGAGAAACCTGT^ 

AACCGAAAGCGGACTCTPTGACGGTCTGTTCGGCGGTTCOTJ^ 

PSmPTKmPFPM^PVKIiPPNPU>m£l^^ 
S ATGfl!!iS 

ag^TCGOTGCGGTO^ 
AH^GCMTTCCTACAAACT^CAAAATO 

C0UIAAAGGCGGATTTGACGAAAGC(MTCBGTGCAGSK3GTGCA^ 

OCTOATCCADCGGTCAWIGGAWPGGTCGGCAI^ 

•RSGCGTAGCATTTGTOATAAAAMGCG^^ 

ATTGCTGAAAACTTTGCGGGAAAACGG 

^H^LAQM^PSmVM^^ 

KBSTBXHEAFTroYA^ 
IAENFAGKR 

SEQ ID 6065

CATCATTGCGGCAATCAA 



CGAAAI 



CCAAAGMGCGCTAAGCmOSX^AAAGACA^^ 
SEQ ID 6069

TTGCGGCAATCAATAATTTTTTCAOT 

GGTTTCCCGTT 
SEQ ID 6070

LRQSIIPSOiILLELIJKVL^ 
SEQ ID 6071

TTGGATTTTTCAAACCTT^^ 

SEQ ID 6072

LDPSNLAEKVRAAGKLKRRGCVLQFSELFS 
SEQ ID 6073 

ATGCTGAAAATACCTTTTGCCGTGTTGGG^ 
AAGTCAAATACATCGACMTACGGCGATTGCCX^^ 

cGaxjTcamsACco^ 

TTTGCCAAACTOTGGGCAATAT^ 

GACGCTATGTGCKX3AAATCGACAGCGAGGG 
SEQ ID 6074 

HLKIPFAVLGGCLLIMCGKSENTAI^ 

FAKLVGNIAEDGGKLTDYLISHSALQF^^ 
SEQ ID 6075 

TGCTGCACCACAGGCGCGCACaCGA(^C^^ 

CCCXXXX3TCAGGACGMTTGCMAAA 

GGTCGMGCGTTCOXAAAAAACAGGCGCAGTTO 

GAAGGTTACGATGTCATTTTGCAGGACGTGATTO 
SEQ ID 6076 

CCTTGAHADTFQHGFIMTl&IYI£^ 

EGYDVILQDYHVNTQ 
SEQ ID 6077 

GTGTACCGCCACGTCAAATCAGTGGTGATGATCG^ 

AAAGCAWXIAGTTGGACTITCGGCACAACCAAAOT^ 

GTGTGCGCO^CamTITOCTGMTC^ ■ 
ACMCGTGCGTGCCGMTACTTTTTTCGC^ 
TGGTCACGCCGGCACMGTMTGCT(X3MGCTTGCGCAG^ 
CGCGTTCAATACGGMCGGCACAGGGTTTCGCGGCC^ 
TCX^MCTTCTmACC^C^^ 
CCAATGCCTCGCCGTATTCCAACGGGCGAAOT 
AAACCTCTAAACGGAATGTTGTCCGATATM^ 
TGGACGG(X3GCCGGATGCGGGGGCGGCAGGTT^^ 
CGGAGGATTAGTCGAAGCCG6GGCAGGATTGGTTTCT 
MGGGTGAAGTGGTCGGGGCTGGCTTTGAT 
SEQ ID 6078 
VYRHVKSWMIAAAFFFHIDHHTAGDFAFAH^ 
VCAAGFFLNHFGSVLFGGVDNQIRAGFFGKC^^ 
RVQYGTAQGFMGIAVFARAAALEQPGDADAVADFQIFHTONFFNQTDAL 
KPVNGMLSDIRVSIRFRILRVKWY^^ 
KGEWGAGFD 

SEQ ID 6079 

ATGAGCAAAAMCGA(OTCTGACCG<£^ 

TCTTCCTOmATTACaCGGCATCATC^ 

CTACCGCCAAAGCGACATTCCCGMGTGATGGAGTO 

AACGGGCAGGAAGACCCCGATTTCGGTGTGGAAATGGGTT^^ 

AACACGTCGAAATGG(XjCGCGACATTGCCGGC^^ 

CAAAATGTCCAAATCCTACGGCAATAttlATTCCGC^^ 

GAAAGOCCTTTGTTTGAAATCTATAAAGCCT^^ 

TCMCGCCGAATO»GAACTGCGCGMCGOT 

GGATAMGTACGC^TCCGGT(XGCATCCGCCCG^^ 

SEQ ID 6080 

MSKKRVLTGVTKOTPHMNYVGAIRPAVRAAQ 
NGQEDPDFGVEHGIiFSYK^ 
ESPIiFmKAFSTPSETAEFTQMLAIXJU^ 
SEQ ID 6081 
ATGCCGTOTGAAATGCCGTC^ 
CGCGCTTTTTGCG(XX£TGCCT^ 
TCGCCTCGCCCCAAGCCMGCCGTCGGCAAGCATTTG^ 
OGGCTCTTTCATGTTGGTAAT^ 
AGTTCGACGTTm^TCGATTTra 
G^OTCGTTGGCGTTGAA^ 
GGCTTTGTAGGC<mXXXX^^ 
AGTOX3(^G(^GCCAAGTGGCGGCAACGGOT^

TTTGCGCCGCGCGGACG^^ 

ATGCCGTCTGAAAGGGAAAAGATGCGCCG^ 
SEQ ID 6082 

MPSWSETWQAEVRTALGLFQRADADRIOTIQ^ 

RLFHVGNDFVDRFLGFFV^ 

GFVGVGAVHQTLGGDAGQNPVQLHHFGNVALA^ 

MPSEREKMRRLYPICHIHPADNRLPKIKKI 
SEQ ID 6083 

ATGCGATACGACAAAOTAACC^^ 

ACCAAAACAGCGGAGCTGCCGCGCTCTTGGCra 

TCTGCCCAGCCGC6AATTGCA66CGGTGTO 

GCCXSGCAAAATOTGAAAGMGCCGCTK^^ 

AAAMTACACGCTOjATCTGACCCAACGCGCCCGAGA 

CGTGCTGATTGGCGAACCCGGT(?rcGGTAAAACCGC^ 

GCGGCTTTGATTGCC(^TGCGAAATAC^ 

CGAAAMGACGCGGCAOTGAACGCCGITTCCAAAMGT^ 

GATATOCCGA(XCTGCTATCGTTGC^ 

AAAAAGAAACCAAACCGGAAGCTATGGACAAAATCGA(XGCC^^ 

AGACGAGGAMTCGACGGCTTGCAAAMGMTACGCCGA 

AAAATCGAACAGGCAAAACGGC^AGGCGATTTGGCACTGG^^ 

MCCGGCAAACAAACTCTTGCGTMTMTGTCGGCGCGGA 

GATGGAASAAGTATTGCACCGCCGCCT^^ 

CTGTTCTTAGGCCCGACCGGTOTGGGTAAGACCGA 

CCGTTGCCCGCTTAATCGGCGC^ 

C^TCCX^ACGTATimCATCCTGCT^^ 

CAGCACATCCAACmTCGGCACGCAGG^ 

ATGGCTTGGATCAGGCCMTATCKX£AGT^^ 

CAAAGCC(WTTTCGACCCGATTTACGGCGCACGTO 

AASGTAGAAGCCGACGGCGACAGACTGAAATTTQCC I 
SEQ ID 6084 

MRYDKLTAKFQQALAEAQSLAL^ 

AGKIIiKEAGATEQNINAAIDAVRGGQNVNDA^ 

AAmGAmGEPEERIiKGVLNDLAKDDOT 

DITDPAIVAAASLSDRnTDRPLPDKAIDIJDEAASRVKMEKET^ 

KIEQMttQGDLAIASKLMYEDIiEHfcE^^^ 

LFLGPTGVGKTBimLAGFLro 

QHIQQMGTQDYEAVKEVVMEDVKffl^ 

KVEADGDRLKFA 
SEQ ID 6085 

ATGC&TAAATTTCCCAAGTCTG^ 

GCAACCCKXXCCGTimXS^ 

TGTTCACTACTATCAGAACAAAGGTTTGCGCGATATTA 

AlOT^TTCCCffiXCCCGACTACC^^ 

AAGCCAAAATCACGTCCAAAACCAAAGCCATCGTCGTC^ 

6ATTATTTTC6CCGAC6A6ATTTACGACAAAATCOT 

CGTGTAGCCGGGTTCCGCCAAGGCTGGATGGT^ 

AGCACGCGATTCAGACGGCA3'TG(^^ 

TOCTGCGTCAAACtt&TCGtXX^ 

GTACAGGGAACGGGGTTTAATTGGATCAAACCCGACCATTT^^ 

AG 

SEQ ID 6086 

HDKFPKSAKUJHVCYDIRGPVHKKAI^ 

MPAPDYPIMAAATLAGGTV^^ 

RVAGFRQGWMVLNGPKHHAKGYIEGLIM^SMRLCANTH4QH^ 

VQGTGRMKPDHFRIVTLPYVHQIEEAMGRIJ^LQTYHQ 
SEQ ID 6087 

TTGGGCAATGGGTGGGAATGTACTCTTTTTC^^ 

A(^ATC(X5TATCCACTTOTC^ 

TTCCTTGCOXXGOTACTTTM 
SEQ ID 6088 

LGNGWBCTLFHAKFKAS^ 
SEQ ID 6089 

ATGCOTATGTCMTATTAAAGT^^ 

C^CGGTTGTOT^TTGACGAAGTGGATACOG^ 

SEQ ID 6090 

MPYVHIKVTGGKEAPTAAQKAELIGGVIEUjARVI/M 
SEQ ID 6091

ATGATTCOamCCTG^ 
GCTTCCTCGCCWflCCGAAATJOAJ^ 
mTCTCTATTOXmAAGTCGCG05^ 
GCGMCGCCTACATCGGCGCGM 
TTTATTACGGCTGCftCACTCGGCAfiJOOT 
GGTAAOGCTGGGCGACGACGTAGAAATQSGCTCGAAC^^ 
TGCAAMKXXmra^CACCGTCATC^ 
ACAAAACCACCATCGGCGOX^^ 
CCACCGCTTAAGCG^TGAACAAACGOT^ 

SEQ ID 6092 

ffipATCTLSQITARLGGEWRGEDISTO 

iNAttGANTVUEGCR^^ 

CKIGSHTVIAAKTCISGSVTIGSYCIIGGGVGTVGHIEI^ 

SEQ ID 6093 

AT(£CGTCTGAACGCT^^ 
AWflCAGGACAAHXX^^ 

SEQ ID 6094 

MPSERCSMIFYGFGFDFNLFajKYRimU 

SEQ ID 6095 

TTGAAATATTGCTATCTTATCAAACAAG?IT6C(^^ 

CATATTTTGTTAfTAGCA 

SEQ ID 6096 

I^CYIJKQVASAQ^ASFSEKFKSIQKDSFQAANPSSHDTYFVIA 

SEQ ID 6097 

ATGMGAAAACATTGACACTGCTCGCCGTTTCC^ 

GCTACGGCGAATTCCCCGAACTCGAACCOT 

CTACCAATACCGCAGCAACCCT^ 

CCmCGCAAGCTATTGCGMCmCCOT 

OXWaOATCCCGCCAAaT^ 

CC(XAGCAAAACGCACAmCCGMGCCCAA(m^ 

GATTTCCCCGATCAAAGCCTGTGCC^^ j 

SEQ ID 6098 

MKKTI/TUAVSAI^ 

PDASYCEQTBMFGKNIVNVGHESAD^ 

DFPDQSLCQKQANOTLTFQIGHSHH 

SEQ ID 6099 

TTGATGTTTGAACAAAGGATGCCGTCTGAAAC^^ 

SEQ ID 6100

I^EQRHPSmSDGIFV9TEUCVRMAM£G 

SEQ ID 6101 

ATGTCGTGGATGGTGGATGCCTATGAAAACGTGGTGAAACATTO 

^CCTCGCC&mCCAA^ 

CGCGAAAGTCGTTACG^TCTACTGTTCA^ 

GGCAAAGAAATOCCAAOXXXjAAOT 

AAGGCGCAAACCKXXXGACTACGCCCGAW 

GCAAAACCT(X1AAGOTATTATTGGGAGTTT^ 

ACC^CGTOTATATGAWjAGTATCCGOT 

SEQ ID 6102 

MSWMVnAYENVVKHSAPGVFTGKPVEFGG 



TAS^IRRVEKAMKLRGWY 

SEQ ID 6103

GTGCCAAAAACAGGCGAACTACACAAOTT^^ 

CAATACCAGCCGCGCAGCTTCATCGCTTT^^^ 

GCAGGACGACGATTTCTTTTIOT^^ 

GACmTACGCCGTTTK^TGCA«^TAACGT^ 

MTGCGCAAGGGGCGAGTACGKXATATCCAAAG^ 

(X^TATCCAAGCCGTTTTCATTC^ 

GGCGTAGGTTGCGCCTTTCACATCT^^ 

CCGGTAAATACGCCCGGCGCGGMTGTTTCAC^^ 

AGCGGGGCMTCGCTTCGGCATMGCGCGGGOGA!!^ 

SEQ ID 6104 

VPKTGEiaNFNtfNRPF^^ 



LRHKRGRCVPVPPRNNEADPR 
CGCGCGATTA
TTATGTCGTGGAT0GTGGAT6CCTA 



CPADRHKKSIPPPW 
GIGKFNAVAKVIXjQVAVBAVIMCAKKVV 



(acCGTCCG cc<xx^ 



OOCGCGCMGA^^ 



SEQ ID 6112 _^„ TODM0D , ramW roiqnmim^A^ 



MAEQLVGVT^EKGVSRTOVDlKQALE«mV 
SEQ ID 6113
SEQ ID 6115

ATGMTGCAGCCATCGAACACGTCCAAGCOGTCGC 
CGCTGCCCGCCMAGran^^MAGOT 
GTOTATATGAAATACT^^ 
AACGAAAIOTK^^ 
Ca^(nTTTGOTATCGATGCCGCC^ 
GAttKTlOTCTC^^ 
SEQ ID 6116 

MNAMEHVpAVAPDIiDGTLCDSVPDIAAAAEAMLEQI^ 
NEILAAELLICQLGIiADYFSLILGGDSIiPEKK^ 

SEQ ID 6117

ATGAAACCGCAAAAATCO^ 
ACGTGTTAAAC6AATT!TGCCGMOGCAACTG<X^ 
GGGCATAGATGAAAAAACCAGCCGCAACC^^ 
AAAC^GOGasmCOXXSCCT 
SEQ ID 6118

KQARPIAYRSPDADTVQTALKHAWDSnJBDSC 
SEQ ID 6119 

ATGTCCGAACAACOXJAAmCACCACGCC^^ 

CACTCGTGTTTTTCTTGGm 
SEQ ID 6120 

MSi^PEKHHASPIEDERKNPVYRH^ 

SEQ ID 6121

ATGACACAACAAC(mACTGCCTTOC^ 
TTGCCTATTCCTOX^ 

CTACACGGGACGCACCTCGATGGAAATCGGCATCCGTG^ 

aEGTOCCCGTO^^ 
SEQ ID 6122 

MTQQRQLPSHmMSE^ 

PVPVPPLEII/TDRQRCRYEKAKKRREI SLQASGDVSCGC 
SEQ ID 6123

AIR3OTGTCCTGTCTTTCAGACGGCA 

TCX3GTCAGGATTTOCAGO(MAG^OGGGGACG6G^ 

GGATGCCGATTTCQTCGAGGTGCGTCCCGTGTAGT^ 

AGTTCGTGGGAAGGCAGTTGGCGTTGTO 

CTATCGAGGGATAGTCTGCCGAAATGGGCCGCXX&T^^ 

CAMCGGCGAGCATGAATCCGGCAACCG^^ 
SEQ ID 6124 

MLVLSFRRHRPPSA^GHVSGGLEADFSAPPGFFVAAALAVGQ^ 

IMVTAGAGIGNLVEQEQEFMVYVAAEIGGVRHHQ^ 

QTASMNPATARPIK 
SEQ ID 6125 

TTGAAGCC(KSAGAGTTTGATGT6G(^ . 
TCCCGCCCCGGTGCGGCAGGATTTTGCAC^ 
SEQ ID 6126 

SEQ ID 6127 

ATGCCGTCTGAMOCGCOCTCCCCCTTO 

GTTCGCC^TTATCGGCATTFGGATTGACGATG^ 

CTACACGGCGGCATCGCCTGCCGG6CTO 

GAATTGGACAAACACGG&TCX3CGTTAA(^^ 

GGCTa&CCGOTTTC^ 

(X£<mMa3aW(&AAC<£ 

TATTmGTGTOOTGCATCTC 

ftCGCGGATGACGACCCTTCAGACGGCAATC^ . 

CAGCCTCCACCCCGCCCTGAGAC^ 

AACCAAGGCAACTGGCGGCITGCCGCCTC^^ 

T(^GCCX^CACCGTCCACACGCC^^ 

SEQ ID 6128 

MPSETAIiPLYAOTRA^V^ 
HJ)KHGIALTFVNDRSTO^ 
YlSVGCISPHIiLARESLEWttJJA^ 
NQGNWRIAASRHTCPDIAAA^^ 
SEQ ID 6129 
AGGMTACGACAGACATGGACGTACAACTCCCXM^^ 
CCCFGACOGOMC^^ 
GGCGATPTK3A(mAGGCGGGC^

CTGAC(^GCCGGCGCGGCATCGGCAMTT(^^ 

CTGAAAGTTTCAGACGACCTIAA^ 

LTSRRGIGKmVAKVD(^ 
SEQ ID 6131 

AGACGGCATTTTTAACC 

SEQ ID 6132 

LQTOEIKQIPKHSQTNLXAPSIiRNDLKl^PPKQ 

SEQ ID 6133 

ATGCCGTCIXSAAAGGTCTOAG^ 

SEQ ID 6134 

MPS^SSDGIFVIMWSNBSBGTGRINLFDCVWm 

GCGGTCAGGAAAGAGGAATTGTGCT(X^TGATGA^ 

SEQ ID 6136

LTHLEMKLRSGSIKTHQS^^ 

TGAAACAMCXjACACITPTGGCAGCTT^ 
«PCT6CACCTGTCGTOC(K^ 

CCGCAATCTCCTGCCGCTI^^ 
(XGGCAATGCCGGACAACCCGTTTTGGCGG^^ 
GTACGGGCACMCCAAAAATTGCT6GTCGGCGAAGGTO 
MCGGCAAA(X!GGTTAACC(X3AACAGCTATATCGOT 

SEQ ID 6138 



SAPAVSGTYVPSYAPVDINAATHTIWGDTVYNISKRyM 

PQSPAASPSOTRSVGGIVWQRPTQGKVVADFGGG^ 

NGKPVNPNSYIAP 

GTCtStI^GGTATCMC^^ 
ATGCTTCCGC&CGCTATTGGGCAA 
GGAAGATGKAGGGGAATCAAGATTACCCGTTTGGGCAGA^ 
TCCGATCGGGMfiAGGGQ^ 
OGGAC 
SEQ ID 6140 



SDREEGTDFGECGAGFITVTPLQrDI/TAYPDMAETAAFW^ 

GGTCAGCAATTCGN 

SEQ ID 6142 

MNVLISNDKYIABG^^ 

TCATCGGGATTCTCCTGAACGGCGGGTTC^^ 

TTCCAGCAGTTGGGACAGCAGGTCTfflTT^^ 
TCGCGGCTTTGGGAMTGTCCAGCCGTTC^ 
GCGCGTCATCCACGGATTCGCCGCCC^ 
QCTGTTGCGTTGCGAAATCTGGTT^ 
SEQ ID 6144 

AVALRKLVKQFD 

{^(XiGCATAACCGGTTTGCCGTTGCCGACA^ 
CTGCCTTGCTTTTTTGTO

CAGATTTCGCAACGCMCAGCC^ 

GCGA^TCCGTGGXTGACGCGCATTTTGAAGA^ 

GGACATTTCCCAMGCCGCC^ 

CTGCTGTCCCAACTGC^ 

OGGATTACGCTTTGGTGG!K36A^ 

CGTTCAGGAGAATCCCX^TGAAAGCCTGACGGTGGAAGG 

CTGATTATGGAAGAATTGCAAACCATCCCCGATGT^^ 

TGCCCGAAGAA 
SEQ ID 6146 

LRLNLRNTKIMDFSWIjAEPHM 
. EGHNRFAVADNQKKHAPFWGVVAQIIJII&^ 

QISQRNSRKNDHGSSWW^^ 

LI^QIiLBTGGLDI^ALRQPLVLPDGTTALGA^ 

LD!EEI£TIEDVGDFADFHG^ 

SEQ ID 6147 

CTGAATTGTATTTATTGAAmATCmATCM 
CCGCTmTAT^TAGCATA 
MTMTAGCGGAGGAAAGTTT(^CCACAACCAATTO 
TGAAAGTCTTTCCAAAAAAGCCCAAATCTACA^ 
TTGmTAGTmAAAfiGTGCAATAAAAGM 
GATCAAGTGATTGCGGTCTGCGATGAAATOG^^ 
SEQ ID 6148 

LKRQFKNKESRVSVNYKECCSIFSDFRM^ 
Nl&GGKFIHNQLVAGLGFGFWRYIiFA^ 
DQ7IAVCDEIDKL 
SEQ ID 6149 

TTGCGTrTCCCACACTOXX^ 

GCAAGCAAGCAAGCAAGCAAfiCAAAA . 
SEQ ID 6150

LRFPHFPDKYTEEDSMKTKTVKQASKQASK^ j 

SEQ ID 6151

TTGAAACATTTCAAMTAAGCCCATGCOT 
TGAAAGAOCCCGAGCAGAGCAGTAAGCXCGCCCGC^ 
SEQ ID 6152 

I^KHFKISPCRPHII»SIX3MAPCHCPDKIiL^^ • 

SEQ ID 6153 

ATCttTGGAAAACGGAAAAACAT^^ 

Aam^TCX^ATGCCGAGCTGAC^^ 

GQC^CGCATQGTC^TTTTTCX^TGGTCGG^ 

ATTGTTACGGCGGCGGAAGTCGTGGTCGAAGATTTGCAC^ 

AGAAmGACGCTGGATmrCCCTGAAAGCT 

CTTGGCGGATGCXjAGAAAMCGACGGTTGCGGACAT^^ 

ACCX^TAAAGAGGGCAAGACCAAAGGCTACGCGCCGATA^ 

SEQ ID 6154 

MMENGKTFPK^ALKSAGWHLI»IS^ 

OTAAEVVVH)li{RAPBGLQSIjSWFGIPRIAI^ 

TDKBGNTKGYAPIDGFIIP 

SEQ ID 6155 

ATGCCGTCTGAATGG(^GGGTTTCAGACX3GTG^ 

SEQ ID 6156 

MPSE^QGFRRCRILSSLYLLYNAPCFYGYP 

SEQ ID 6157 

rTCAAAGCGGmTCOCT^ 

CGG(mMTTTOGCCTATGTCCGCAA^ 

ATOTGGCGTTGGTATTTTTTAGTTAAAGAA^ 

GT&XKTGGTTTGCC^^ 

GGMTTTCCCGTCCGCGATG^^ 

CTACGAAGCGGGOTCCCGCAGGATTTGCT^^ 

GTCGGCGCGAAMTTTTTGCCGGCAGCGCG© 

CGCTGtmGCAAGTTOGCAACAGCGGT^ 

ATTGGGCGACGGCTTCGACGAAGGTGT^^ 

GGCAAACGCAGCG0G7TGG(^6GMCGTTTTTO 

AAACCGAAGCCGAAGTCATCGAGGCTGCGAACAATACGGW 

CG6CATCMTACG0GCFT!AATCAGTAATG^^ 

TGCATTGATGCOGGG 
SEQ ID 6158

LKAVI PYTGGHEMNEYSQIjIKHPDISLS^^ 

VRWFAEEARRIDGBVLTSV^^ 

VGAKIFAGSAADIKKLSLEUj^ 

GKRSAIO^PTV^ 

CIDAG 

SEQ ID 6159 

TOCTTOTATATATOTACOT 
GCCCXKjATGTTGCQCTGCC^TCGGA^^ 

SEQ ID 6160

UMYCTLSKGaTFLSKDrai^ 

SEQ ID 6161

TTGTCAGGAMGTGTGGGAAACGCAMGATTTTAATACGGA 

G6ATTTCCGTTAAAATACX3CTTCTTTTO 

TTCTCGCGG1TATCGGGC0GAACGGCTGCGG 

SEQ ID 6162 

I^KCGKIOTFNTDGRDVQNPAAP 

SEQ ID 6163 

TTGtfXXnCGGC^^ 

SEQ ID 6164 

IAVGWFRRFSGVADYFSPRSGPILSAAAQCQL 

SEQ ID 6165 

AT6TTCAGGTTTGG6GTOACGGCAATA6ACC^ 

TOTTTTATGACAGTTCGG^^ 

TAAAAACATGATSAGCXjGGGATATTPGGCTGCAC^^ 

GATMCAGCCCTGMGAAATCGGAGCAGGTTTAMGTTCGCAT^^ 

ACG«XyUAACAAAACC^ 

SEQ ID 6166 

MFWC^^PISmiPIff^^ 

DSSPEEIGAGLKLALSRCRQD^ 

VQPQRDFAAVQIADAHBQnVLA. 

S ATGC^ 

CGATGCCTftCGGCAATOCXSGW^^^ 

SEQ ID 6168 

MPPVQ^AGPDDFRAFSCEMSLSVBVRNLDGGKIA^ 

SEQ ID 6169 

TTGAACAG(XIGGCATATACC^^ 
TAAAAAGAAAATATGTTTCAACACACAG6ACGACACAT 

SEQ ID 6170 

IflmiPASmiRFRHGTGRQR^^ 

SEQ ID 6171

SEQ ID 6172 

LIYRVSFSVEPQPFGAVKSDPIWEGBHPFRIRTAHRAVL 

AGGTT(£GACGGCATC(^ 

SEQ ID 6174 

VQECTRTNVKRKJRII^ISTUSVFTWHLEKF^ 

(XIATATCCGOTACTGTATCGCCTACCGTTra 

TCO&CGttCAAAGG^^ 

TTftKjCGTGCTATTTtXXX^^ 

TTGCCTCXMCGCTATTTOA 

TKXXTOTTTATGGCTATTO 

TCCATCXKKXATTGGTTTGACCTTGACGGCAT^ 

CCTTCCTCAAAATCGCT 

ATTGTTCCCGTTCTTGTra 

TACGGCGGTATGTTGATGGAAAGTTTCCn^ 

ACACCGCCGOSAAGm^^ 

CCTOGCAGTCGGTATGGCGCAOT 

GOTACGCGCGTCGCACGTITrATGATTCAAGACT^^ 

GOTMrTTCCTOACMX^ 
TCCTOTCT

ACCGGAGGAACAAAGTGAAGCA 

SEQ ID 6176 



FGVVPAGAVQIHfPVIiFVSMRRIXSKSDSDI^ 

SIGHWFDLDGIQLTWAIMHGFVASVI^^ 

YGQiLMESra^^ 

IAHAGKySDTIiAKKBV^ 
flfcTCMCAAGCT^ 



SEQ ID 6178

VKHKLASWVHCTIKLIANIM 

SEQ ID 6179 




ATOACCCTCATCC^ 
GTCCGCACGCC^ 

GCGACTACGCCATC&CCGCCro 
TTTTACXXXIAGMCA6A3^^ 



GCCGTATTTCAAGACTT^^ 
SEQ ID 6180 



DCVVGNmFANNASIAGHV^^ 



AVFQDFPAQSraGIIR 
SEQ ID 6181 



MTTTAGTATAGCAGAAGGCTATCTGGCATCTAGTGATGGCT 
TAATAAAGGTCTTTATGTCGATGCAGAOGGCAATGTAACAA^ 
AGATCTTCTTCACATMTCTTTT^^ 
GCAGTCACABCAGGGCTTTGGCAGAAGAAAGTTTGGCATT^ 
AGTGAQTOmTACATTGAAACGGAAAA^ 
SEQ ID 6182



CCGCATCCGCGCAATGT 

SEQ ID 6184 

MPSEARFPWPFCTDDPISSVFIiLSIIRYAfiTTOQHyraASAQC 

S a^SaaamLot 

OTATGATGTATTGGATCCAG«XMm 
TOACGCAAGTTTGACG^^ 

SEQ ID 6186

MRKSRLSRYKQmiE^ 

TCGCCGCTTCTGCCAGAGCTGCCGTTAAAGACATG^ 

CTACTCCATTGATCCACTG^^ 

ftCCACTTCOTATCTACACTTAA^^ 

-K&CGACCAACCC<^^ 

OTATTCGGAAOGATACGCAACAGAACC^ 

CCCAAAACCAATGCGTFTGMGCTGCX^ 

TTSIOTJiAPAIfiimjSI^ 
PKTNAFBAAYKQJYATVD6AV 
SEQ ID 6189

SEQ ID 6190 
SEQ ID 6192

MmjGtKAAVIRPPBQQGOTKPISKEIfiPKaR 

GGCTAAOCGTATGGATGATATTCGCGGCATCGTCCAAGGTGCGGT^ 

ftainiii*wm«» * , if,ni^>pn^» > > » "^AATCATAGATTATTTGTCCC 



TAACCAACAACTTOCTGAAAAA 



HPEOTfIX3P0GGGYPPP0GARDnSVHIK(?reTKTKINTVPQM'FS 

SEQ ID 6196 

MEILMSEKQHEKUIDYLSLTHNFIiKNSR 



(^Ju^vflujjLuituw.iw.v»«n~. ">~~-^>>"«^GCCCTCAGC(X?mTCCCGCCGAACATT 



ScCTOCGACAAAAAC^ 



AAAC0CiMA0CGTT6GMAGAACAACGCG(KGG 



^ATOTAAATCAMOT^ 

TITlTTTATCTCAAATAAMiTAGAGAATGCATACGTCTCOT 
SEQ ID 6202

C^TGCGGCTCAATATGAAACATGGAAAGGTCCT(a 
SEQ ID 6204
SEQ ID 6205

ta 

SEQ ID 6206 

KVYRENIKNlKIMKAIHCQnniU)ILSNKKyFS(^^ 

SEQ ID 6207 

TTGAATTTTAAATTAGATACAMTAATTTC^^ 
TTCATGTAGGGATTATTATAAAAATATCTGCTGAAACMW 
SEQ ID 6208

SEQ ID 6209

GTCAGCOTATPACAGGACAGTOOTTTOCX^ 
CCGCAGCTACGGTOMAGAGCTAAAAMCT 
SEQ ID 6210 

VSIJ^DSAFAVKDGINSARQWADAHPIOT 

SEQ ID 6211 

ATGCAGACIOTAGACAGGGA&ATG^ 
AAGMOTAAGGCAAACCCACAGGGCAAMC^^ 
tt^TACAmCATTGAAMTAC^^ 
SEQ ID 6212

SEQ ID 6213 

ATGTTTTTAGATGATGTAAATOTTTTITOAGAT& 

TTGATATTCTAAAACMTTTGTTGATTACATGATTGM 

TACAAATJCi^GAAACAGAAM 

SEQ ID l^^^^jjj^pj^jj^iYi^gp^i^gjjg [lESYBfiTOimiQFVDXM I «« M f«m^ 

SEQ ID 6215 5 

ATGAGAAAAAGCCGTCTMGCaKJrATAAACAAM 
OTAl^TCTATTG(»TGCGGGCGAAmA{m 

mACGCAAGITO&COXJOT^^ I ! 

SEQ ID 6216 

MRKSRLSRYKQNKLIELra^ ' 

SEQ ID 6217 

ATGGCGTTAGCTGT!K£AACAACAOT 

OSGMTCAGCAGCAGTATCTTACGCCGGTATAAA^ 

CCGMTCTCCATACCGGAGACTTTO^TO 

ACAGGC(^TCC6TCGG(AGCATATOT^^ 

0C 

SEQ ID 6218

MAIJWATTLSACIiGGGGGGTSAPDF^ 
TGESVGSlSVPmGRmGYlsWKNKLQKLYGVYAEGSA 

SEQ ID 6219 

ATGCGGAA(XJAAGCGCCTGAAGACAGAGGCOTAAAGACATOA^ 

ACATCGATGTGGTCTCC^TATOATT^^ 
A 

SEQ ID 6220 

MRKEAPEDRGGKDIEASLDNEALIETEAKPTDIRHVKn 

SEQ ID 6221 



TOTGCTGAACGGTAA&AAC^ 

ACAAAGCGATTTCCAACCTGCCGGAAAAACTGAAG^ 

GAACAGGAAAGCGGCACAGACCCCGAAACTGCCGCATW 

AGGGATT 

SEQ ID 6222 

HNFTRLIiiQVLSTVQK^ 
EQESGTDPETAAWIMEYRIiPASIGDI 

SEQ ID 6223 

ATGTCTCCAGCCATOXmira 
CGGAGGAGCMACC^^ 
SEQ ID 6224 

KSAMRNAWVKLGBRGVRTVNNSFGTO 
SEQ ID 6225 

GAmCHtGAGCAACGTTTGGCGCTA^^ 
CCAGCTTGAOCtt^
ACGCAACGCCCGCCAAIAATAT 
TGTCTTTACCGCCOT^^ 

TERPPIWmSMCPISTimSVGPASVSIRMLSKEfta^ 

(XSUGAOGGTTGRARCTGCGGTCGCCGACCGCSCWj&rATGCCGGGCM 

ATCGCSCAAACCCAACA&3ACGGTGGAACGTGGGAACAGGGCGGTGTO 
CCRCACTCGGttTWSGACGCiGCACATGGMOGAAAACAGTGCA^ 



ATCGCGC ^^ 



SEQ ID 6228 ml ,rOTVTOTTWMn™<H^<»GYHI^^ 



CATATTGTGT 
SEQ ID 6230 

LRLFLMKRRPY! 

SEQ ID 6231 

ATGGCGGATGWOAAAACCGGTi 
<XOTGTCCT<^^ 
SEQ ID 6232 



SEQ ID 6234 

l^IBHIVLKHRPEPDIIRPSrSVKIFFUrG 

SEQ ID 6236 

— — ^™^^ 



SEQ ID 6240 

MPSETPKAFtP'iSVFFKAVCCIiIIASPCIW 

SEQ ID 6242 

MSWMI^AK(%Y(XCRAKHRMAEPGFGRVKlftV 
AIH^CGCOTCGMCCCGTCCTCGACM^
CMCOkTGCTGGCGAG(&GAAAC 
ACTTGACCGCCTC&CAGCCATO^^ 
GAGGATACCCmCCGTCXTOCAAACa^^ 
<^TGCC(£AAGOTTC^^ 
ACTCGGCGGCTCGCTCGGATTGGGTA(X3GCA^ 
ACCCTGACCCTCCTCGCCGC^^ 
TCCXX3CCCGMCTCAACAAAGCCC(K^GT^^ 
GGGCAAA 
SEQ ID 6244 

MNKQPLSLAW(3n , HTGKTSLL^^ 
IDAREPVLDKYW)ELTII^OKPW 
EOTATVLQT^AEIRQLERQMQQIUiFALYRPYHSEIIX3G^ 
TLTUAARMIlLIiHVLQ^ 

SEQ ID 6245 

ATGCCGCMCAMCCATGAATTTAATGCGCX^TGCA^ 

AGCTGACCEAGCATCl^ 

<XJC<KjATGCGGTGG(X&GATTG^ 
SEQ ID 6246 

MPQQTMNUIREOTIFTVLSDE^QILH^ 
SEQ ID 6247 

ATtXXXSTCTCAAGm^^ 

TACCXIGGCGTCCAAGGTaQGTTOGG^ 
. AMCCGTAGGCXX^CGCCATCCTGACCGGCGGCGCT 

TTTGTOTTGAGTTO^TCCT 

rar(X3TTCACCTTTTTTCTOT 

TGGMTGCGTACGGATGTTTCTGAAATMTCGCTT^^ 
SEQ ID 6248 

MPSffiSDGIGIGBSEAVAHAQRGFVGFEAGVFQASPVVVAVPGVQGQVGR^ 

FVVEFEIVNGGQAERRNGVERTVFLIFEI^ILRHmFALV^^ 
SEQ ID 6249

TTCCGCATCOTXXICTCGCCOT 

TTATG^GCACGAAGAAGGCAACCXaCCMCGCCCTCAAG 

AATGGATTTCGGCGGCGGCATATTGGCGGOCXSAAATCGCGCGG^ 

GGGCATTGCGTCCGCGTGGGCAATACTTCCC^^ 

ATCTCXSCCATCGAfGCGGAAGCXIAATCXTO 
SEQ ID 6250

LRIRPSPALVCLKWVmTSRLPVSGR^ 

GHCVRVlOTSIiQIiKVEVWVK^^ 
SEQ ID 6251 

AOtXX:GTCTGAAACTGTCTTCAG^ 

ATCACOTCAITCCT(Xy^ 

ATTCCACACCCTCATACACCrarc 

CAGCTCTGCGCGCAAATTCAGGCTAMCTGGCG^ 

OCGCGTTCCACCATTTGCGGCAAAA'ICT^^ 

CCAGCCCGGQTTGTTGATGAGGCAGTCGATGTCGGAAMl^^ 

CMTTCATCC3GCCAAAGCCTGAAGCCTGT0CGC^^ 
SEQ ID 6252 

MPSEOTRRHSVSrePRPDAGKQRHTIJFK 
QW^IQAKIAHKRLGRPVNVAAGISIIACNRT^ 
QFIRQSLKPVRTARRTDMVSRPIKRAAHRFAEACRSTGN 
SEQ ID 6253 
ATGGCTGTTTTMTTACCXK3TGCTTCGGCAG 
CGGATGAATTGGGTGCTTTGTTTTACXX^^ 
TGamCTGGOTTGGGTTTGGAa 
GTGGMOGCGGOGGC(Kj!ITATGTGATGM 
(XXX^GAGCTG(XX£ATAAGAACAYCCK£OT 
GGGTtm^OTATCCGCCCCGM 

ATGAAGGTCATAAAAACAGCCGTGCCCGAAGTGCGGGAAGACTTTGAAAAA 
SEQ ID 6254 
MAVIJTGASAGrcEMOtTFVGAGYKV^ 
VERGGGYYMNLGSIAGNYAYAGSNVYGATKAFVRQFSLNIJy^ 
MKVTRTAVPEVREffEKQSMSIiFSRIRSWFK 

SEQ ID 6255 

GTGCCXXACATCGCTTCGCCGAAGCCT^^ 
CGCGCCGGCTGAAAACGftCAACAACCGTACCACC^^ 
CGCttCCCGCACAGCCCAAAGGCGAAATTTCCG 
AAATTCAAACAGGCGG(MGC<K3CGTT^ 
1TGAA6CCGAAAAAA(
(X^CGACi^TTTCXXXXmXXX3A& 

KreQAAAAFAESSXKimFESDP^^ 

SEQ ID 6257 

TTGTAIAGTGGTTTAAAOTTAAATCAGGAI 

SEQ ID 6258 

LYSGLStNQDKATKPQTVQEVRQGEKFLVRFWKPL 

GMTTTCACAAACACGAACtSXTGCTCMGCATTCCCGAGGCX^^ 

RFEAQSmPGFKARFWJTVK^^ 
EFHKHERIJjKHSRSGAIjQ 

SEQ ID 6262 



GGTAAATCAfcAA 

SEQ ID 6264 .^^^^^^^^w^™™,^^ 
ATGTGGGACAAAAACCECAAAGCGCAAJGA^
CMTGGGTACTGTGCCCMCGTCGGACTGATGGCGCAA^ 
TGTCCTGATGCA(KACX5AC^ 
G0X5TGTTCTGGCTCGACGAAAACC^ 
CCGCTAAATTCAGCTTCGGCCGTCT^^ 
AATGCTGTCCATCGTTCCATT^^ 
GGCGAATTCCTCGCGCTTCCOT^^ 
AftHBCOCAAAOX^^ 
TttATTGGCAGCCGCTTTGACTGC^ 
GCGATGCGTCCiMOGCAACCmAA 
SEQ ID 6266 
MTQKSTIVrfflTDEAP^ 
AVRERYDRIKGSAVNPVUlEGNSDRRAPKAVKlff 
DAKAKGVLFSLHMKATM^ 

MWDKNGKAQDTKAVIPDSSYAGVYQATIDFCREHGAFDPTTOTV^^ 
AVFWLDEftRPHDKSLLAKVKA^ 
GEFLALAVSFEHLAQKTGNAKAQVIiADT^ 
AMRPSATFNQVUIAI. 
SEQ ID 6267

ATGAAAACTGTTCCAACCATCCT^ . 

MTATOCAAACGCGCCGAMCCTOT 

MTGTGCCAMTCGCTATGGATTCGTTTGCAGA 
SEQ ID 6268 

MKTVPTILTGILIiATAIiPASAHGMHKSKPLA^ 
SEQ ID 6269 

TTGTTCAGGTTGTGTCAACTACTATATAAAAATCATO 

ATGGTCATTTTIXXK^TGGGQACTTGCCIA 
SEQ ID 6270 

OTLCQLLYKNHRFTMIFRNALQQTM 

SEQ ID 6271

TTGACACAACCTGAAOAA^ 

ATTTMTTTACmAAAAACAttX^^ j- 
SEQ ID 6272

LTQPEQKRTALNQGGFAL^^ 
SEQ ID 6273

mTTCTTTAACCCAOTCGGAAAAACTM 
ATCCTTCTAATAGTGTAATGCAG 

SEQ ID 6274 

LFTNPLGKTNFICTLFAPFFINIIQIFNL^^ 

SEQ ID 6275 

ATGCGACCCTCCTATTTC&TTTCC^ 

TGGGCGATTTGTTTGATTTTTGGGTGGGC^ 

CXXXX^TTTOTAATCGGTCGGGATTTTTGC^^ 

ACCGACGACAAG(^TACCTGCOT 

GCAAMTGGAAAMCAGGTCMGCCCGCCGATAm 

TATCCATCACGAAAACGGCOTACCCGCATCGT^^ 
SEQ ID 6276 

MRPSYFISDLHLSEKHPELTELI^^ 

MKAYLRFRRIVHCRRI^KLFI^PIjK^ 
SEQ ID 6277 

GGACAAGGCX3A0GMGCX36CAGACA6TACAA6AA6TAC6G(y^ 

TCTGCCGCCCGCCACATCXXEAT 
SEQ ID 6278 

GQGDEAADSTRSTARRGNAVPV* I *TTIDFRSMAVGSIDDFCRPPHPH 
SEQ ID 6279 

TTGTCAATTlTRaATGTCTTT^^ 

AGTACGGACTGATGAAGGCGCGCGTCAAAGTCGAATTAAACTGGC^^ 

AGTCATCGAAAACTTTTCATTAGAAGACGCGGCCGC^^ 

GTCGCCGCCGTGAGCGAGTTCATC^ 

TCATCOX^CTCACC<^^^ 

CX^TGCAACGCCAGTTCAAAATCCTTCAGGCGCM 

CGCAAOTCGTCGAAATCAGCCTCXX^ 

ACTTCAGCCGCGACGTTTGGGGTTATATTTCATTC 

C(^GGGCAACCTCGGCATG(mACGCXX^ 

(OTAAIGCGCCOTAC^GTG^ 
OGAAGCXAAAGCCAAATTGCTTGAGCTGACCOCCGCGCT^ 
SEQ ID 6280

LSIFDVFSBQDADtllireiASl^ 
VAAVSEFIHFACTSEDIOTiSHAIilLQEARETO 
RNFVEISLGLTFNPYTIQIEPHDYIQ^EFFQTLSMNTILIDF 
GYAVIiGFAAHLRGIJOCI^PNPAAIjAADIiDV^ 
SEQ ID 6281 

SEQ ID 6282 

IJU/3EWI/nMOTIIQFIGASGADTGKAPPKRGLYM 

SEQ ID 6283 

TOEtXOTCmATC^ 
GCCGATACGGGAAAAGCCCCACCGAAGCGGGGCT^ 

SEQ ID 6284 

LAASIiCRPKALQfPAFFPDFVGGMTO 

SEQ ID 6285 

ATGftQCTTCATm^mCTCCCTGTACAAG^ 
CCCAACAAMTCAMGAAAAATGCCGT^^ 
SEQ ID 6286 

MSFIFUjPVQAPLRWGFSRIGTR^ 

SEQ ID 6287 

TTGCCTATGGTTAATGAAAAACTGGCAGAAAACCXj^^ 
TCGATTTTTCCCMTGGGTTAA^ 
SEQ ID 6288 

LPMVNEKLAENRKRYEQKKWR^ 

SEQ ID 6289 

TTGGTTCGGTTTTCCATTAACGGCI^^ 
GATTTTGGGCATATTCCAAAAGCOT 

SEQ ID 6290 

LWSINGCAWRIIKVDTFKVNYFLIF^ 

SEQ ID 6291 

ATGMTGCTGTCCAAGTTTTAAACTTTCAACAAM 

GCCGAACaSTTOCCl^CTGC 

ATCCC(X^GCCTGCCGCAGAAGCGTTTKiAAG 

TTGCGCCMGCTGTTGCXXjCGCTTGTC^ 

OTAAGTOSTCGCCTACGTCC^ 

TGCCGCTATGGTTTATTACGGCACATGG 

OCAATOTGAAAAGATCOGCCGCGGCCC^ 

SEQ ID 6292 

MNAVQVLNFQQNSVRTOIMGEIitfFLAND^ 

LRQAVMLVGI^IBYSSAYSMIH^ 

PILKRSAAALEVLRGYADJU3ASDRIAACLBGIYGKATVR 

SEQ ID 6293 

TO^mGACGATWGTGCTGAAGAGTTm^OT 
CGGCGGATTATCGGGAAATGTTGAMCGCTATGCCC^^ 

SEQ ID 6294 

LDLMSAEEKmQIAYWLNBSRKASDNADLKAFEFAG 

SEQ ID 6295 

TTGGTTTATTCGGOTTGGCGGAGGCT^^ 
ATTTGCGCCGTATCACGCGTG^ 

SEQ ID 6296 

LVrrtALAEAFGIPDAETFDPQE^ 

SEQ ID 6297 

ATGAIOTATGGAATTGATTTCCGGCTCGTO^ 

AACGGATACTGTCGATTTCATACGCATMCCCX3C^^ 
GCGCA(^CGCCTOTGAACC^ 
TTCAITCCACGATACCGAAC 
SEQ ID 6298 

MHYGIDFTILVPWRHIPRQAIPPGKQPAVWRR^ 
AQARLVNRIVQPSRHLFVAPnRIjGLIIGKHAETR^ 
SEQ ID 6299 

GGCAGGCAAACGCA 

SEQ ID 6300 

MFSPTETDFPANAPIiPAASIIPG PPPEMTAIPESDSREARRQANA 
SEQ ID 6301

ATGMTTTGMGGCTTTGATTGCGCGTT^^ 

"KXXXXKCCGGCTGCT^ 

CCGTTTTGTCTCCGAGGGC 
SEQ ID 6302 

MNLKALIAmVLANDKAEPYF^ 
SEQ ID 6303 

ATGATAGi^ACGGTCGCG(XACACGGCACGCCCG(X^ 

ATCCCOGGGCGGCGTTTCATTCCGCTTA^ 
SEQ ID 6304 

MTOTVAPHGTPADmmiHWPRRRPW 
SEQ ID 6305 

SEQ ID 6306 

LVFEDGQVmSAGAAAKL^^ 
SEQ ID 6307 

ACGGTGCAATCTTGGGGGCGGATGOT 
GTCTTCCGGTATCACTGCCGATGTCGGCTTTO 
CGCTGCCAGTCTGCCAAACTGGTTACGCTO 
GCCAGTTG 
SEQ ID 6308 
^SWGGFMARIHVRKKGGNRFGGVPYGMjAAEHYCIVA^ 

RCQSAKLVTIiPEQAIiI/rVTLKGAENKKAADIDI , . 

SEQ ID 6309 

ATGaaCAAMCfta^TAO^^ 

CTGCCGCTACCOXKSGTGCGG^ 

(XXTTGTTICCGCCCrGCCGATTATGGGCGA 

ACGATCACGAArcAGC(^GCCCTG(m 

TTCATGAC^TATCGMTGGGGCGTGCCTTTGGOT 

MC6GGCGTT6GGGACAACGG06GCGAGTTGMGATTGOT 

CGCGCGCCTCCCAGGCAMTCAAMTCCGCT^^ 
TTGCX3CCGATAAGTTCAGC6AGGCG6AATQjGGCTO 
AATACCGQCAAACACGGOOTATGCCTTTCTTTTG^ 
ATCTCGGOGGGGGTGCGMGGAAATTACCGACTACGGOGT^^ 
SEQ ID 6310 
MAQOTTAYGDPQAMMKQAAGLPAMHMQRNSTIiNRIA 



TOKHGOTFFWSEKEIiIBGNRVETLVGTIRG^^^ 
SEQ ID 6311 

TOAGGGAGCGCATCGAGGAGCTO 

AAAAGCTC^TTCAGGAGAAGCTGGCGGGTTATGAGGCG^ 

CCGC^A«mAGGAGTGGCT^^ 

AAATCCGOCGCCGAACC&GAAAAGCOCGCOGAAMGCC^ 

ATTTGAGOGGCAACGCGCTGGCGGAAAAACTGGCTICCATGA^^ 
SEQ ID 6312 

LRERIEEI£KGAEKR3VETADGSDNSLFOT 

KSAAEPEKPAEKPPAGKOTPWn^IPAGRDHTASTCPP^^ 
SEQ ID 6313 

TTGGTOTACCMTTEATCTTTTA^ 
SEQ ID 6314 

LVLPIYLLraSGVRDWPNLDAPYElANA 
SEQ ID 6315 

GAAGTATGGCAGTCGGCAGCATTGATGAOTCTGCC^^ 

GGCCTGTCTTCGGTGAAAAAACC^ 
SEQ ID 6316 

EVWQSAALMISAARHIPIRTFHCDIjR^ 
SEQ ID 6317 

ATGGAGAATGTAAACXXXEWXXXXjA^ 

aggatcatgcggattatgcgtttgcctgto 

CGGGAA(STCGGTTTCGGTAGGGC^ 

CGTTTTGGTCOS^^ 
GTCGGAACACGGGCTTCAGACGGCA 
SEQ ID 6318

MENVHRVPEQAFY 

YSQAYVVMPGGFGTLDELFEIMIjVQTGKVPPRPVVIjVGK^ 

SEQ ID 6319 

mCCGACGGGCC(£CG^ 

SEQ ID 6320 

SEQ ID 6321 

CTAAATACCGGGCCAAGCCGTCGGAGGCGGTGGGGATTGATO 

CAAACAGGGTTACCWmXXXX^^ 

CCTGAATGGGTGCGCCGTCATGCGCAGACGGTC 

CCGGGGCTCTACGCGOTACCGTGCGCMC^ 

(XAGGCKK^GCGTTTGCAGGGGG^ 

CAGGATGTGCGGCAGGATGTGCGGCAGGGTTTGACC^ 

CCGGCAATGCGCCCGCCCGTCGGCAAAAOTACAGGTTC^^ 

(XX3(XXXjACAGGCGCAGGCCGGAAAAGGC^^ ' 

AAGGCGGCAT6CCX36AGGAGTTTQCCGA^ 

GTCGGMGCCX3TGCC<n£CGA&XX5GG 

{XXSaGCCTGCCCGGGCAAAAGATACGCCOOT^ 

TCGATGGCGGAAAGGGCAAGTCGGCCGCCGCA^ 

GGCACG(KXK^AG<^GTGTGTTGa 

GAAAGCATAGGCX^CTGGCGGAACGGGTGGATGTCGCCGOT 

AAATCACGCTGGTGGCGGAAAACXTFACGCCC^ 

TOCGGACGGCAACGGCCTGATTCQGOGCATTGCGGAC^ 

TATGCGGCGCAGCGTACCGGCGGTTGGGCGGGCATTGAAMCCOT 

GTGTGCTGCMCGCCTGGCGGGCAAGGCCGGCGGTG^ 

l^TCGTGCAGTGATGTTC^CGCGGGCCGMGAC^ 

GGCGGCAATTTGCGCGAATATAAGCGCMTGCGCTGGAATAC^ 

TATTGCGTCACGACTATAAGGACK^GGAACATTTG(^G 

TATTCACGGITATGAATATTATC^TTCGGG^ 

ATAGAAAAAGGCAACTTGCTTTCATTACMGAC^^ 

ATGCGGGCAAGGGCGGCATTGCCGATTTTGACACGGAGGCGGT^ 

GAAAGACCGCTGGGATGCTTCCAAGGGGATTCASCTC^ 

TTCGGCGCGGATGCGAACAAGGCGGraCO 

AGGTGGATG(X6ACCCGCTGATGC6AAAG6ATGOT 

TGCGCGTATTGCGCGTGCGGATGCCGCTTACAAT^ 

GATATGCGCCTGCGCCGTCTGTOT 

AGGCXKKCGOGGATGTGMCKXSTTTGGA^ 

AACGAGCTGGGCGGTGTGTATTKXCCT^^ 

A(^CGCGATGCOTK5M(K3CTGA™ 

CGAG(X!TGTCGGTATGTTGGATrTGGA 

AAG(MXX5TGCCGGGCTTCAGCGAT6ATGCGAGGCGCGCATAT^ 

TGCAGGATTTTGTGGACGGGCGCAAATATGAG 

GGCGCAGGCTTTGACGGGCTTCCK3^^ 

TATGCC(X5TGCGGCGCGGGAATTGCTGCXKJGC^^ 

CGGCGGATTACGGWreATCGATTTGT^ 

CCATCATGCGGAGAAGTTPMCCGCCAGGTTACGTITC 

CATTFTGAOTATGCGGCGCAAAACCGTCCGC^ 

XTGCGTTTAAGGGGGATMOGAGGCGCGTAAGACGCTGGCGGGGC^^ 

GTTGGGCAGTGAC<^CGATGACCCGTGGGATGCGGAA 

TOGACGTGTCGGGGCGTTTGGGCTTGAACCA^ 

CGGGTATOGGTGCGGCGGAOGGCGTGCGGACAAGGT^ 

SEQ ID 6322 

U^CTERQPDVSGIPSEDGAEPLDT^^ 

SVAPGGSAAAASGGIARVAKiP^ 

PEWVRRHAQTVADYTAI&IMRR^ 



AADRRRPEKAGVPI/3(£EYRF^ 
AQPARAKDTPVAGKAAAAKNAATEKPSSIDKVM 
ESIGGLA0WDVAAVSEfIAPDKA®OJjSQRVBGWPD^ 
YMQRTGGWAGIENRYGVKVGNGLKRGIAGVLARIGAL^ 
GGNLREYKRNAIiEYGKSLRGPY^^
IEKGNLLSIiIiDRVSTTGAS^ 
FGADANKAVTEADKWR^ 
IMLRRLFYADSEAKRALRRAGADVAAESRAKM 
NEWKJ\nfFPLARF6DYLVVVKDAIX3^^ 
KGVPGFSDDAI^YAQNMGSGANYLAKLR^^ 
YAGAAREUiRASKQIGLRFGEKFOTI^^ 
HFDmQNRPRFlMGNAAKV^^ 
LDVSGRIX3LNQLVFPDIQIX3LEG™ 
SEQ ID 6323
ATGAGTGATTOETCAGATAC^^ 
GCXXXjTTTAAAAATTCTATGCOT 
CAMTCCXIAAGCGCGCAGGGAGTTGGGCGAGGCTT^ 
GCCTTGGMGATGTGQGCGAGATGGCGGGTGCGGTGCT^ 
CTGTCGGCGCmTGCGGGCGC^^ 
TATC0XXX3AGGTGC^^ 

GCCX36GAMGCCGCATTGGAGAAAAT6GGOTTGCGG(X3GCGGAT 

(XXSGTGCGGCACGGCACGCGGCGG^ 

TTTCTCTTCTTTGGGGCATTCTGC^ 

CMCAGGMGGC(K3GCAGGCCGGCCCCGGM6GGGCGCAGGCGT^ 

TTTCGATACGGC(K^TCACGATCAGTCXX^ 

6AGGAATTGGCGOX^C5GACGGAAAAACA(X^^ 
SEQ ID 6324 

MSDLVRYDPIiEHGRLVGGlJCEYRGFTQKDARAAADIW^ 

AIiEDVGEMA(&VIiEQAPNA^ 

AGXAALEKMGVAAADmVAAAXGTPCT^ 

QQEGGQAGPGRGAGVACGGTGGCGRRHKRIjRTEAW^ 
SEQ ID 6325 

TT6TTCGGAG6GTTTGCGATGGGTTOTTTGA 

CACAG<m:A(XX3TGaX&T^ 

GCGCCGGGCGGGTGCTTTCGAACGGGGTGCGGATGCCGGT!^^ 

ACGGATGCGGGCGGACGGACGCCGCCGOTTTGGGfl 

TGTCGCCTGAGGAGCGGGAGCTITATAAGAGGGCGACT^ 

AAATGCGGC^TCAGG(^TTGG6CAGGGAAA» 

GGTGCGTTGMCCGCGAGATGGCGCAGAGGMTGCGGACAGGG(^ 

TTGAGGATAG(^GC(X!ATCQCC<^TTT^ 

GGAGAAGGC^TOACCCGAATGTGTT^ 

GTGGAGTTOGCGGGAGAGCGCGAGAAGCGTTATGCGGCGT^ 

AA 

SEQ ID 6326 

LFGGFAMGLFEPSAGDPWEMREKEEKEKARRGAEERERAAAQAHR^ 
TDAGGRTPPPLGFDGNVYRGGI^VRDFDAQRPLVSAGro 
GALNREMAQRNADRAFDVQQAELGm 
VEFAGERSCRYAAUSFKPNGQKTAGGRIIYENEKGEKRVEQ 
SEQ ID 6327 
TTGCTCAACCCTCm^ 
ACATCCAAAGGCTTCCCOGTCCGCA^ 
CCCCTTGCCGC^ 
AAGCKX^aaTCCGCCTC^^ 
CCCTOTTCGCCTTAACCGCCG^ 
OTAmTOTCCGCCACAA^ 
CAGCGCATCCGGCCCGGCAGACACCAAAGGACGC^ 
GTa^XOTATCGCGCCTCGC^ 
GCAACCXXX^KSCCKXymX^^ 

AAATAAAGTC£AAAATAAGCAAAAACCCAAGCC(^ 

CGGGCCATOTCCGGGGGCAAArAAAAAGCOGOC^^ 
SEQ ID 6328 

LI^PLI/nijILIHDFTAGRPIjTVRLKAQRRITLIALSRM 

KLQIORLTIjHIiKPLPIflPQIiGIiI^^ 

QKIRPC^mRVKITHRI^ 

CRRPLPLLRPLPGILFFLLFLHI^^ 
SEQ ID 6329 

ATGGCGGGGCCGGGTAOGTGTGTA6CAA(XX3C^ 

GCG(m(XCGCATTACAGACMCGAaXXX5GGTATC 

GGTTGTTTCGGATGTATTGAATGCCATAGGATTTAC^ 

AAACCamTCATGGAAClTCa^^ 
SEQ ID 6330

MAGPGTCVATAYPAFCAYIQVKQGKHQ^ 

SEQ ID 6331 

GTCCAAACAMGTTGGTTTTTCCCMTGGGTTAM 
TTCAGACGGCATTTCTTTACACGGCAGGA 

SEQ ID 6332 

VQTKLWPNGLKIKSMGCKNSPTAAKPQT^ 

SEQ ID 6333 

G'KmAAACAGCCCGACCGCCGCm 

TGTCAGGTTTT 
SEQ ID 6334 



SEQ ID 6335 

GTGAAAAMCCAAAACTATTTTTCA^ 

SEQ ID 6336 

VKKPKVFFKRTYCLIEYSVrajPDTTKVVF 

SEQ ID 6337 

TTGACAAATTGTTTGCAATTATACATCACTCT^ 

CCTTCG(^TCATACCGCGC«^^ 

TCAGGATTTCAAAmTT(»TCCAGCGTC(XmG 



ACGGCAATACCCGAATC(X5ACAGCCGCCGC^^ 
CTTCGTCTGCCTGCCTGCATTC(m 

SEQ ID 6338 



LIRIRFLLVRQQ^PYRNRLPGKRALWRIiH^RPAA 
TAKPHPPSPNLFSDGIFGNLLKSPA 

SEQ ID 6339 

TTGCATCATMGGCGCGTCGAGGTTTGGAAAGACGTT^ 

T6 

SEQ ID 6340 

I£HKARRGLSlRSNPAHPIFWIiKDKLVRPI IMRIjHTLRNLM 

SEQ ID 6341

TTGGAGGCTITATTGATGGGGACGGACGTArcGGAAACAGGCCT 

CATGGCGTGCCGTTGCCGACAAGGAAATGGACTACGCCG 

GACCCTGCTCyrcGGTACAGGGATATGAGGTO 

(^CGGCAAAGCCGTGCCGAC^^ 

GCGTCATCXJ^GCAACGCCATCCATTGGGATATGA^ 

TGAATATGCCQGACACTTCAAAGCGATG^ 

AACACOTCAGCGAAGAGTTITG^ 

TGGAGmSACGGCGCAAACCCm^ 

CGAGCCGACCCCGTATCCmTCAAAAGTT^^ 

MCAGCACCMCAGCAMTTACGAT^^ 

TCGTGCTGAACAAAATAGAGGCCGCTCAGCCGGG 

CAGC«^mCCXX»TCATTTAT6GGCAACC^^ 

AGCCGCTCATTGGTCGGCGAGTTGCTGCTTGCG^ 

(mrW^CCGACCCCGTAftCC<mA^ 

raxaTGA&SAGGCGGTC^T^^ 

OftGTGCflSGAAACGCOT 

CGAGCXjA2ATCAAGGAAATCX^GCGCGGGCXOTACAAAT^^ 

GGTCATGCAGGGCGCGGGATATATCCKSGCCGGCGCGGGG^ 

GATAttXTCXXXftTGACC^^ 

SEQ ID 6342 

USAIiIiMGTDVPETGVLPDKNGEPM 
ERQSRAmCSDAPR(^IACGIGWVEVTRNP^ 

msEEFWFNEmEiiAVAEvwmwrarc 



PFKDKVIHKtKEV 

RVQETPEQIEARIAQAVQDM^^ 
DTGPMTAVPPKSAJfflAQTGMETPTVSENL 

SEQ ID 6343 M %r ^ wvwtM%nm 

ATGACGGGAAAGACGGTOAm<^ 

CACTGCTOSCGCT^^ 
G(^TTTCTACGAAATAACC6ACTTCGAGA^ 
TGGGTTGAGGAAGGGCAQjGCXjI^GTAAAAAAA^^ 
ACCGGCGTTTTATOGCTATGCXXjTC^^ 
TAAAGAGGACTACGGGMTATTTGGGAAG(X^GGCCGCGCATG^

TCTTCTTTGCCCGTGCAT^^ 

03TTCGACT(^TGTTGCXX^ 

CGGACTGGGGCGCAAGTCGGTTTTCGTGC^^ 

(^TGCCTGAAACGGTACGGCXXSCCAAATACATGC^ 

TAATGGATACAGGCAGCMTACGGGAT&CACGGAAACGC(XCT 

SEQ ID 6344 

JflEKTVDIiKIiPAKLDGIiFKP^^ 
WVEBGHGVSKKSWDVI/rOT^ 
SSLPVHTVWDLGWNDJOTGLVQRDLTSVRIIGnE^ 
BCLmCRQIHAKTGVAIffiPIiHDEYffi 

SEQ ID 6345 

TTGGCGGAAAACCTGTCCAAAAGCGCGGA(Km 
TtmATCGAGCGTAAA^TACGGTATGGATO 

SEQ ID 6346 

LAENLSKSJ^IVAELYGSPEOTFTKABMFQTVAW 

SEQ ID 6347 

TTGGCGGATTCCX3AAATCTTCGATTCGGCGOT 

CCTCOKXSCTTaCT 

TTTAAGGTTGAAATCOGCA 

SEQ ID 6348 

LADSEIFDSAVSIPniUlSISTKTUIPAA^ 

SEQ ID 6349 

ATGCCTGCGCTOTACGGCGGCAACCCGTTTCGC^^ 
TTTGCCTGTACCTCGTCATTCAAATCGCGC^^ 

SEQ ID 6350 

}ffALYGGNPPRFRVRYLFKVEIRIIRRIAVFW^ 

SEQ ID 6351 

ATGCAAGTATTTTTAGACAATCTCm 

GCTTGCAGTTCACTATGTTGGGCTATGCGCTGAAA 

GTCCCKX^CATO^Ca^ 

GAAGGCGTGTTGGCGGTGAAATACCGCGTCAACAATTCC^ 

CGCTGTTCGGCACATTCGCTTCXn'TC^ 

TCTGACTGCCGTTGTCX?mTAG^ 

GAIIH^CTGATGCCTGCOTCMGC^ 

AGGCGGGTATGGGTTCTGCGCCT^^ 

CX^ATCGTGTTGGTCATGGGTCTGC^^ 

ATTGT(&CCATCG<XX^ 

ATGmCTTO^TCATGCCCXOT 

CATCGTCMTGAAACACGC6ACTTCAAACAAAAAATO 

SEQ ID 6352 

MQVFLDNIjRAFFEWSSWVTOS\^ 
EGVLAVKYRVNNSKGEMSGGPMYYIEKGIXjKNWKW^ 
DALMPAVKLIFSDAFSAQAVAGGMGTVIRYGVA^ 
IVTIGIJFFAYSTILGWCYlfGEKCAVW 

SEQ ID 6353 

ATGGCACTTTTCCTCAGCATATTO 

TITrcTACrTCGGCJGATGCGGGAATGCTGC^^ 

CACGGGCTGCATCGATGTCATCCGCAAATC^ 

ACGCCTGCCGCGATTGCCGCGCCGATTCT^ 

CCTGGTTOGGTTTCGCACCGCTGA^ 

cgt(OTtg<h , ctgaaa^ 

(XaATCGGCCTGATGGTGTCCGTAM 

CTTTGGGTATGCTGATCGGCATGCTGGTW 

TACCGTCAGCC&CTCCCTGACGAT^^ 

ATCCTGCTGTATAAAACCAAATTCAAAGATGCCTGG^ 

TOGGCGGCGACAATOGATGGTGAAAATCAKX^ 

CACOHGTCCAAOTSA^ 

TTTTCTTCCTC 
SEQ ID 6354 

MMjPLSIFPIVLIJWLKVKIM 
TPAAIAAPIUlSL(^LmiraVMNSVPVSP^ 
AIGUWSVFAANQCmSKDHAKDKffiEK^ 
ILLYKTKFKDAVTCHAVTFNQTKKP^^ 
SEQ ID 6355

ATGCCGAGATTGTCAGATGAGGATTGGCXKyUUlG^ 
(X^AAGAACGCffPGTGGACGCGCGATT^^ 

SEQ ID 6356 

MPRLSDEDWMCVELDYRR^ 

SEQ ID 6357 

ATGTCGGGATTGTC6GATftTTGACAAGG?ICT 
CTAAATTTACQIAGCTGCTGGTC^ 
GATGGGCOTATTGGAAATAG6CGATACG6TATTGWOT 
TTTTCAGTGTTGOm^ 

SEQ ID 6358 

MSGLSDIMCWQGVVDLBNABQVVSREA 
FSVLRNYLECSAKHGRLEF 

SEQ ID 6359 

TTGGGCATCTATGGAGCATGGTGTITKITGGGATTMCA 

ATAGCGGTTTGCATTM2AAATCWKSGAM 
ATTJATTTGGCT TTTT GTOATC 
AGAGGGAAGTOAAAAATAATACTAGCO^T^ 
ATATTGAAGATGCAGCGAAAGCGAGAGAGGCAGCATC^ 

SEQ ID 6360 

LGHGAWCFMGM^EVLKEL^ 
RGKPKNNTSGHTGVYFHKPSKKWQAR^ 

SEQ ID 6361 

ATGMCGGGGCGGAATTTACACTGACGCCCCAA^ 
AGTCAAAACGGAGCGTCGAGCAGAACAGGCGGCTATGGTOT 
AAMTTTATTGGATGTATTGAAATGCCTAACGGGCMTO 
CATGGTGTTTTATGGGAT 

SEQ ID 6362 

MKSABFTLTPQNKKQVMRSIWDSPDGWFES^ 

HGVIM) 

SEQ ID 6363 

TTGCGCGGGCGGATTGTGATTTCAAGGTTGCCGTTT^^ 
TTTCAGACGGCCPTTCAOGCCGA(^^ 
ATCGCGCCGCGGC(&<^C^ 
(XXXXS3CGATGTTCGGCGTGCATT(KXX3GCC^^ 

SEQ ID 6364 

LRGRIVISRIiPFSIKPSGIiSQIDMTCF 
RPRCSACIARLWRLCVSB 

SEQ ID 6365 

ATGTCTAGCGCGATACGCAAAGCCG(£AAAGGCGG 

(EGGATTGAAAAGCGGCGGTTTGGGCGCGGGTTTC^ 

ATTGCGCACGATACGCCGCCTTGAAGCATTGGGG^^ 

SEQ ID 6366 

MSSAIRKAAKG 

SEQ ID 6367 

TTCKXXmTOCCTATCGCGCT 

CCCGTTACCGGTTCGTCGCCATATTCGGCGGCM 

TOTGTTTGCGCCGCCGTCTTO 

TTTGCACGATTTG 

SEQ ID 6368

LMLRIALDIPTLl^IPTLVC^^ 
RMjRAPDRPVAPNPPISVCRYGVRQDKARHAHSPAGCVHRPA^ 
SEQ ID 6369 

SEQ ID 6370 

MTAARLHGYIRAARAACPIUjAGIiWTQPAGECA 

SEQ ID 6371 

flj»(£CGTCXXXXX£AAGGAAGGGGGGTCT^ 
CACGGATACATO^GCCGCGCACGam^ 
ATATTG«X5GATTTGGCGCMC^«KXXX?ra 
TATCHTOX3CTGATA(XXZAAAGCGAACAAAGACGGCGGCG^ 
GGCAGGTK3GGCGCA^^ 
CATCGGACAGGAGGGTTGGGATGTC 
SEQ ID 6372 



YVRUPKANKIXSGANKTVIDLI^ALKVTL^^ 






SEQ ID 6373

TO^GTTTCCACAGAAAAATCCGAA^ 

MCGCACC^AGTGMGGCAAACMTGTCTT^^ 

C6GCGAA6AGGAGTTTA00GCCGJAGCCATC^^ 

mGCCAAAQQ(^TGTTCAAAGCCG(^ 

GCGAGATCXXXBCTGAAATCG 

ACTCXKSAi&AAAACG^^ 

CTCCCCAAGGCCGACGTTACCGOr^^ 

CAfflXTO^TCCAGGCGCA^ 

ATCCaAGCCATCGAGOX^^ 

CtfraxmMTAO^CCC^^ 

CGAGATGATGATTCTTCOCMCAGCJ^^ 

MGGGCATQGGCGTGCAOC^^ 

AAftGCGATCCCGAACTCT^^ 

ATCACCXMTTGGATGCXXaAGAAGCM 

SEQ ID 6374

LSFHRKIRKMHIPYEESGQFKVAVVVQKHDATYQ 
KAKGVFKAAPEETLKQAIAAIEIUffi^ 
LPKADOTAFSIDDESITEVIBALSL^ 
IQAIEPHFHAETGIiDQAGEMMFAHH^ 

ITSLDAEKQFVSLNY1KAAAPGGKTAGNAV 

SEQ ID 6375 ratmraaa^^

AAAATCATTGCXX3CGA(^CGG^ 

TGAGGCACACXATATTGOKATC^^ 

GGTGTTGGAGCCGGAGAAGAACGCACCX5AT0GCG^ 

CCGACCA(£ATCAGCT(^OCA^ 

GGATGCAMTCCAAACGtHCAGCAC^^ 

(^AATFTTQGACAAATCAAAOGGCAOT 

AMGGGGCGAGTGCTTTGGC^ 

TTGC6CCG60C3AC(^GAOt^^ 

CftDGMGAAGCCCAAAt^^ 

CAGGTCGGOSTACCGACC^^ 

(KXSTGCCGAAGCCO^TGCGCCTT^ 

CGTGGTTTCCATCATAC(XjTrGAACATM 

AAAAGTTTGATGGCGTAMTCACKIACTGCGGT^ 

CCATAACGAATCCTTOTAGGCATTTGCAT^ 

SEQ ID 6376

LFGGAGCSGTCILPDAPPGmVSVPRVLPLWMAAFRL^ 

GVGAGEERTDRAQIRRKINPVPARHCGKFLTDDFHHRIVA^ 
GNFGQDCRQLEAEPFFFAGQNAFDSEL 



RGFHHTVEHNRASfflGNI^ 

SEQ ID 6377

TOCOXJGGACGGCAT^^^ 

T 

SEQ ID 6378

I^TAI^SmVPPQALQmSSAILIFPFVFMaiSEAV^ 

SEQ ID 6379

ATGGA^COTX3TTAAGCATCA3^ 
AAGAC6ACTTCGACGGCTOGGATAT 

SEQ ID 6380 

MDTOiSIITALSFGGAATIA^ 

SEQ ID 6381 

AIOTX5GAATTTCGGGAAACrGAT6AAOC^^ 

SEQ ID 6382 

MLEFRETDEPSFPPISVIPAKAGIHKPRTFQmGHS 

SEQ ID 6383 

(TPGAa^TGCTTOACAGG 
CGGCAGGMTCTA3XmAACGGCT(^ 
GAIKSAACCGTCATTCOXXXX^ 
OGGGAA 
SEQ ID 6384

VmNKVSIVIJPFGRFTCSGS^ 
RE 

SEQ ID 6385 

Gl^TTCCCGACGCTTTCGTCA^ 

SEQ ID 6386 

VIPDAmPAQAGIRSVRFQPFPINSCm^ 
SEQ ID 6387 

GTGOGGGQCmCOCOtXn^^ 

AGCXOT5CCTCCGTGCCX^ 

GCAGGGCTmTCCMGTCTTCG^ 

TTTGTAATAGCCCGGGTTGG^^^ 
SEQ ID 6388 

VRGPPRFH^HFFQQIFDTC^PRnW 

FVIARVGIAALLRRGFRPDGFGLRVSDCHSVFPFKFAF 
SEQ ID 6389 

ACCTCAACTCAACCTCGGCAAC^ 
GTTCAAGAAACTCAAACA^ 

GAGGCATT 

SEQ ID 6390 

MTVRffi^EmmAPQQC^^ 
IXjEDfffAWAAACVEDLLEKMPPEAGRAPHPKPMPPETAGGGI 

SEQ ID 6391 

TTGCGTTTTAACGCfflTmTm 

ACCGGCGTCOCXX^TTGGTTT 
SEQ ID 6392 

LRFNOTKSRVOTIPKAFYPP^ i 
SEQ ID 6393 

ATGCGTGAAACC'KOTIOTA 

CGCQAACCMTCCGTGCGCOGCCGGCGCGTTTCAGAI^^ 

GGGTAAMCGCTTTGGGMTATCCCAGCCTACCCGM^^ 
SEQ ID 6394 

MRETCFYCKHADFKmGTPVI^FAKCAKARNAraATYYPOT 
SEQ ID 6395 

TTGCCMTCSCAGGCCAGCTTTOTACGOT 
CGGCCTCTATCAAATCGMT^^ 
TTTGAAGCGCAAACCGACATGM 
TGCCGTACCGCCGTCTGAAGCCCGAAGMTGC^ 
CTTCACCGAAAAOCTCTACAAA!^^ 
CGGTTTGAAACAGATGCCGTIXjTCTGC^ 
CAAATTCCGACGGOGCAOMGTGTOCAC^^ 
ACTGCXXGAAAAACGOXmAA^^ 
CC6TTAATCGGGCQCACCCGCTTTGAAAACCTC 
CCGAMTCCGCAGCGACGATTTCAGCCTGTO 
SEQ ID 6396 
HKVLVLGAOTAOTS^^ 

FEAQTIMFEGRKKGTLQIFRQTEEVEAAKQDIAVIiERYGVP^ 

RFETDAVVCALGCFSB^ 

PLIGRTRFEHLFLfflX&CTLG^^ 
SEQ ID 6397 

GTGCCGGAACAGCCaXXXXXXXW 
SEQ ID 6398 



SEQ ID 6399 

ATGAAOCGAATCGAGGAAACGGAAGCCGTCC^ 

CTGaAAAGTTm 

GGAAAACGCtmWCTGGOT^ 

GOTKXOTGCGATTCAAAAW 

AGACATTCGGCCAGACCGTTGAGC&TTT^ 

TGGAAACCTGATTGTGAl^GGCGCGTO^ 

OXXtfAGAGCTGGCA^ 

MGAGTGGAAGTTTGACGTGAACTGCGACCTGCTCA^ 
GCCAAGGGCAGGGAGGGACGAGGTGGCGGAGTTGGGGM^ 
AAGCAGGCAGACAAACX3CCCGAACATGGCA6A^

mmXXSAGCTGATTATCGCC^ 

ACATGAT6AATATGACCOTACAGTGTC 

SEQ ID 6400 

MNRIEEmVQSLASVGAEQNIL(KILIEPTAIARCM^ 

ASmEmVSKIXX?rVAEiaSKAADEL^ 

AVELARRGMAAECNIPMQNLKTGNL^ 

KQADKRPNMADIRGSGAIEQD^ 

SEQ ID 6401 

ATOMTATCAACAGCCCCQAAOTC^ 
ACAGCOXXSAAATGC^TATCAAaGCCC^ 

SEQ ID 6402 

HQTQQE 



SEQ ID 6403 

TTGATGATGAAGCCGTCTGWUUjOTGA^ 

GGAACGATAAGACGCAGTATGMTCAGGCATTT^^ 

ATTGATTGAGACTGAAAMCGAATTGAACAKX^ 

ATCAACAGCXXXX^CTTCAAAATC^ 

CGAAATGCMTATCAACA(XXXXmCTimAA 

AAACGCCGAAAACGGCAMGGCGGTTOAACGTO 

AACCm^CTTTCGGGAAAACCGAAAAATG^ 

«£TCCAAGTGCTGAACGACA<^^ 

OTTGCCGGTTTCTTCC(X^^ 

T6G(^TCCGCCTAAAACGAACCAGGCGGCMGGGGMGGGCA 

SEQ ID 6404 

IJ^SESLRAAGRPIAYYPKI^ 
INSPELQNQHSGGCHSPTAPKCNINSPE^^ 

ngglsgkpknanvprkik^^ 

WHPPKTNQAARGRA 

SEQ ID 6405

ATGAGOEGGAACAAAGAATAC^^ 

ATGATGAAGCCGTC \/ 

SEQ ID 6406 

MSRBQRIRGARLPWAHMQAEGAGKARE&LDGK^^ 

SEQ ID 6407 

GAATTGAACCA 
SEQ ID 6408 

VTQLAARITDLEGRGFVFAKPK^ 

SEQ ID 6409 

TTGmGCATTGCtnrr^^ 

SEQ ID 6410 

tfALRLFLIHNRIiSPIiPB^^ 

SEQ ID 6411 

ATGCCGTCCGACTTATCCTTCATACTC« 
GTCAAAAACCCGAAGAATCGG^^ 

SEQ ID 6412 

MPSDLSFILAKELIRSGSIRIiSGSTAK^^ 

SEQ ID 6413 

ATCAAGAAGCAGGACAGAAACCGCCTGTCGAAGAAA 
ACGOTTTGAATTACTTGGAAAGCAGCCCGAT 
SEQ ID 6414

SEQ ID 6415 

TAGCGGATGTCGACATACGCGCCGGAATGCAC^ 

CGGCATCGCGCACTTCGTTCAA3T^ 

(HCGGCAAACX^GOmGTC^ 

CATCAATTTCAGCTTCGCTCACGCGGCTG^^ 
ATTTTCn-GCGACGACGGCATCGAT^^ 

SEQ ID 6416 

IiliDSRLRGNDBWRCKSNSKFKKMPPGLLPDRro 

VGKPAQVAAIARACVLGISACQTCEIRAVARLVVDFArc 
ILCDDGIDFRFTCLN^ 
ACCCOXSmTCAAAmiTCATra^
SEQ ID 6419 ^..o^mmmto^im^m^t^^ 



SEQ ID 6421 




CGCGG«*TCGCCAC«^^ 
GGCAC6RT1R0CCRGCAGATG0GCC(XSJWjGGCTGO^^ 



ocreoxxswaumxxaooGW^a^ 

GTG6TACGACGMGCAGGTMTGTTTA03TCGCC^CAATCAGGC^ 



SM3ADALRFTMASYASLGPFG 



GAGAATI 
SEQ ID 6428 

MDMU9FQBIS 

GATGMGi^CCCAAAAACGC(^GGATGCCGAT 
DEETQKSADAD 
SEQ ID 6431

GTCCttAAGAO^^ 

GOTATTTSGGMTATCGGGCAfiaTAAAAGO^ 

CXXXXjTTGTATTTGGGAATGCCGATGT^ 

GG8 

SEQ ID 6432 

VPmSLSLPPWKKP^ 
G 

SEQ ID 6433

ATGGCMATTCC^ACAGCGGGCATOT 
ACG^GGAGOTATCCTPGCCGCCCTGGAAAAAGCCGG 
SEQ ID 6434 
MANSNSGHSKKIiRAATAAAAI^^ 

SEQ ID 6435 

ATGMCGMTTMTCAGCA^^ 

GOjAAAGCATCAATCACACCGOOT^ 
SEQ ID 6436 

MNELISRIKRPGAEAKDEQSIiIiIiKV^ 
SEQ ID 6437 

ATGGAAGCAAATAAATTTGAAGTGAAAAGTTTGTCA^ 

AC^CCGCAAttTGAGC^^ 

GTOAAACAAAAAGCAGGCMTGAAGTG^ 

AGCCAAATTTGCGGCGAGCTTtft^ 
SEQ ID 6438 

MEANKFBVKSIiSMiIKVFAGIAADPBAAMGVKRADISTOT 

SQICXSEUSVWiSMBIIPVTGGAABSIK^^ 
SEQ ID 6439 

ATGAGTTATCTAGATC&ACCATTGAAGCACGGCT^ 

GCGGCGGTAGCGGCAGCGGCTACGGCAACGGCAACGGCTACAGCAACC^^ 

C^CGGCTACAGCAACGGCAGCGGCAACXJGC 
SEQ ID 6440



SEQ ID 6441

GTGAAATATTACGGCACGGCGGCTTACGGCAGCCCC^ 

AATCCGGGTTGATTGAGATTGCCAAAAMTCGGCGCGGGAGTTO 

TAAAGATACC6AA6CC6C6AT6AftCTA0G0CAT^^ 
SEQ ID 6442 

VKYYGTAAYGSPWCJMSRyYAREDMRQAL^^ 
SEQ ID 6443

TTGCGTTTMGTTC^OTATTGGCGGCGCT 

TGGCGSTOSCTCAAAAGCAGGCGCAGGTTGCGG^ 

GCCGGAGGGTAAAGAA 
SEQ ID 6444 

LRLSSRYWRRmRGYAFGPAKGGGSRTA^ 
SEQ ID 6445 

™CT(X}6TAC(^^ 

GCATTCAGATGCCGCTCTTCCAGCG^TO 

TTCXXm^CXTITOmATCC^ 
SEQ ID 6446 

LVRTALFUKRYKRl^^ 

SEQ ID 6447 

TTGCCXXX5AAMTATCCTTTAG 
CCGCTGTMTCCTCGGGACCGGACAACt^ 
T . 
SEQ ID 6448 

SEQ ID 6449 

ATt^TCAAMTOAAGCOTa 

AAAMGAA(XXmCCCGCACAACA(X^GGGCGOT 

AGCCACCCT«XXX5AAGACGCGCA^ 

(^CXXX^TCGAT^^ 

AAAACAGOCGCGTGAGCGAAGCTGAAATTGATGOCTI^^ 

AGCCGACAGCAAAAAamCOTO^ 

ACATCATCAAATTGAACGAAGTGCGCGATGCC(K5^ 
SEQ ID 6450

MMRIKALMIAAAliIAAADVHAAPQK^ 

BAEIDAWAQSPALKNIiSPTQRREIjADNIIAEKVR^ 

GNGGDI/^AIXJVMVPAFEEAVHALKPGQVGA^ 

SEQ ID 6451 

ATGAGCTTCCATCCCGAAACC^ 
AAAAACAGGCAGAAGGCCACATTAAAAGCATTATCAGA 

SEQ ID 6452 

MSFHPETAYNGGGETEPYGPS PEEIKYKQSPETAETRKfTEKOAEGHIKSIIE 

SEQ ID 6453 

ATGCCTOCGCGGCG<^ 

GGTCCGTACGGC^^ 

ATTCAGATGCCGCIOTCCA(^^ 

SEQ ID 6454 

MPCAAM^DlftFNVAFa^^ 

SEQ ID 6455 

WGCTTTCGTCTTTCACACTTAAM 

SEQ ID 6456 

LLSSFTLKINDIFGIiTiTiKVVSSIGI^FYALRGGVI 

SEQ ID 6457 

TOATTTTMGTGTGAAAGACGAMGCMTTTCAAAC 
ACXGGCAGCACAAGAmTAGTGCAGTGGGAAATCG^ 
MGCCMCTGGCMCAGACCTCAAMGCTGGCGCGGM 
GAAAGCAGCGACGGCAAAACCACCTATGCCMCATTTCX^ 



SEQ ID 6458 

IjILSVKDESmPCPAGSHHATCIKIIEU?I^LVEYQNBQKRQm 
ESSKJKTTYMISAISNKMKSYTPKHPDTO 

SEQ ID 6459

TCACCATCGGCAGCGACGACCTATTCAO^ , 
SEQ ID 6460 

LQLLRTATEKGNKKMANIDLTQTO 
SEQ ID 6461 

ATGACCCXXX^CACACOTAO^ 

MGCGCMTOGTTATCGCTTATATTAAAMC^ 

AGACTACTTGGCGCGCAATATGCAGGC^^ 

CXXX£CGAATTTATGCGTGAGG(XX^ 

TTAGA 
SEQ ID 6462 

IffALTLYRCMDVOAGUJYYFDSETER 

PAEFMRKAVKTEPDKTAIRKAIESGRQVAGAKIEGRKNLQIR 
SEQ ID 6463 

ATGAGCTATTTGGMGATGTAAAAAACGCATTMG 

AAGCGGATACCTCTTOGAATTTCTAMG 

SEQ ID 6464 

MSYLEDVKNALKVII^^ 

SEQ ID 6465 

TmTGAAC^CCGAAAACCAAGACTGCGOT 
MGCCTACGAMCCTACCGCTTTGATTTGGC 
OTTCAGCGCGCEACACGCCATACOT^^ 
GCCAAMCCGCCGACAGCATCATGCTCGCCCGTO 
TGCGCGGCGAAACGGGCATCCAGCCCAACGTGAMGOT 
AGTCGCCCXXX^K^^ 
MGCTGCAAMAGOTTGGACm 
TGGCGAAAGTGCAAAAITCAGTTGGCGAAGITCAAX^ 
SEQ ID 6466 

AKTADSIICJ^PETDGGEIVQTAFGQMT^ 
KLQKALDKLNAKLSKPGYTEKAP 

SEQ ID 6467 

TTGATTTTCCTTTGCGTGGTIX3^ 
GCGOTACCCTTCC 
SEQ ID 6468 
IIFLCWGVRPmiGRLKDIFGA^ 
SEQ ID 6469

TTGGCTCGTTTAITITC^ _ _ . _ _ _ _ . . _ . , ^v^o^wm-^^^oui^ 

Ctf^GGGCGCGTCOMTC^^ 

AGCATCCGF(XXXX^GACTATACGC(X^ 

GATTGGGCGGAmCXSACCAGTCGGGOSA^ 

GCGAA(XXX!ACMCGTCCGTATGGAAACCGAACMGGCG^ 

CTGTTCCCXX^GAGATGCCGGCTGGTATGTCM 

AOmTGGGC<KACTTO^ 

AOTCGATGCCAOTTCGCCOXXm 

TAAGAAAAGCGGCAGGMCMCCGCTATCAGGCAAAATGGCAG^ 

TTTTACGGC(XXXjAAGAAATCG^ 

CXXTGGCAAACC^AAGCGGCTACAAAGACGAACCTTAC^^ 

CAGCCACGAOSGCCGC^ 

CCATCGAGCCGCGC^^ 
C«mC<&CC<XATC^^ 
GA!K^TGCGGTGATGCTTGACGGC^GCGTC(XK^AAM 
ACCAAAACGACAAACGC(X!CGAACAT^^ 

ATGCTTGCCGGTGCAGAATACAAAAGCAGTT6CGGCTGOT 
AAGACCraGCAGCGTCGGCAGAAATCCCGC^GGC^^ 
SEQ ID 6470 
LARLFSLKPLVLAI^FCFGTO^ 
DHADYDQSGDTVTVGDRPA^ 
TPWADFPLDGNRKSG^^ 

FYGGBKIAGNVNLNRRVWLDYGGRAAGGSLNAGLSVQKY^ 
IflSFGGKASRSVGKVLFV^ 

MIAGAEYKSSCGCWGAGVYAQRYVTGKNTYKKAVFPSL^ 
SEQ ID 6471 

TTGACGGCGCGTTCCTGCCGCGTTGAJTCOT 

TCTCAATG(XX^TCATAAAGCTCTA!I^^ 
SEQ ID 6472

LTAESCRVDSF AARMAS I CF AVGFVATVC I LF SRFFDAVLSMPNHKAVSLTR5 PHLN 

SEQ ID 6473 

ATGTTGTCTGGCGCAMGGTATCGGAAGACGAAGCCCT^ 

^TCAAACAGGTGCAACGCTTGGGCCTGTA 

TGACMTTTAA(X:CCmTCAGCGCCAACTCGT^ 

TCATTGGCXJGTGCGGGTTGCCAAGAAAGCCATGGCGTTT^ 

AAATGCX^GTACGCGGAATTTACAGGCCGGATCCTACACGOT 

CCCCCC&CGGATMGCX3ACGTO 

GCCGOTTCAGCTGCMH^AOT 
SEQ ID 6474 

ffljSGAKVSEDEALTCGIMMRLSIjQM 

SIAVEVAKKAMAFLELDCMIJSAVGI^ 

AGFSCPNWTEHFKRTANUSUIHREAAA 
SEQ ID 6475 

ATGAGGACG^TCCGAAtt 

AGACG«»AACCGCAGC<£<^G^ 

CTACGCCGMTACATCACGGGtfSGCGAAC^^ 

CCGCTACCGGATGCCGAAOT^ 

SEQ ID 6476 

JffimRimHDSGTKGSOTASGISGTAAGAABQAFFSAACT 

PVPDAEARRKRKKQAA 
SEQ ID 6477 

ATGGATAGCGAATACCTAGACCCGCAGCMTGCGC^ 

AACGTTTTTGGTGGAAAAW 
SEQ ID 6478 

MDSEYU3PQQCADILSVKKRTPLERYAPRTO 
SEQ ID 6479 

ATGCC^GGCGGACGMTAGCGGATTTTACCG^ 

MTTGGTAGACGCACCAGATTTAGGTTCTG^^ 

TGCAAATCACATCATCXXSGCGTTTTTCTT^^ 

GC6TACCGCTCTAAAAATGTAC6CTTTTO 
SEQ ID 6480 

MPGGRIADFTAAKHKPQI^ 

AYRSKNVRFFTDKISAHCCGSRYSLSI 
SEQ ID 6481

CGATTGACCGTTCTGTTTTGTG^ 

TTCACGCTCGACAGGTGTGCX3GA6CGCGG06GW 

GATGGAAATATTAC^ 

CTTCGGCGGCGCAAATTO^ 

TCTTGCCTTGGCTTTGTCCGAAC^^ 

SEQ ID 6482 

lgmpaahki}fiiltqglklpu)evrmyltaqtvmm 
dgnitdvslacrsaqs(3wmnvasij™ 

SEQ ID 6483 

TTC&GTGAAMTAAACGAGCCAA^ 

SEQ ID 6484 

LSEKKRAKIAPQVGLPVRIWIVTRNARIl^^ 

SEQ ID 6485 AMm^ 
TCGATGGTTTCGCCGTGTTGTTCCT 

GGTGTGTT(X3A(MTAGGCCGAMQjAGOT 

SEQ ID 6486 

VFCXQGQVLFtmVCA^ 
GVPEVGGKBSIBQNGWFIiLADWRKGKRSHGMPSEKJfGL 

SEQ ID 6487 

ATGAAAATGTCmAmTOC^CCATGTCG^ 

GCMTTGTTCAGACGGCMTCCGGTGTTGAC^^ 

ACAGGCAGAGGGOTCGGCAT^ 

TTGGCACTXXKSGCTGCCCGTC^ 

AAGATGCG6GGATTAACGTGGTCGAACGCA3^^ 

SEQ ID 6488 

MKMSELLDHVASCRLPTEWGVFTffl 

LALGLPVDARDFRLAQS IYEYLG IRSVKLLTNNPEKIQTLKDAG LNWERI PLHVGENLENERYLQTKADKLGHU<SE 

SEQ ID 6489 

TTCATAAAACmTTGCT^ 

GTTTGGAGATAACGCTCGra 

CCGA6CGGATQC(XA6ATATTCGTAGATA6ATT^ 

GGCGCGGATTTTGTTMTCAGCCCGATGCCGCGTC 

TTTCTCGAGAAC^GCKGTCGCCCGrcAGA 

SEQ ID 6490 

LIKLIAI^SYGNl4FPERRRi^VPCOTAIiFRH 

GADFVHQPDAASFLTQimDAAPLCLYRPHCRPKiaSAVAF SREQRYARQTFGVDARQHRIAV 

SEQ ID 6491 

TTGATATTTATGATTFTCTCTATCATCGTCOT 
ACBGTTCGCO^^ 

SEQ ID 6492 

LimPSIIVPIYNVEKYIiRCCVDSVI^ 

SEQ ID 6493 

1TGATATATCCAATCCTACG(XXOT 

TCATAATCGGCAAAATTTEC(^ 

TTOXOTATGOTCAAACCC^ 

SEQ ID 6494 

LIYPILWnfAWOTYMRIFACIFVANLPM 

SEQ ID 6495 

TOGATATATCAAAGCTGACATTTGACGCATCCG(^ 

(&TATCAGAATCK&ATATC^ 

AGAAAGCAGGTTCMGCGAACCAAAACAGACAAACAGGATGC^ 

AGOTTOACGGATGACC^ 

CCATGAATGAACAGCTTGAAGTT^ 

COTATTCTTTCATCATCTAA^ 

ACCMffTTTGGCAACAGGAMmC^^ 

TCATOTCGCAG(^TOATGCGTAAACTCC^ 

SEQ ID 6496 

LDlSKjTFDASftKVGKTEHSAKFDl^ 

RLSEMTMYAQIKSECAAMKNRHHAAKDEEAAKAYAQ^ 

TKFGKRKLRAVLFMPAMVAYRIRAFPDFIKR^ 

SEQ ID 6497 

TTGCCQCACCGACAAAAATGCCGTCT^^ 
CGTCTGAAAAilTrAAMGTTCTAGATTCCCGOT 
SEQ ID 6498

LH 

SEQ ID 6499 

AAGCCGTC 
SEQ ID 6500 

LIAAPTKMPSETANGLQ^ 
SEQ ID 6501 

AT6ACTGC ^^ 

TGTCCGACGACaraCACm^^ 

TCCCTGC ^^ 

TCGGGCAGGCG 
SEQ ID 6502 

OTALTLPEDIRQQEPSVLLTO 
SLRKQFPDFEEIPGAYYE^ 
SEQ ID 6503 

ATCCAACGGCAMCWAACTGAAAAATO 
ACGGCGGCAGCGTCGTCTGTATGGATGCGC^GCCCGACAAGA 
TTTGGGGTTTGTGGTATTGAACGACTTGG 
TTC(mAGGCM6CCGTGAAGGGGTm 

AACAA0GCCAA ^^ 
GaXXmTTAGACTTCmGA^ 
TGGGAAAAGGCGCGGGCTGCX^GGCTTGCCTG^ 
ACTACCGCGACGGCAAAGACAAATACCXjTCCX^AAATCCC^^ 
CGGO&TGAAGAACTGGAA^^ 
SEQ ID 6504 

UJKASREGVLPEYDRETbMEINIiFPEWFVAKEWRELTFK 

WEKAWiA(51iPVPAEFDEFYI«^ , 
SEQ ID 6505 

ATOmCACCCtm?^^ 

AAACOmACAACCGCATGGA^ 

GATACACGGTTCGAAGCGCAACGGCTGGAAAACCGCCGCXaAAC^ 

AAA(KlACAC(K^GGACACGCGGAAAACGAACGra 
SEQ ID 6506 

SEQ ID 6507 

TTGAATATGAAGGAGCGGATAGTGGGTCAAAGTGGCSAGT^^ 

GQGAATGGGTCTGOTGAATGTGTTG^ 

GGTCAM(mAAAC(MACGAT(XXGAMTCCGTTO 

^^CGTACCGGCGGGCAGGGCXnTCGATATGGGCTGCGGACAG 

TGGCGGMCTGGCAGAGGCGGAGGGTTTGAACGTCCGCACG^^ 

GCO^GCGCXnm:C(^CGTGATTGCCG^^ 

AAATTTAMGAGGGCGAGCTGAAGGATTATTATCGGGATTGGGA 

TAACTATGCTGGCGAAAAAACCAGGA 
SEQ ID 6508 

LNMKERIVIK^ELFCFGQMPVWKVEN^ 

TVPAGHAUMGCGQGI^ 

KFKEGELKBmimVEYKEEL^^ 
SEQ ID 6509 

ATGCOTKnXSAAAACCra 

GCCGTCTGAACC6CGTGTTGCQG(X^GTA 
SEQ ID 6510 

MPSQffiSAGPVVHYYVVLQDKAQQ^ 
SEQ ID 6511 

ACATMTAATGCACCACAAACCCCGCGere^^ 
SEQ ID 6512 

MmGLYSPQHAVQTASPSFFIISCYDAPRRLAL^ 
SEQ ID 6513 

ATGACGMTGAA(^T«X^GTTTATA 

CAATACGACATAATMTGCACCACAAAOCCC(X^ 

CTCTTTTGTGCAGACGCT^^ 

ACC(XX£AAMTACCTTG<XjCGTGTC 

(X^CGGCGGAAATCATCGAATGCGTTGCCGAC^ 

TGCGGCAOTGACGCGPT , iyy i *I<2r , PGM «rwnit^1inoA<vvw» n .inim»,. • . . ~ — . 
CGCGC?PTCCGACTGCCCACACGCACGGCSO
QSCGG^TCKWftCCCTT^^ 

ATCTTAGGGGCTCTACTAGATCATCOCTMtf^ 

" V- .wwiMWHIIBftTf f .1 



mswnMMTFBIWIAA^ 
AHI<^FAKGn»ffH/WnADM^ 
S A^KI^^ 

(aCACCGCCKS3GCAGCTAAAAMCCTGCQGGATOSG^ 



MDKPGKDDVYQUKSGKLVSEDGTSBERCIKVE 



GGGCTGAT 

TAim 



GlfflRHIiQSSYASEDYVGSGKILRQGGRCLFRliAQEGKQGIiL 

CAGOTCAACGGt 
SEQ ID 6524 



S ?^aSitox3Ccgmgcogctai^ 
SEQ ID 6526

MPICTPPKPVGGWKmAA^^ 

NGRQLYFEAQRSSMHTMRQNWQVNKGVRSMF 
SEQ ID 6527 

"nJGCCOTTTCCAATGGTC^ 

GGGCGGATGCCGTC 
SEQ ID 6528 

LPLSNGLDSMifijNDDLEITR^KTTETEQA 
SEQ ID 6529 

CGTGCAG(K^GG6AATCTA(1ACCATTGGAAAGCG 

GAGCAACAGCAATATTCAAAGATTArcT^^ 
SEQ ID 6530 

VGFSPPIgZASAPPSFPR^ 
SEQ ID 6531 

ATGGGTOXXXXjGTGATGAAATGGTGGATTG^ 
SEQ ID 6532 

MGRKVMKBftlVPQPAAEMPSEGRFCW 
SEQ ID 6533 

GTGAGGCATCTACGAAAGTGC^TGAAAAGCAA^ 

TAATCACGCATCXEGGCATTCGATATOT 

CAGGCAGTACGAATGGTACGGAACCGGTTCGCCTGG^ 

SEQ ID 6534 

VRHVRKWSEKQLYFPPRRAKIPKL^^ 

SEQ ID 6535 

TTCCGTCTGTTTCCAKCCGTGTAAAA^ 

TTCTCGGTTTGGGCGMGCGTTCAAAGCCGAAAC^ 

AAAACGOTGTTGGAAAGCGAAACCAC^ 

CX3CGCCAAAACAGCGCAAAG0CTTGGCGGTACAGGCG(^ 

ACGCCATTCKXAAAGCXXsTCGGTATCCMGACAM 

CGTCCTGCTGCACGGCrGC^^ 

CAAGGCTTCGGCAATGGTTTGGMGMGATGCCT^ 

TCGGTGCATTCACTTTGGTGGCCGAAGATGA^ 

ACTGGTGTTGAAAMTGATGATTOjAAAGCACAAT^ 

CMGACTTTGATTTTATTATTGMCAAAAC^^ 

ACXSTCGCCGGCATTAOCGACGACAACATCGAT^^ 

SEQ ID 6536 

LRLFPARVKQRKIGKPELSSVPIKEQAMFF^ 

KAKTAQSLGGTGALRIAABFAKRQIJJ^ 

QGFt^GLEEDAYGLRVFLKH^^ 

QDFIFIIEQNC^SPSGLTPEQVDRLKN^ 
SEQ ID 6537 

ATGAAATCCAGACAOTCGCO^ 

CCGCCCCGGCAGGACTGGTCGAAGGGCAAAACTACACCG^ 

CCGCCTOAACCTGTTTTGAGCAAA^ 

GATATGGCTGCCGCCGAAAGCAAAGATGTGGCGM 

CCTTTGACGGCAAAAMGTOTTGCCGC^ 

CGGCAMTATAMGTCGAATTTGCCGACTGGGAGT^^ 
SEQ ID 6538 

MKSRHLALAIXjVAAIFALAACDSKVQTS^ 

EMAAAESKDVANSHI FDAMVl^KIKLQEPEVLKKWLGE^ 
SEQ ID 6539 

GTGAAMTCCTCATTOAGGCAACGGGCAGGTAGGTOT 

GCAGCCGCCTCGATGTCCAMCTGTTTTCGGCA^ 

CGTC3GCCTGOUAGTTGCCGCXX3ACCTG^ 

TCCATATTCGOIATAACCGAAA1XATCAG 

TGGTCATCATACAGGCGC(K3CGCGGCGGGCTG^ 

CA3TCTCCCCGCGaxm^ 

ATCA!TCATTGCCGGCGGCGGCMCATC 

ACAACAaxrrcxmxrrGc^ 

OTTTTGGCGAAAAACCTCGGCGCGAAGCGOGTO 
(OTC(^TACTC(XX:aCATCCGGCGCGGCGACA^ 
GCAGGCGCA!K^GCGGCATQJATGGCC^^ 
CAT^TCTmTCGTCTCC^ 
SEQ ID 6540 
VnULGNGQVGSTVAQhnMITNNDVTO 
SIFGITEnSPEQOTEQLAGUrcPGALQVI^ 
MA(miGYRI^QLimyNmiECRPRRAEWIAm
GSILAHIKRGDIVAVHPIRRGTAEAIEV^ 

SEQID6541 

TCA(mCCGOTt&T^ 

GCCATCAAATCTTCCAAAGTCTTGGOT 
CCACCGCCCCGCGCAAATGCXm^ 

SEQID6542 

VASWKTtmiil^GKIIjTGRDAAHmVNM™ 
AIKSSKVLAFPELC^VYEFEVK^ 

SEQ ID 6543 

GT6GATQGATTCGCOTTGCT6TCCACGGCGAC6OT 

TTTGCCA(XjAGGTATGCCGCGCCGCCGACTGC^ 

GTTTGAGCATTTGGC^ 

ATCGACGGGCAACTCCTCGCCTTTGTCGAGCA^ 

SEQ ID 6544 

VMFitfAVHGDGYGHIFDFKFVNGFHTQFGERQDFGR^ 
nXSQLLAFraiDEAmGIAAGEDFAVQQQYVAGFPACHm 

SEQ ID 6545 

TOTCTTCACCOTACAAACGCTGGCACTATTT^ 

CCGmTCAAGCAAt^GACTTTATTCAAM 

CGCCGCCAMGACGCGATGACGCAGATTTT^^ 

CMTGGGATGCGGACATGAGCGTGGAmGATGGTTAA^ 

ACACX^lAAGACAACACCCCXGCCt^^ 

OCCTTCCGACMCAT^ 

GCGAAAGAATCCCTGATGAC^CACATCGACATCm^ 
TC(mTCG(XXX^^GGCTT^^ 

CCACGTCGMTTT<^TTG(^ j 

SEQ ID 6546 

LSSPYKRVHYFNLIKNSI£KGCIiKQQQ^ 

QWDADMSVEKMVNEGVRRAYTTO 

AKESUISHIDIQELQKKAASGAELSTTEALRLELFEKVNAUj 

SEQ ID 6547 

TTGTTCATCAACCGGTCCACAATGAACTGCCTGC^^ 

SEQ ID 6548 

LFINRST^LLISFyRKATGKDYKYQNPNKTVFY 

SEQ ID 6549 

TTGATGAATATGAAAAAATGGATTGCCGCCGCCOT 

GOrrGGOTCCMCGCCGAGTTTGOT 

CAAACACEAGCCGTGGGACAGCCTOTO 

TTTGAAA1CACCX3AAGTCGTCCTCGTO 

MCTCTTGG^MCGACMTCCGAAAATCGCG^ 

TGTGAAAAACAACCCGGCCAAAGGAATGGACTTO 

TTGGAAAAAGTACGCGAMGCGGCGMTACGACAAGATOT 

SEQ ID 6550 

LMNNKKWIAAALACSALALSAC^^ 

FEITQVVLVPKGKKVSSSEDIiKKBNKVGVVTGHTGD^ 

LEKVRESGEYDKIYAKYFAKEGGQAAK 

SEQ ID 6551 

ATGCCGTCTGAAAAOAAGTTTCA& 

GCAAATTCX^CTTTATATTTGCCGCCGACGATAACrc 

C3AGGACTTTTTTGCCGTCAAAGGC 

TTTGCTTTCGGCGGCAGCXIATATCGACGGCGGCGGCGA^^ 

TTGCTCAAAACAGGTTCGAGGCGGG 

GAGGGCGAGGTGTCTGGATTTCATACGGCTCTCCGT^ 
TCGGGAAACCGCTTCAGACGGCATCAAACC^ 

SEQ ID 6552 

HPSENQVSIEIFQTGLIiRGFLFFTYFVRQK^^ 

FAF(X3SHIDGGGEACQRQHFFIiPDDWGTQVHIVFKRI/^ 

EGEVSGFHTALRDVGNKSDIGIL^^ 

SEQ ID 6553 

TTGTACGGCAG(XX£CGTTOCCAT^ 

GA3TTTCCGACTGCCAA0GCGTTTC 

CCG(XXXOTATCTTTGGTGTCCGCAC^
CCAACCCGA1TGAGAAAGTGTGGGCGMTATT

SEQ ID 6554 

LYGSBUiPC^QTDADGAQKKTAAYK^ 
LAQKSVIISI»UIRFRBHGALRGT 
SEQ ID 6555 

ATG(^GTATCGAACA(XX3TTTGGM 
(XXjAAATCGCGGTCGAGTCGATOA^ 
GGAACGT 

SEQ ID 6556 

H3SIEQRLEYLEEAKDVLRM^ 

SEQ ID 6557 

GA 

SEQ ID 6558 

MMNPYESVKMRHYIKLR^^ 

SEQ ID 6559 

ATGGATACAMCMTCAGGATAMGAGCCGMTTGGGA 
GAAAGGAAGAGCCCATACCGAAATCGGACGATGC^ 

AGTGCCGMTTTGAAACCGCACAGATTGAGGGCGGGGAAMW 
CCGTCO^TCAATATOCTGAACCCGAAAGCCG^ 
CAACGGCATGATCCGTACGGCGGTGCAGAAGCAGATGGCGC^^ 
AAAATCCTCAACGACATCAAGTACGMTTAAMTACAAAC^ 
SEQ ID 6560 

MW^QDKEPNWEKRLENLRQTM 

SAEFmQIEGGmnAAESGTAADCVAENGGGGAEEAI^ 

KILHDIKYELKYKR 

SEQ ID 6561 

GTGttOCCCTTGm^ 

SEQ ID 6562

VHPLFNIJiIfCACamLSSRTGROT 1 

SEQ ID 6563 

ATGAGCAAAAAAOTCATCCGCAMCGCTCGOT 

ACGCTGC(^TTTGTTTICAAAAAAMTCAM 

CACTTCGACCGGCATGTCCGCGATTCAGG 

ACCAAATTCGGCATCGGAGTGAGCCGCGTGTC^ 

TGGCCGACTTCGAAGCXTTGGC^GAATTGGCGG^^ 

TTCCTCOXXACCAAA*^ 

CCGTTTAACGCGT6GCAGTT6TT6AGCGGCGTG6A 

TGTACTACATCGGATTGCCCGACCATCCGCA 

ACTCTTTCCXX^ACCGCXIAAOT 

GTGCGCCTGTCGGTCGGTTTGGMTATGTTGGC^ 

SEQ ID 6564 

HSKKLHPQTLAIRGGKEQTCYRIIHNQAI^ 
TKFGIWSRVSPTDINEVTCAAV^ 
PFNABQLLSGVETLSLRMEKQFD^^ 
VRLSVGLEYVGDLIDDLKQALAE 
SEQ ID 6565

TTGCGTGTTTCAGACGGCATCGGCGGAAAMTGOT 

SEQ ID 6566 

LRVSIXSIGGKMR^SETLIAYGIHIHNRKKTC 
SEQ ID 6567 

AT^TGAAGCGGTT^^ 

SEQ ID 6568 

MDEAVFADKALKiaTGIJIFtf 

SEQ ID 6569 

TTGCGGAAAGATGATTOTATTTTGTM 
AAACCTAOTGGGCAATACCAAAAACT^TCAATAAC^^ 
GCCCAAATTCCAAAC(&TAAAACC^^ 
CGCAAmCKXTITICAGA^ 

SEQ ID 6570 

LRKDDPYFVimTIJSWRLNQTraVQITra 
RNKGFRRHLPHPSADAV
SEQ ID 6572

LV 

SEQ ID 6573 
SEQ ID 6574 

CAGG&TGACGaGO^^ 

CTTGMGAAATOTG(XATTCG(XATTACCAAA 

GCCMTGTGAAftCGMGTTGGTTGAATTGMOGGTACAGGCTGC^ 



ITCAKKXMMATAGAAATACCCTTGGATG^^ 
TAAAMCGTTAATCTGATAITO^CGTTAAGCTWITCATG^ 
AAA(H3CGTTOAAGGTOTCCAAAAGGTO 
CACTGTTOATGAmGATOGAAia^ 

SEQ ID 6576
IEEILGPGITKKLPFKGKFFYQSCYQrcPIfflVByGKVHyGGQCETMLVEIJ^^ 

KGVEGTPKGLQPEQpnXatNQTQRFQYIHEEQKAIHAliHIETIf^ 

SEQ ID 6577

MTCAAACaXWflaX^^ 
ATmCAGGAAAAG«mGT«^GAAATTGTOGATTTCCM 

SEQ ID 6578 

HFETSQ 



SEQ ID 6579 

ATGMS^GATAK^ 

GCTATGAAGACGCTTTTAGCTCGGCATTGGAAGAG&TAGGC^^ 

SEQ ID 6580 

MSRYQQKFIVQEIjENHEPIYPK 
SEQ ID 6581



AAGCCGCAAAAGCGGACGGTAT 

SEQ ID 6582 

MKFDTCOTCYGAKIAVVTAAP 

S aSS™tcacx^ 

(XIGTTTCGGGCAACGCTGCGTTTGCC^^ 

GAAATTTTTAAAAAMTGTGTTTGCGGGCTTO 

TTCCAA5WX5AGCTAAMGCGTOT 

FQCRAKSVPIAVIPTG 

^ CAGGMAGC^^ 
CATTTATCCGAATOCCAAACCOaT^ 

SEQ ID 6586 

QDSDS*QIflmPPB3IiPRQKH^^ 
SEQ ID 6587 

GTGCT^^ 

GGCAGGTGTTMTAAATTTCATGATATTTT^^ 

SEQ ID 6588 

VUOTFISIMVHAAIAH^^^ 

SEQ ID 6589 

ATOGMACCGCACAAAAACAA^ 

«ICGCCC<^TC«^A<3AATCI^^ 

ffiTACGCGGCGTAAAO^
T0CGCGOCmXm<m^CAAAAr^^
ACGCMCATCATG«m^ 
CTTTA(XTGACCGGCGTCGACTATCCX^GGGGCA 
SEQ ID 6590 

SAPVRSPIMNRAQfTHLEIiDIGPMRRAAALLVGEQDFSSFRM 
LYLTGVDYPGAYGIVRPQIPHJIi 
SEQ ID 6591 

ATGAATATGCTTGGAGCTTO 
TTTTTGTCGCGTTCAAACTGCXX^ 

GACAAGTTCCAACTTTC(ATCAGCC^^ 

TTAC<XXX»CGTTTmA^ 

TTTCCAACTGCCGTGGCTG<m^ 

TCGGTACAATTTTGCTGCCGACTTT^^ 
CGGACTGGCGGTATTGT05TTa^^ 
TTAA*n&TGATTAAAGTGTTGGCATCCGGCTTTT^^ 
CGTTGAAACACGC(MGGCWTC6CTCGCCATCGGCX^ 
CTTGGCGAAAATO^CTCGC 
CTGATTGCCGTCGGCGGCGGACTGTATTTCGC^^ 
SEQ ID 6592 
MNMU2AIAKVGSLTMVSKVI£FV^ 

DKFQLSISLIiRITFraMSLSSFVGSII^SYHRFGIPAFTPTnia 

AQISLVINTIFASYLQSGSVSWKYYADRMM^^ 

LIMIKVLASGFYMQNIKTPVKI^ 

LIAVGGGLYFASLAALGFRPRHFKRVKJ 
SEQ ID 6593 

ATGTTGAATCCATCCCGAAAACTGGTT^ 

AGGAAAAAATCATCCGTCGGGCGGAGATC&TCGACAG^ 

CGCCGGATTTTCAGGCACTTATC^^ 

CTGCGCGTGAAAGTGGGACX^TTTTTCAGCAG^ 

GGAAAATAGGCGCAACGGCGCACAGCTTCTGGCTCTGCACGC^ 

CMTGCCGCTTCGCTACGCGCGGTGGAMTGTTGG^ 

CGGGCTTGGTCGGGGC5X3CTGGTOGGCAGTATC 

AMCCTATTATCAGGCGGTCATCCGCCGCTGGCAG^ 

GCTCATGCTGGAGACCGAGTGGCAGGACGGCCMTGGTTO 

M(^GAAACCGGCGCAATOCmTC«^ 

TGGCGGAACAGGGGCTTTCAGACGAC^^ 

AGACCAA 
SEQ ID 6594 

MLNPSRKIjVELVRILDEGGFIFSGDPV^^ 

LRVKVGRFFSSPATWFRGKGPVN^^ 

RAWSGLLVGSIVCTGILPRLIiAOTCRIU^ 

KQKPAQLIJGVRAQTVPDRGVIAQrVRIiSEAAQG^ 
SEQ ID 6595 

TTGAT(K5TTCGGATC(X;GTCTGAAAGGTGTATO 

AATATATTGCCAAACTGCAAACCAAACTGCT^^ 
SEQ ID 6596 

IflVRNKERCIRKALHSGTLANAIMl^^ 
SEQ ID 6597 

ATCXXXftTATCCTTTlTO^ 
SEQ ID 6598 

HPISFFGKRFFKAYIBARIARVRTGBTGCRimAV 
SEQ ID 6599 

ATGGaTACTCT(KmCTTAAGAAA 

ATCACXTCAGGGMTCGTTCOT 

TCAAGAAACACGCITEACCTGCGGAT^^ 
SEQ ID 6600 

MAYSAIMNKAUKSCHimKT^ 
SEQ ID 6601 

ATGAAAAMTCCCTGATTG(XXTGAOT 

ACG<mGGTAACTAMG^^ 

TGCCTCCGTCCXXXm^ 

MCGTCMTGCTTGGGAATCXXKJCAMTOA 

GCGGCAGCGTACAATACGCACCTAAAGAC^ 

CGGCGAAQGCACTAAAAAAATCGMTA(XATGGTCAAAOT 

TOCOTAGCXXCACAACAACAAGATGCC^
TTTCTTACGCG^GHjCT^

CGGCTGGITGCMGAAGGCAAAGGCGCAGACAAAATI 
SEQ ID 6602 

MKKSLIAimALPVMMADmYGM 

NVNAWESGKFTGNVLEISGMAQREKRYLSVRITOSPEFAG 

SVMQQQDAKLYGAMSGNSHNSQTEVAATMYRFGNVTO 
SEQ ID 6603 

ATGATTTTGACACC(XXmCAC^^ 



GCTCGCGGTACGCAATCGGGGTTGGCACGGC(X^GTCGT^ 

TTGGAa(^CTaCCGCGCOTACCGTGAAA 

CCGTATCCGOTXX^GTTGATCCAGAGG 

TTTW^CGGCATTTGCXX^ 

CTGCTCAAACACGTflXO^ 

CACACmXXMC^CGCCGGaTO^ 
SEQ ID 6604 

lflLTPPDTPPFLRHGHADTIAAK^ 

LDTLTARYREIYAVCTSLGGNAPAKYW 

IiLKHVAKPLLLIJiAANDPFliPPEALP™ 
SEQ ID 6605 

ATGAGCGAACAAAACCAT^^ 

AGCGCGACAGCTTTGCCGCCGAOTGCAGGCGCM 

CAAAGCCAGOTCGCaCCATTCAAGACOT 

AGGAAACCAAATACCGCCAACGCTACGTCGATTTGATT^^ 

TCTC(MGTCGAAACCC(X^mTGCACCC^ 

CTGAAACGCTTGGTGGTCX^GGCTTGGAACGCG 

TCTCCGACTAC^AACGTATGATGCAGATGGCGGAAGACATC^ 

CXsAACGTCTGACCAITCCTCGAAGCCAT^ 

ATACGAAAC^GGTCTC&CCGM 

GCAAAMGATGCGGGCGACGACGMGCCATGCACTACGATGCCGACTA(^ 
CT^CCGATTTGCAAWXA 
SEQ ID 6606 
MSEQIWPQTBPQIjDENQIIALRREK^ 
GAEGTLFKTNHGELTVRVSDIRLLSKSLRPM 
imVVGGLERVPEIKRSFRNEGMSVRHNPE 

SPGIGSLQLALFEGCAEGKLWNPTF IVDYPVEVSPLARASI7PKQGLTERFE1JW6RELAN 

LTDLQTIRDVILFP0IRPE 
SEQ ID 6607 

TOXXX&TGGTTTISTTCGCT^^ 

TCCAAGAAAGGGCGATGOCGTCTGAAAATCTTGCA^ 
SEQ ID 6608 

LRMVLFAHRVFRKNKSGAVCFRR^ 
SEQ ID 6609 

GTCGTGGATAAAACTTACCCCGTGAAGCTGCGACTGCATM 

CTACOTOMCGGAATAATGCTGAACAGCATATGATO 

CCTGCAATGTATTGGTCA<X^^ 

CATCCGGGGCGAAACGGCAOT^GTTOTCAA^ 

ATMTCGCGTCMCGTCCCXX3CGTTTGAGCGA 

GGTATGGCGCMCCGTAAAAATCAG(XXGAGTTTGMCGCACC^^ 

GATACGTTCCCCCGC^^ 

SEQ ID 6610 

VVDKTYPVKLRLHNlttlW 

HPGRNGIQFLK*MPFITONHEKRra 

OTFPRttSRNNIVAAPVKQKD^ 

SEQ ID 6611 

ATGTCCGAACAACATAmCO^^ 

GAAAAGCTGOGTTTATTTGTGGGTT^^ 
TTTGATGOm^CGGTTGCCGCC^ 

CGCGCCGATCGAAATTKX^^ 

TTGGCTTTGGTTTTCCTO 

GCTGGTCTCCT^^ 

TAAAACCGGCAACGCCGAGTTGTTTGCCKK^
mGTGTCCGCCOTGTGTTC^^
mCeOTTTCTTCC^^ 
SEQ ID 6612

MSEQHlSmSKINALGPGIMlfl^ 
FDAGTVAALIMASCIJILVSGRYRA^ 
LMjVFLAI£AFVQYGNGEAVQMA(£ 
FVS APW AWLNYRLVKGDKRHRI^^ 

SEQ ID 6613 

ATGTTGTTCGtoCATAta 
AASOTACCGTOT^^ 

SEQ ID 6614 

MIiFGHKVFRIFNCVSTHRAAYSDTAI^ 

SEQ ID 6615 

TACCTTTTTCTGTTCCGGCAAAGTMTGTTC^ 

AAAGTTCGATCAATGACAT 
SEQ ID 6616 

SEQ ID 6617 

TTGGGCGGCAACTTCCTGTtOT^ 

SEQ ID 6618 

LGGNFIiCI/^KDFIEDVGHVVFRICPGKGGIIiASNR 
SEQ ID 6619

ATCAGGCGCGCGCATGGGGGGCGGATGCCGTCCTGt^^ 
GCACGACGAAACCGMTTGGAAAMTGCCGCAACC^ 
CTGGAAGGCAAAACCGTCGTTACCGAAAGCGGCATTACAGG^ 
TOGGGGCGGAA6T6GGCAAACTOTC 
SEQ ID 6620 

imTDEPYFQGSPEMQAmVLLPVliI^ 
LEGRTVWESGITGKADVKFbSRARGVHTFLIG 

SEQ ID 6621 

TCCTCCGAACflG&CGTTTGAAGAACCT 
TGATGGATGACCGGTTCTCCCGAATTGGATGCXrre 
TTGCCACACGTTCGGGAGAATMCGCGTAATTAA^ 
GGAAACCTCAGGATTGGTCGTTACMTGGCTTCAT^ 
GACAGCTCCTGCATCACTTT^ 
GCGTCGCCTCACCCTGMTGACATTGATCAGGTCATA^ 
TAATGCCAMCCTGTTGCMTAOTGCACTGCT 
SEQ ID 6622 

SSEQTFEEAPPIjSFQEfrHFTVIiAQKTGNNVLICl^ 
GNLRIGRYNGFISHQRHQCTLUTC 
*CQTCCNTCTCSRFTDTAIA*SYYWffl 
SEQ ID 6623 
ATGCXyPTCTTTTGGMGTMTACGGAAAATAM 
(XjGCXXKT&TCCTGTOT 

SEQ ID 6624 

MPSFGSNTQnVRFEHEVliIVC^ 

SEQ ID 6625 

CIOTAGCCAATTGCACTGATM(^ 

AACTCCX5CCTGCATGGAGTGGATAAATA^ 

CCTtKXmCCGCCEC^ 

(^ceimccTGAacAcrrTcm 

CCATGCCimAAACAACGAAACCAATGCArc 

(X&(£ACATCTTCCAAATCAAACAM 

TGTMGTTTACCGCGCAA 

SEQ ID 6626 

L*ANOTNQSVIIiQQYLYRIIiVQMVRQILFKI!TO 
HLNLTQFIWIFRMRQra^ 
CKFTAQ 
SEQ ID 6627 
TTGCAGC^XOTGGCGATTT^ 
OTACCTGTCCGCATTGCCKX3AAG^ 
(nCAGGCTTTACCGGCTCGGTCGGCACATT^ 
CAAAAAAGOGGQCM(aWX!GOOT 
(OTKSCCGCCAAAAACATCCra 
AACCGC03Ca3AGAAACTCGCCCa 
CCTTTCAACCCTCTTTT^^
TCGAGCCTTACGCCCAAGTTGCCGAC^
AGGCATCAACCCGTCCACETTCT^^ 
ATCATCGGCAAACKX^CAGCCTGACCGAMTCGAW 
ACGGTGCCCTGCCGCATTACAGCGCA^^ 
CGTCCCX^TCG(X^CGCCCAC^ 
ATTTGCCGCAAAOXCTGTGGaGGCGC^ 
' CGCC(XjAAACCGCCATGAAAAMGGCATGGTTAC^^ 
GGAAACCGAGTTCGGCAGOTCCTC^ 
GCCGAAGTCCGCCGCCGCCTCaOXXX^^ 

SEQ ID 6628 

LQQAGDLIIWLCFSDGISDFRPCRLKPRNLIWS^ 

QKSG$VPPYNEWliAANLP , ENAATC^ 

PFNPVFVSPLLIGKDSAVLFTEQCIUjNAEAAAAI^ 

IIGKGGSLTEIDVDIMLYRHRSARP6FVSLSFDTIAG 

ICRKPI^AQCCYGHGTGHGVSYFLNVHEGI^R 

AEVRKRLEPIjTBGAAKAWLIKKPBPLAR 

SEQ ID 6629 

ATGACGCAC03ACCamCCA(m© 

TGCCEACGGGOXXB^ 

GGCCGGGATTGAAGCGGCGGCACHmaGCGGCA^ 

ACC&CCOTTT^^ 

TTGCCGAACGCTATCCGAA^TCC^^ 

TGACCGCACGMTATTTATTATCAGGTTATCGAAAAAAACM 

AAQGTTGAA6ATGCGGCGCAGTT7FTGCGTGAAAAC66A 

1TATCGTGGTGGCGACCGTGGCGTTCX3GCATO 

CCG<X&CGG<X7rcCCTGCCG 

CTCGATGCCATGCTTTCCGTCTGCGA^^ 

GGTTTGACGGCACGGTGTTGGTGCAAAMTO 

CAACCGGCAa&GO&CTC^ 

aGCOTTGmCTCACCGMrc 

CCGMCGCGMGAACGCCTGTGG^GGCATTGC^ 

AATGCCGCAAMCCTCAACGGGCTGCACGAC^ 

GTCATCCGTCCGCAAACCGMCGCGMCAACMCTGCG^ 

CCGATATGCTTGCCGCC^CGCCXX^ 

CGGCATCGATGAAACCGCCAMCACAAACGCTGCCT^ 

CGCCX:CATTGCC«X^(^^GGMGGT^ 

TC 

SEQ ID 6630 

MTHRPTARQILHEVFGY PEFKGRQKDVINTliAGGG SLTVLMPTGGGKSLCYQI PAIJ4REGVAVWSPLIALMNDQVASLHVAG IEAMVNSGTSADEAREIADKLAQGRLKLLYVAPBRLV 

TDRFLRFU)QQTVSLFHDBAHCVSRW 

KVEDAAQFLRENGl^PYHAGLSMDVIffiDJQRI^^ 

LDAMLSVCETAACRRVLLLKHFGEAS^ 

QALQLTEAAKKVIiKGETEWIIjRP 

VIRPQTEREQQIjRQKLEAWRYi^ARAEfOLHTVLSD 

RALAAKQPEGLAEI^VYGWEEKAARYGAAVUAVLEBDAV 

SEQ ID 6631 

TTCMTTAATATTTTAAACATGATAGCCAATA(XATATTATCT 
GTCAACCAATTCAA 
SEQ ID 6632 
FN*YFKHD5QmiY(PI^ 

SEQ ID 6633 

ATGATACA(XIACGGCAACCATGCOGTCTGCGTC^ 

ACCACGAAGGCGGTGOGGCGGCACTCT^^ 

CGGTCAGGTTACCGTTTGGGCMCACCCGGCCACACAG^ 

CCCCCKX^iramACGTGCCGAA^ 

SEQ ID 6634 

MIQHGNHAVCVDPSEPSFVI^^ 
RVFTGTVBQLYDNFQRFNQLPJXMiFYPAHEYTAANIaRFAAH 

SEQ ID 6635 

GTGTGGGATTTACTCTTTGOT 

SEQ ID 6636 

VVroiiLFAYTVbniiENAn^CPIiFSDGILC 

SEQ ID 6637 

CTCAGCAGCCGGTTItnmCC^ 
TCGCCAAACAGTmACTACAACATa
TCGCOTIAGGCAGAGTTG^
CAT 

SEQ ID 6638

IiSSRFVHQFflQCQl£nTOT 
H 

SEQ ID 6639 

TTG(&CT«XX^GCAACAAAC^ 

CGACGGGATATGTACGGGTTGGGMTACCGCGCCGC^^ 

CXJAAAAACAGAGAAAGGCATTCAAAAAAGCCGCCAAAAGCAAAM^ 

GO&TGGMCTGAAGGCG^^ 

GICAGACCGTGCTGGATTTCAGCAATACGC6C& 

GGAAAACT(£AAGGTCTGTACCG(X^ 

GATTTCGATATGCAAAACCTHXXSGAAACGTA^ 

TCACTGCCCTAGATTTGTTCGGTTT^^ 

CC6AAAAGCCAT6CCAAAAAAGCGGC6GAGCTGTACXIGGAA^ 

ATAC(XXjAGGTTTCAACAAMTGGACAAAAAACA^ 

GGAAAAACTGTTTGAAATATTTGATAAAAACAGGAATOT 

SEQ ID 6640 

LDWRGNKFLGAAELADLKPLTflQ^^ 
AMELKA?PC1jSSIjFLTKWHKNLYEYLEEYPF 
DFI)MQNLSETH ) KLKTIRLW3K^ 
PKSHAKKAAMRKTRAGVVKLI^ 

SEQ ID 6641 

AGGCTTTTCCAAGCCTACAGGTGGCC^^ 

AGCTCGGTTTCAGTGOGTTGAATCAAAT^ 

TAGGAGGTGATTTTTCTTCMCE^^ 

GTCGGraCGATATCCAAAACCAACTTATCA^ 

ATCTGACGGTTCTCATCACGAACTACCTGACG^ 

CAGGATTCAGAATTTOTACATCATGCGGCAA^ 

TTTMTATTCAGCAAMTGTCAACAACATCTra 

ATACHjGCGCAAAGCATTACCTMGGTATGACCXjAAACCAC^^ 

TOTGCTATTCTGCAT I ! 

SEQ ID 6642 

MJQAYRWPCFQI*^^^ 

VGPDIQNQLIKIRPIiLYTC^NIiKANTADGRKARIQIiK 

FWQQNVNNIFLNTTNSRIFVQYTGNOT 

SEQ ID 6643 

TTGCCGCMGCAGACA(XX^TCCGTTTO 
GTCATTCCTGCGAAAACAGCTCTTCAC^ 

SEQ ID 6644 

l^ADTDPFCGRCTKKPMPSKITNRLQTASVADENS^ 

SEQ ID 6645 

TTTAGACTAGAACTCTACCACCAGCTGTraTTMTAT^^ 

OCGATTTGAGTTGCCAAACCCMTGCTTC^^ 

TTACAACTATAGOCTTAIK&GAAACAAGC^ 

AGAACOTTACGGOGATCAGCTTCTGCGAAA^ 

TTTTTTGCACCATGCTGACCGGGAGCG^ 

TATAACGTGCCAT . 

SEQ ID 6646 

FRVBLYHQLFINITGQFRAIRHWEHTI^ICIYRNPTRQADIjSCQTQC 
KTFTAISFCEITTEIflFnAIN^^ 

SEQ ID 6647 

TTGGAAACGGCTT(^GACGTCATCTGCACATO 

CCCAAACCGOTiraxm<^TATC 

TTTTGTAGTCCAACACCAGCTTCGCOT 

TOTTTCWSftCGGGAT^ 

GOCAACAGTCCGAGGTTCGACCA£AAC^ 

(XXXXX£TGCGGCAGCCGCCAAAAAATCCG^ 

ATGCCCGGGCftXHTGCOSAC^^ 

ACAGCGCGGCAAAOGCTTCX^GCATGGCTCAAAAA^ 

CGCGTCmGCOGCOGAACACOmGT 

SEQ ID 6648 

LETASOTCTSSUUO^ 

FFQTGFQFRQGIAPEAGEKQQPV^^ 

MPGRVADECQALVGMFGKKAVN^
SEQ ID 6649

AATACGACGTTTTTTAG6CG6ACGGCAACC6W 

GGACXOTMTACGAACCT^ 

CGCCAGA6GTAGCCCM6ACAACGCAWGCCTO 

SEQ ID 6650

mm™ 
YT*SCVCFSH 

SEQ ID 6651 

ATGTca3TO X3CC^ 
CCCa^GGCGCmTO^ra 

SEQ ID 6652 

MSVRRSRPSFTHRRESGFVRTETnSSraQSRPQPTFPRHSI^GG 

SEQ ID 6653 

CCATTCT(^TACTCTTTTTMTGCGCT 

ACCGCCAAATCTGCTGCAACC^ 

GTTGCOTCTCT^^ 

SEQ ID 6654 

VGGIGGMTKYKFP^^ 
VALSPALSHGERGRGGCWG 

SEQ ID 6655 

AMTTTATTCTTACCAGCMTCGCTT^ 

SEQ ID 6656 

KFISYQQSLYADLCEYAHLYECVDR6MVSRDDDGSH 

SEQ ID 6657 

GT(XXX3CGACTTTGT0^^ 

mTCcrccrTGcam^ 

(TCHKGCMCAAATC^ 

AAMCQGCATTGAAAGCCGCAGCGTGATGTTTCCCGATATGCA^^ 

AMGGCGGAAACGTFTTGGGCGGATATGGACACGCTGGG^ 

GGWTOftOGCTGGftAACGGTCT 

SEQ ID 6658 

VPRLCHRGFDHNPQDKCW^ 

AAEQILPViamARAIiCroGLl^ 

GITLETVYGHAVKKLVLPQGENVVRFFPKR 

SEQ ID 6659 

GTCCGGACAAATCCGGATTCCC6CCTGT<XOT 
ATAGGTTGTGCAGGCTGTTTTTACTTTTCAGACGA^ 
GGGAGGGCCGGGGAGGCGGCATTCTCC^GTTGCGGCM 
A06GGGAGGCTGTTGGGGTTAASGGOTTG 
SEQ ID 6660 

TGRLLGLRVL 

SEQ ID 6661 

TTTOTACX^GCMTCXKTTTAC 

MTCGTTTCATACTCATAGTTACTTCACGACGCAAAT^ 

CMTATTTGCAGCCTCACAAATCMTTTAGCACGAOT 

SEQ ID 6662 

PliTSHRFTOTFASTRlCTSAW 

SEQ ID 6663 

GTGCGGAAACTTGCCAAATCGGTATTGCAGCCXXX^^ 
AGAGAAAATTCATGCX^TGATGCTGCCAAAAAAGOT 

TOG-K^COTATCTQC^ 
TWGCAAAATGGTCGGCGCGTTCCGMC^ 
GGCGCGCTGCGGCACGCCTCAAGCCTTA^ 
GCGATTACACCGCTCCGTATCGGCGAGTATC^ 

T(nnmc(mTT(mmG66GGmGACA 
SEQ ID 6664 

LVALYLAARHPDKVRSLCLTASPARW 
amRMGEYLHRHIiKG^ 

SEQ ID 6665 

TCTTGCTCTGAAAACC^TTCGAGCCCTAOT 

ACAATATGGTCATTCTCMGTTTTACOT 







SEQ ID 6666 

S(SENDSSPS*mCQLYCDbISR^^ 
SEQ ID 6667 

TT(KX3GCGGTTMGTOTTGGGAAAGGT^^ 

AttGATlK3GATTTACAACCGTTGATTK3^^ 
SEQ ID 6668 

LCRLSCWERLPQLGECRI^ 
SEQ ID 6669 

CGCTGCATTACGAACCGCCCGT^^ 

CCCCGACACACCGTATTCCGACACCACCGC^^ 

TTTTGTTAATCGACGATGTCTTPACCACCK^ 

A 

SEQ ID 6670 

IASFRI5)AANSCPI£FRHVQGGAVCGGC^^ 

PRHTVFRHHRPPQSTLKGGERRRNIKNA^ 
SEQ ID 6671 

TTGGGAAA(MTCCCGCMCTO 

CGTTGATGCAGGTCGTC 
SEQ ID 6672 

UBOA^TWRMPSPQPSPTG^ . 
SEQ ID 6673 

TT(?PGCAGGCT(?mTTrc^ 

GCTGGGGAGAGGGCAITCTCCMCTTCCGG 

AGGCTGTTSGGGT 
SEQ ID 6674 

I£RIJFAFQTTCI!KjCKSNt^ 
SEQ ID 6675 

riKxATATTTATGCTTACCATCGlTm 

CATT6GATTCC6CXIAAAAT6AMCXXX3CCGGGTO 

GACCACCAMG^ACGGCGCGCCCX^GATCAAACC^ 

AAAATCCGCCTGCCGATOMGCCCGGCAGC^^ . i 

SEQ ID 6676 
HFMLTIVLYQPElPPNTGNIIRLCANTGADIjffi^ 
KIRLPMRPGSRSMNLSNTVSVIIiFEAWRQHGYAGGV 

SEQ ID 6677 

TTTOXftGAAAATGAOTCAT^^ 
GTGO^CCX^TAAAAAGGTACATTTAAAATCCT 
TAGAGGTCTGCTCACCAGGTCTMTCXXIAGGAACAAAAG^ 
CGCTGCAMTAAAGC^TATACAGCMTTGAOCGTGTTC 
ATACTGGAAGCAAAAATCGGGGMTAACACCAGC^ 
GCTGTGCACTTTCAAAGTATACAACCAMTAAATTM^ 
CGMGGAATACCIKaAAGCAArACCTGCCCT 
ACCCrTGGAAACATGM^ATO 
TMTTACCTTTCTACCAGCCTCCCCTTC^^ 
ACTAAAGCGCTOTAACGACCCACCGGAAAACATATTCM 
CCMTTCGAAAAACAATCAATGCTCQUATA 
SEQ ID 6678 
FPRK*FHF*TSH*LIILIiSKPIRIYU^IHRYHN t NQQRRA^IKRYI* 
RCK*SHIQQLTVIi*QSSYFVQin , ICISKr^ 

TKAL*RPTGKHIQYSQDAVSSAFV*FS*SNSIN^ 
SEQ ID 6679 

AAGATTTAAMCCTOTTTAGGGAGGCGTO 

GCCMGACCACTACCMTACCGCGTOCMtt 
SEQ ID 6680 

KI*MLF*GGVVARAFnHPRNPPYETHQ^ 
SEQ ID 6681 

TTGT 



AACCGTTGATGCAGGTCGTC 
SEQ ID 6682 

LI£KVMTWRHPSPRPSPRGRGRIAAG^ 

SEQ ID 6683 

^^AGCCTCAGACCT^ 
ACAACAAttTCTTCI^ 
AACAGTTCXCGMGTCCAAACACCG^ 
TOCGCATTGCTTCAAOCTGC^
GCGAGGAGGAAACGGTAGATTGGAGCATCTACCTCTCGGAAGAA^
OTTOXJGAMTGTATCTCGAAATCGG^ 
GGTATT 
SEQ ID 6684 
VYEPETFNPYNPVEIVIDTPEPESVAQTA^^ 
SAI^PAEAPAVEENAAEITLETPDSN^ 
GI 

SEQ ID 6685 

ATGTATGCAGGCGAA(X3CTTCMTACTTft^ 

GCCTATCGGTTTACGGCATCAGCCTTC^^ 

GCTGATTGCCtt^OT^^ 

G6ACGGAAAA6CX5AAAAACGTCT6CTGOT 

CQ^QGCGOTATGCTGTACAGC^^ 

CGTGTACGGTTATGTAATC 
SEQ ID 6686 

MYAGERFTOSHIiSGLILAAAGLHI^ 

GRKSEKRLLSIAraVMGWMTliAVMK^ 
SEQ ID 6687 

ATGATACAGCACGGCMCCATGCCX5TCTGCGTC 
ACCACGAAGGC(MTGCGGCGGCACTCTGGCG0GGCTACAT6G 

CGCGTGTTTACCGGCACXX3TCGAACAGCOT^ 
CCCATATCGAG(XKGACAACM0C6ACATOCAGACGGC^^ 

CCGTC CGCrcAAC ^^ 
SEQ ID 6688 

LUSFRCILTPFGYTFQTASEPSRNTHENYPRQSPDRQIilLDOT 

RSCmO^NTRPMPPHQLPSRNFRRHTRFLRimPFFRRLRra 

PRRPFIOISPGRQNAEQRFGSIRRPARTEKRLPLTCPPKNAV 
SEQ ID 6689

TTGTGCAGGCTGTTTTTTGCTTTT^ 

GCCG(KK5AGAGGGCATTCTCCMGTTGCG^ 
SEQ ID 6690 

I£RIiFFAFQTTCINGCKSNIjFF^ 
SEQ ID 6691 

TTGACTTTAACOCXyiAAAACCCTTTTOT 

OTACAAAGTC(XXXMTTCAC^^ 

AA<XXraXX»TCCCa^ 

GCCXKXSCAAAMTTGGGCTTTGTCAGCCA^^ 

TCGGTACGGAACACGAAGCACAAGCCTATCTGMCCAAGC^ 

CCCGTTTGCCTCGCAGGAACGCGCCG^ 

SEQ ID 6692 

AAQKLGFVSQGTAHVKIEQIVPGQSAPVA^^ 
SEQ ID 6693 

CATTTCMTOTAttlACCAACATCCTCGA 
GTAGATGCftATCAGACCTGCTTGCTTTMGACTAAMTA 
TTTTAGGGAGGCGTCGTTGCAAGGGCATTreACCACOT 
AATACCGCGTCCAACACGAOSAIC^ 

SEQ ID 6694 

HFNLTTNILDSSSRTFGNFOTI£ 



SEQ ID 6695 

AGACTCCACTTTCMCAAGTAGCm 

GATTCMTTGTACCMTCAGGOT 
SEQ ID 6696 

RUIFQQVADFIHHTI^^ 
SEQ ID 6697 

TTGTCGAAATTCGGGGAGCCCTCTCrcctt 

AAAGCAAAAAGCAGCX^TGCACAACCTGAT 
SEQ ID 6698 

LSKFGBPSPPRGRGQVAADta^ 
SEQ ID 6699 

ATGAACATCAAACACCTTCTCTTO 

ACGGCAAACAAGACAAAATCATCAGCCGCGCCCAAGCCGAAAAAGC^^ 
CGAAATOGTCAAAAAOGGACAGGAATACAAAGTOGTT^^
SEQ ID 6700

MKIKHLLLTAAMALLGISAPA 

SEQ ID 6701

GCCATCm'AACTCCCAAMTGTCT^^ 
GATOSTGC^^ 

CATGTCGACCAATAACCTCATGATGGATCGTACtt 
ACCCATACCMTACGACCATCTCCATCA^ 
TCACCGCCTTCTTCAATTT^ : 
SEQ ID 6702
A™SQNVFH($ATF(CTISRSMQIiR^ 

SEQ ID 6703 

TTGOCGCQGGCCTCCACCAA!^^ 

SEQ ID 6704 

IiPPASTOTFNITDPAIAADSVKSINSAESRIjFPKKIiDAAG 

SEQ ID 6705 

GAAGCTTAAACCArrTTCACGAGCAGCTTCA6(^^ 

CXjTTTACCMCTATTGCAGCTGCTTCAACA^^ 

CATAAATATGATTATTGCTTCXXM 

SEQ ID 6706 

BX*KPPSSFSQSIJJTTVII^*TIKSra J PYTCCFSU^^ 

SEQ ID 6707 

TTGAWXXKKXMGCTGAAGCCCGCCCTGCAACCC^^ 

OTMTCCTAAAACCCGCCAT^ 

GCAGGTTTCCGTGCGGACGGATTCGTCATTC^^ 

ACTTATCGGATAAAACGGTTGCCCCMCCCTGOT 

TGCCATACAGCCCTGTTCCC^^ 

SEQ ID 6708 

LMRRABARPATLSMHPIJ^PTIjRI^ 
TYRIKRLPQPCVLDSHFRGNDDSGIPNSNPPQPTYAHPTHI^ 

SEQ ID 6709

TTGCACCCGAOOTCAGGATTCGGCAC^ 
(^TGCAGACCTTCTACAACCTGCCGCOCXSTGGGTGAAAG 
OTCCGCCAACTGATTAAGGTCGGCGGCA 
TCCGCTCCGGGAATCGGCAAAAAAACXrOX^ 
AAGACATl 
GCTGAAA 
SEQ ID 6710 

LHPTLQDSAHMISRLTGKLVEKKPPQrWDVNG^ 
SAPGIGKKTAERHVLEIiRGKLVAHTVTDGlfAASPAA 

SEQ ID 6711 

TTTTTTCmGCTTOTra 
TCCGAtmGTTAmCMTCT^ 
GAGCACGATAACCCACGCCCATCAATTGTMTTTOT 
AmTOX^aUUAGTCA^ 
TCTACA(X!AGCGGGAACCGT^ 
SEQ ID 6712 
FFI/jFFHYHYPAYIAHTFPFIRLRS^ 
IIRNKSQFAIIKFNGYIKMQRiaiQFTO 

SEQ ID 6713 

GTGCAGGCGGATCTGGCTTACGCCTACC^GCA^ 
(KTO^TCCACCCCCGACTCGC^ 
CCGATTCCGTCAAATa^TC^ 
SEQ ID 6714 

VQADLAXAYEHnTOYPEATGA^ 
SEQ ID 6715 

ATGTTOGATTATACTGTCAGCT^^ 

CTOAAOTTCAGACGGOT 
SEQ ID 6716 



SEQ ID 6717 

ATGCCGTOCGAAGGCGGCGCGGCGTTTGCCC^ 
TGTTTGATGCCGTC 

SEQ ID 6718 

MPSEGGAAFALTKPCVSAPRCLP^ 

SEQ ID 6719 

(KX^ACAATGCATAACAACTCACCACCAACAOT 
OTAGACGCTTTATAAATACGCAM(XGGGGCGAGATACACGCTO
CGAAGTCCTC^TATATCCTTC^^

ATCGGAMTAGGATCATGCATACTCAT 
SEQ ID 6720 

GYNA*QLTTOTIiRTCFTISH^^ 

I6NRIMHTH 
SEQ ID 6721 

CCAGCTGGCTTTAACAACACCX^GGMTTTCGC^ 

CGTTGAO^^ 

CATATTTTTTAGCCAAAGOTGACGTW^ 
SEQ ID 6722 

SEQ ID 6723
TCCTTTGAACXSGAAATOMACAA'PGACAA^ 

ATTCAAACATTTGATCACGACQCAAOT 

CTCTAMTCGGAAACAGCATGTTCCATAACTTTTTO 

TTAACCAATTCAGGAACAACTGTCTOT 
SEQ ID 6724 

SFSRKFKQ^*SFRFIJGFCCSS^ 

L*IGNSMPH8FFVSHSLTHIHIQGDFFlfPRDFHY^ 
SEQ ID 6725 

ATGTGCAGGCGGATCTGGCTACGCCTACGAGCA^ 

TTCCATCCACCCCAGGGTGTCGGTCGGCTAC^ 

AGAAAGAATAATAAAACTTTTGGCGGCAACCAGCTTA^ 

TCAGAGTCAACGATAAATTCAAACCOTATAT^^ 

CACAAAACCTACGTATTATGATGATATAGATT^^ 

GCGCCCGGCCTGAOTTGGACGCCGGC^ 
SEQ ID 6726 

iaCIRSTRNPSSIiLPSSUiFSSLLFSSAARAAGEDHGRGPYV^ 

RKNNKTFGGNQUJIXYQKTfflQI^GfF^ 

APGLTLDAGYRYHYWGRLENTRFKTHEASLGWYEF 
SEQ ID 6727 

AT6TTGAACAAAATATTTTCCTGGTTCGAGT(X^ 

MTGGATAGCC(^XCTAGCTGCGCTGACCG^ 

CGAAAAAGGTTGGGAACTGGCGGCMTGGCGGCGATGATGGTGTTT^ 

TGGMCTTCCACCGCCTGATGCTGtAAC^ 

TTGCCGATATGGTCGTraTOT 

GCGOTXK^TGATTCCCAGATKXKjGCAAAC^^ 

GGOK(XMCGAAGCCGTCTATGCmGa 

xotctccaccbccgc^^ 

ATCMGTTCGAACACGTCGA^ 

CCACXATCGTCAACCTGCTTTTGCGP^ 

ACAAtMACXTOOT^^ 

GACCTTTCCGATGCCAAAGGGCGGAGGXOTAC^ 

CCATCCTGCTGCTTGACGAAGCC^CCAGCGCGC^ 

CACCATTGCCGCX^TGGACAGGCTCGTCGTOT 

(mSGCTTCCTCAGCG^^ 
SEQ ID 6728 

MLNKIFSWFESRIDPYPEAAKOTPEKG^ 

WNFHRU^QSWPYQDEPAGRVSAKVI^ 

GARJ&VYAKQSMEEFMVTVRAQMR^ 

IKFEHVDFCYEAGKPLLNGFNLNIKPGEKVGIJGRS^^ 

DliSBAK^GYDAHW 

GQPLSEHVENQBD 
SEQ ID 6729 

TGCTCCAATC^TAGAGCCATTTGATTTGAAGAAAC^ 

TTAGAMTATCCAMGGCATTTCTTTAGTMT^^ 

CTCGAACTACTTGACCCTGCTTACCTTT^ 
SEQ ID 6730 

CSNDRAI*FEEraAPRPAFlFNQIiNTNT^ 
SEQ ID 6731 

GTGCTGTTTCCMCCCIK^^ 
SEQ ID 6732 

VLfPTIilLYRQAFIMDLPSSFLIJTTPSDSKRQ
SEQ ID 6733

TTGATGGCGTTTTACMCATTAACOT 
AmTATTCATOTTGCTATTCOTATAATAC^ 
TTTATTATTCAGCAACACGGCGGCATTGTTATCGAW 
COUJGCGGAAOOGCATCTTTAAC^^ 
SEQ ID 6734 

PRGTASLTATIjI I SPTEA 

SEQ ID 6735

TTGTTmXXmTATG^^ 
TACAAATAGTACGGAACCGAT^^ 

SEQ ID 6736

LFCSDffl?FVKECIYniKFRCGDRIWRIl^QDKAA 

SEQ ID 6737

ATGOXSXmACACCCCmTCAGAC^ 
G(&CAAOTACGAAGCCAG^^ 

ccTGcrcTACcGcm(ymx?rG(^ 

GCCGCCGAAG(XG(X^CATCCGCCKXXn^rc 
CCCa^TGCTGGAAAACGCO^CAGCOT 
OXmaSKEGGCGACATTO^ 

SEQ ID 6738

IffSEHPFSDGISTPNPKETMN^^ 
AAEAAGmVIVliNK^LPETALWLEKLKFYE 
GI^HLQAADLPHYFPDFRHLVGQCRFHNC™ 
SEQ ID 6739

TAATAOTCAGCTGCCAATGAAAC^^ 

TTGTTATCGAAmMTTAACGCACCATCAGGACGAC^ 

CCACTOAATAATATCGCCMC^GAAGCCTAG^ 

CAT |i 
SEQ ID 6740
*YFRCQ*NDFHKSLSTQFTGY^^ 
H 

SEQ ID 6741

GTGTCATCATTttCGATACACC^^ 
GTTCGGTTTCGOTATTTCCGATAAA!TO 
SEQ ID 6742

VSSFPIHRNPETRHSRADRNPDIiSARKI^^ 

SEQ ID 6743

TTGCCGACOSGG^^ 

(XKCGAMTCGTTTGAAGA6GCCTT6TCGW 

CTGCCAAACCAAGCTGGQCCAAGTCGAACAGAAATTACAG 

SEQ ID 6744 

LPTGAS(MPSEA6F6FQTASVLQKCYNPLFTOT 

SEQ ID 6745 

MTAGAAOTCCTTTCTCAACCM 

TGGATTTOGTAGATMTCGAATMOTTAC^ 

CATTTTTAGTTTCGCTCAT 

SEQ ID 6746

NRTCFI^LTDNPRFSF*QWIXM 

SEQ ID 6747

ATGCCCjTCTGAAGGGTTTrcAGACGG^ 
GTCAGGAATACCCGTGTCATCATTCC^ 
GG<^TCC«3ACCCT^^ 
TTimCAAATTGCTT^^ 

SEQ ID 6748 

MPSIKFSDGIFALFTVILKSRHSHAGRKI^EN^ 
FRQIALALDISYFKPL 

SEQ ID 6749 

ttgaatccgacgaat(&t^^ 

gctatgcggctttggacggcggcaagcgot^ 

cgtctattctttggttcacgacgatatgccgg 

camcccaagccttcgacgtgttgagccgt(^^ 

tcgatttggcaaatgtcggcaaacaaatggttcaag 

gtccgacgcggaaotgccctattggacgot 

aaagatcccgacaacgacmgccgacttatc^ 

ccctgcgcctccggcag™^^
SEQ ID 6750

QTQAFDVLSRPTEliPAA^ 

SEQ ID 6751 

CTTAGCACCTTTPTCAGTTAAAACGGTTTTAATA 
TTCMCAAGTCCAACAAATCTGCATTCAACT^^ 
SEQ ID 6752 

SEQ ID 6753 

ATGTCOTCATCAAAC^CGCCCT^ 

TTG0CGGTKXHXXX3TT(X^ 

TTTOACGAACACGTCG(X^6ATGGAAGAGC^ 

ATTGAAAACATCGACATCGGCK?^^ 

ACAACG(X£CGTTGAGC^ 

CGGAGAGCCTGAMTCGGCGAATTCCCAAGCCGGTT^ 

GCCGCAGGCAGCCOTKXGCCTA^ 

TCGTGAAACACGCCMT(XGTGCGGCGT^ 

AGTXX3ACGG(X3AAACCGTCAAA 

GAAGTG(XXjCTCAAAGCAGGT(X£^^ 

GCCAACCGACCGAGCA(K3AATG6AACGATO 

MTGAGCCXJCGTGGACAGCACCCGCATCGCCGCCC^^ 

ATTQOCGAACAQGGCATCAAAGCXaTCAT^ 

GCCAT 
SEQ ID 6754 

MSVIKRAIJSIiSDKAGAVEFAQl^ 

IESID1(X3PTOVRSAAXNWKH^ 

AAGSLSAWQLQGKEIfi^WIADADAAm^ 

EVPLKAGAHRFELKRVGGGI^^ 

IAEQGIKMIHPAGSMRDQEVFDAADEHGimVTC 
SEQ ID 6755

TTGCT1TCATTATTGACCCACCTGTCTXACT 

ACmACCTGGmMTTTCGGCMTGTAATATTC 

CACC&CGTraATGTGACGGGra 

AGCGATGCCOTATTGCGACCOTTTGTT^ 
SEQ ID 6756 

SDAGIATLLLL'rVFQSSWLQHYV^ 
SEQ ID 6757

TTGACCCACCTGTOTACTACAAA 

TTMTTTCGGCAATGTMTATTCCACGTTAOTTTAC^ 

TGTGACXSGGTCATIGCACS^GAQC^GCTK^TTTGA 

ATTCmCCOTTTGTO^ 
SEQ ID 6758 

LTHLSYYKGCRNRQFGSSQLEOT^ 

UTIMLTVFQSSWLQH 
SEQ ID 6759 

AltXACTTCATAO^^ 

CGCCIOTGATTGCX^ 

CCGCATTGCTGACTOTGTC^TCGCCACXX^ 

CGGOXKATJCTTGG^^ 

GTGATTGCCGTTTGGTTTCXXX5AAA 

TTTTGATGGCGCT(£CG<£^^ 

TGAAAGCGCGGCAGTCGTCGGAGCGGCGGTGTTTC^^ 

TGCATCGGCATGGCGGGGGTATTGGGAACATCGGGAAAMTGG 

SEQ ID 6760 

MHFIMITAYGYDADGA^ 
RLLIACASGAGt^VYNVPIAS^ 
VIAVtfFPEILGMGKAGNQLTF^LTGWQYSIiELT^ 
CIGM^GVLGTSGKffiFK 

SEQ ID 6761 

TTGTTTGCTTCTTTATTCCCTGT^^ 
ACCOTCCCGCACGGGATT^ 
^TCGT6Ajt3GAA6GG6GCGTTGGCAAAGATGA6T^ 
TGCGGAACGGCATATCTTAACAAAAACGGCAKiCTC 






SEQ ID 6762 

LFAOJPVARVPmPHETREE 
CGTAYLNKBGSL 
SEQ ID 6763

GTTGGCTGCAGCATTACGTCTACCTGCCT1TO 
TGCGCTTCGCTGGTTGCATMTCTACATTT^ 
TMTGCCTTTAGCACCAGAACCX^T^^ 
AGGAOXmATCTCTTCAATAmA 
TGAATGGTAATGCGTGCAGATTTTGCAGGGCGTTC^ 
AGTCGGTACTTTTAGCAMCCATTTT^ 
SEQ ID 6764 
VGCSITSTCLSLFTimFC^^ 
RTANLFNIYMNRYTHQDLQVTFQNFM^ 

SEQ ID 6765 

GTGTCAGAGGCTGCCGmT^ 
TOTGGCACTCATCTTTGCCAA^ 
CGCCTCTCTTGCCTACATCCTCGAATCC^^ 
TATTTTTGQCACAC6CGCAACAGG6AA 
SEQ ID 6766
VSEAAVmiCRS^FRRHRRR^^ 
YFtfHTRNRE 

SEQ ID 6767 

GTTACOCACTGTCACATTGftTATGACAAGOT 

AGTTCATCAATGTCCGCACCGTTATTATGCTCA 

OTCTWAOCACGAATCAAATCAGCTACXIA^ 
SEQ ID 6768 

mcaiDMTSLFFDAVTATPGASIiKTP 
SEQ ID 6769 

AT(KXXX»ACCCCK^ 

GATTTTCTTTAAGCGAAAAAGACGC<^^ 

GATTACCATCGMTGTATC^TGAAATTGCGGC^ 

GTTGCGTKjTTGGCXSGGATTGGArc^ 

TGCTGGTGTCCG(KXXX3TTCAC 

CAGMTCATCGACGCGCAGGCAAAGGCGGATTTATTGCXjC^ 
GCCGATGCCTGTATC^^
SEQ ID 6770

MPQALVLQFPSAEALPSDFPSRLPEPDYADEKRMRPIVEEGFSIiSEKDAAUiD 

VALLAGLDEQILADrHS^^ 

EAGIGVAYRAKPKARAAADACINFGGLERVRGLFG 

SEQ ID 6771

QOTATTCGCATAGGCAACCGAATOSTCGCC^ 
(BGACGAOGTra^ 

TCAGTTCCAAAACCGTGCTGTCGGTTTCXnCGOT 
SEQ ID 6772 
GIRIGNRIVAAFLUIEI^FASVGVFYEGYHGCAVFHQAGFAD 

SEQ ID 6773 

ATGGAAAATATATTCTCKffl 
TTA(X^6CTGGGCGC6GGATTCAAATCCGTATTCGG0^ 
TACGGGCAACGTCXjCCGGTGTGGCGArcGCCATTA 
CAGAAATACCGCGTCGTCAGCCACGGCAAATAC^ 
TCGCGCTGATTGTCGCGTTQGGCTTTATCGGCAATG^ 
TGCGGGCATGGTTGTGATTGGCGGCGTG^ 
CATATOTGCCmTGTTCAACCACATOTTAC^ 
TTTCCAACGAAGCAGGTATGGGTTCGACTCCGCAC^ 
TACGGCTACGGCATTGATTATCCTGCT(^ 
ATGTGTCTGACCTTCTTCGCCOT 
CTATCGmTGGGTACGCTC(^^ 
CATOATGATGACTACCTGATGCAGAAAAA^ 
SEQ ID 6774 
MENILSVLVGTVNRFLfcK^ 
QKYKWSHGKHGGPAFHTHGLTPKI(&GAAI^ 
HIVPMFNHIFTAAFNPEAVIjGGAAGIGMREAIRFGVA^ 
tOjTFFAFTOIGtnnfFGESNIRFIiFRGWEjGiyRALV^ 

SEQ ID 6775 

■ITGGAGGACXjAGGGCTTGCGGCATAACGGC^^ 
AGTATGTGTCGGACMCACGGGAAAATGCGCAAOT 

SEQ ID 6776 

I£DEGIiRHraSKNADYIVKRQRMI^
SEQ ID 6777

TGaCGGCAATGGTCAGACC(&TAA^ 
CTACATATOGGCCTTTTTTC^ 
SEQ ID 6778 
FLFSLFIGQVAFKGTGQ*EFA*Flfnro 

SEQ ID 6779 

ATGAAAM0GCTATCCCA(X3CG^ 

TGGCGGCAAGTTTGGACGATGCCGCTTACCAM^ 

GCTCCTCAAAGAAGGTCAAACCATCGTCAGCTO 

TCCCX3CGCTCAGGCCTTGGACGCTTO 

AAGTGCCGCCTGCGCAGGTTTTGGTGATT^^ 

ACAAATCGAATCGATGGGOGCTAAGTTOT^ 

TTTGCCGAACAGGCGAAAGAAGTGGACATCATCATCAC^^ 

TCGATTTGGCGGCGACGGGCGGCAACTGCG^ 

TTCCCAGCTOACGCCACCAACT^^ 

TCGCGCCCGCCGCCATCGCCGCCGTATTO 

GAACGTCAGCCACTCGCTGC^ 

TTTGTTGC^TCCTGATTGCCGGCATCM 
SEQ ID 6780 

MKIGIPRESLSGETRVACTPATVALLGM 

SRAQAtDALSSMANISGYI&VIEAAN^ 

FAEQAKEVDIXITTAAIPGKPAPKIJTK^IVESMKSGSVIVD 

GEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPIiWKKlAPAAI^ 

FVAILIAGINIFGGFAVTRHMUIMFKK6 
SEQ ID 6781 

acctttatttgagtaacggcgacgaacaatc^ 

ccttcaccaocacatgcgg^ 

taccgacttcaccgatggtox^c 

acgtgc^tttgagcacctttaccgggttt^ 

aatacgacaccggcttgmtaccgcgaggagc(^^ 

cagaaatgccgtctttgtpacgtttaaagtctacaa 

trcmgcaagggtgcataaggtgcacctttgtgcam 

SEQ ID 6782 

TFI *VTATNNHIII&FVITSAVAFSRSTP1^M 
TCDLSOTGFHFDTVDSRTNGNVm 
FKQGCIRCTFVQTFCYHANHAATACRGRIjHFNIXIH 
SEQ ID 6783 
ATGAATTGGTCAGATTTGAATTTMTGCCXX^ 
ACGGCGCGTTGGCMGjGTGGCGGTTACGGCGG^ 
«3TmATATGCOTGA03Tm 
ATGGTGAGCGCGGCmTTTTTTMCTO 
ATTTTGTTCTGGGCG(^C^^ 
CGAATGGCTGTTGA^ACTGGGTOGGT^^ 
GCCTTGGTCGGCACTGCCCCGAAMTCGCCGC^^ 
CGCTGCTGGTCGGCAACOTTG03OT 
TGCGGG3GGCCTCTATTACGCCATTAOT 
AACCAACACCXXXH'ATCGCTTGCCTTTTT^ 
GCTATGTATGGTTGTCGGTGTTTGCCGTCG^ 
CAACAATGCCGCAAAATCCCTCTTMGCGTCM 

SEQ ID 6784 

MWSDUniCPAIiPE^Ifi^ 

MVSAGHFLTAYIGimSLALYALIALRRDSGFMEAALKYFVI^ 
ALVCTAPKIMVVFAFRILVTGLGTVHHIW^ 
NQHRVWIAFimVtfSHA^^ 
SEQ ID 6785 
ATGACTTTCGCCTATTGGTGCA!TrC^ 

CACATAC(m<^CAa:c(mxnsc^ 

CGTCAACACGCTTGC(X3GATTGTTCAT^^ 
CTGTTTGTCGCGGCTGCT 

SEQ ID 6786 

MTFAYl^ILIACLLPLFCAAYAKKAGGJ^^ 
LFVAAA 

SEQ ID 6787 

TTGAMCAGATQCCGTCTGAAMCACGM 

SEQ ID 6788
SEQ ID 6789

TTCXOTATCTGCAGCTGCAGC^GCGG^ 
CCTTTMTGGTAACGCTAGTTACAGAAGCMCTT^ 
CAGCCMTACGTTGCTO 
SEQ ID 6790 

FLICSCSSGFQIQIiLTVNQAYISFFNIAAATQ^ 
SEQ ID 6791 

TTG66MTAA6TAAAGMGGGGGGAGAGTA3TC^^ 

AGCGCmXX3GTTTCTT^ 

&3CGTGGACGGTATTTCAGTGCTOT 

TGAT6tCGGGTTOATTAAOGGCGCGTTTGCC6C6C^ 

CGTCTATGCGTCGGTCAAGCTG^^ 

AAACAGATTCCGTTGGGOTACAACAGCTTI^^ 

GAGCCTGATTKXXCTMOT 

GTTGACGGGCAGTTGGACX^CTGGGCGTTC^ 

<£AATATTG(X!GATTAC<XaCG^ 

GGTGGTPATGGGCGCGGTCAAAGTGMTTTCTO 

MTCCGCACGTTGCXXjAAATGAAAGACATCAATI^^ 

AGGCGO^AACGATTTGATTGC^ 

SEQ ID 6792 

LG I SKEGGRVF I PS FRRLPKTLS VKTNFYNYIFYFNHRIjTTMF SNYIiLSIAIWI PIAAGVLVLATGKDSRAP LARVLAFMGALAGFLVTLP LFTGFIM*SGGYQFTEFHEWIPLIiKINYAIi 

GVDGISVLFIIU^ITLLVVUW^QKRPAQ 

KQIPIKjVQQLLFVAFFLSFAVKVPMFPVHTWI^ 

VH$IdDWAI*GA^ 

nphvaekkdbcrefmlavuvavi^^ 

SEQ ID 6793 

ATGTCTTCAGGACTCGTAACAGCGGCGTATATCGT^ 

TGGCGGTCGCCCTCTTCGTAACGGTC^^ 

(X5AAATGC(X}GAACTGATTGCACTTCTG(^ 

GTCGAAGTCTATTTGGGCATTTTCATC 

TCAACGCACTGGCGCTCGCCGTATCGTTTGTGT^ 

TGOTKX^TCGGCGGCGCGGATATGCOT 

CTGGTCGGCTCAAGCGGOGCAA!PTCTGTCX!T 

MGAGATAGAGGAATACCGAGAAGTCAAAGCTGCXX&TC 

TGCCGAAATCACCGAGCTTTTGCGTAAAAA(X^CATCG 

GACATTGTTTTGGAMTGGACGAAATCAACGACGACT^^ 

CAGGTATGCCGGTTTTAGAAGTATGGMGGCAAMGAAGTCGT^ 

TTTCGGC^AOGCGAAGAAA^ 
SEQ ID 6794 

MSSGLWMYIVAAILFIFSLAGLSKQETAKRGCYSGIAGMAVAIiFOT 

VEVYIXSIPIC^VTFTGSLAAFGKI^GKISSSPLQ^ 

LVGSSCMLSmCKAMNRSFVSm^ 

DIVLEMDEINDDFPCTDWLVIGA^ 
SEQ ID 6795 

TACCCATTGCTCCTCTAATTGTCCAA 

GAAGCCAAGTAAACATTCTCGTCGAGCCGTTTGGTT^ 

TCACAAACAAACGCTCGTCAC^ 

GCCTTTACGOCACAGCGGAGAAGAACTCATACCGGMCGGGO^^ 

GAGC6GGCGTTT6(X^AAGTA66CATTTACCAGCTGGTGA 

CTTTCAATTOCAT 
SEQ ID 6796 

YPLLL*LCNRIPSDI)YPFITQQAVRINI^SF^ 

AFTPQRRRSHTGTGAAGTFLTPWFFG*VFYFGTGF1*STVTRA^ 

SEQ ID 6797 

TOCAAGAGmAAAmATTTTTG(&CAA& 

TTCAGGGTTTTCATCCTAAAGCTGTATTTTG^ 

GACTAAAATTTCCGAGCAGGATTTTATAAAAGTCAAATOT 

AAGAAATCAACTGTTAGACMTCAGTGATGGATAAGTTATTO 

AATTAAAAAA3OTATTAAGCAAITGTCCAGCAGGAAAAATAGACAA 

!mTCAAGATAAAACGTGGAAAA(^^ 

ATTGATTGGCTAAAAGAATGTGGGGTTTC(^C^^ 

AAAATAAATTTMGTCTGCAAMGGAGCAAGCGMGGCTCC^^ 

TTTAGTACAGGCAGMTATCGCMTCGTACAGATATTGATTCX^^
SEQ ID 6798

KKSTVRQSVMDKLFM^^ 
IDWI£EC<#STRGSW^ 

SEQ ID 6799

GTTGGGCACCGAAOTATGAOT^ 

AAGCTAACTTGAACGGCGGTATAtXra 
TC^TACCAAOTTGCXJTC^^ 

SEQ ID 6800 

RTYFHAGTNYNIAVDRTRNSTLNTO 
QFSQFFFSQSIOWATGFim:POTSGFGFVDTIGF 

SEQ ID 6801 

TTGTT'KnTTCGTTGATTTTGTC 
TGATITOTCAAATCAAACAGAT 

SEQ ID 6802 

IiFVSIJLCQFQMWVIiIRTG^ 

SEQ ID 6803 

ATGCTACXXyUOGGCBTGAMT^ 
ACACCAAA 

SEQ ID 6804 

MVPHRREIAKMQIICFLNFliGHLKKYRGVKRlJ 

SEQ ID 6805 

TTCGCCTGTCTTTCAGACAAAGTTA2 
CGTTT 
SEQ ID 6806 
LACLSDKVKLGKNVPSLT^^ 

SEQ ID 6807 

ATGMTATTTTATTATTAGACGGCGGCAAAGCGTTCGGA^ 
CCGT6ATTGATGCXI6GCTATGATGTTGAGGCGGAAATCGAAAAA 
GGACGMGTCTTTACCGGC(&ACA^^ 
CTGACTTGGAATGCGC(X^TTGAAGCCTT^ 
TGCC6ACATTCTTATGTAA06ATGTGGTTAAAAAT0C^^ 
SEQ ID 6808 

MNILLLIXEKAFGHSHGGI^^ 
LTWNAPIEAFTREGDFFEGKGVI^ 

SEQ ID 6809 

ATCAAAACCAATTCAGAAGMCTGACCGTATTTGT^ 

TCGAGGAAAAGTOGQOTGIA^ 

AACCGAAATGCTGGCAGTW^CXSAAGTA^ 

ATCCGACTTTCGCTC^^ 

GCXIACTTCCGCGTAGTC^CCAGTCX^ 

GGCGGTTTTAGATGCGCAGGGAAATCCCTATAAMTTTCAC(^ 

TTGGTTGACAACGACATCACTGAAGGAAAGTTAATTCO^ 

GCGTATTTTTGGATTTTTTAGTCMGG 

SEQ ID 6810

MKTNSmiWVQWESCSFSRAAEQL^ 

IRLSLVSSEGYINLIERKVDIAIi^ 

LVDNDITEGlOjIPLFAEQTSNKTffi 

SEQ ID 6811 

ATGMCGATATGACTTTATATTTGATAATTGCCCTTG^ 

TC(K?TGTATCTG(X!GTGCnX3TC^ 

CTTCTTGGTCGATACGATGACGGCGAT(^ 

AGmTAmCTTTO^ 

TCAMCGTCCGAGCGC(^TTT^^ 

TCAGGACGTATTCGCCTATCTGCCCAACGTACA^ 

TCGa^ATTcccarrea^^ 

(XXXX3ATTTAIIPGAGATGA6CAGCACOGCG^^ 

rimCCCTGTOXAAC^^ 

AGTGCGA!1TATCGCTATGCA^ 

TCTTCTCCGGCTraACTC^ 

GTTCCGCXIAATACTITATGGTGTTCXA^ 

AGCCCGOTGTCGTTACCCTGCCGCTGATTCTGCT^^ 

CCGACGCGCATCX^ACCATC^CAra 

GCTTTTCTA(X3TCAAACTGCCG^ 
GGCACGCGCGCATTGGGAAACTTCTO^
AAACCGGCTTPATCTACACCTACGC^^ 

SEQ ID 6812 

mmLYLIIALVPLAGSLIAGIiFGmGRAGAHTVTII^ 
SttSIiFTFSMLHLIMSNNrc^ 
SAQFPIflVWLPDSMEGPTPI^^ 
SAIIGMHHDQDMRHM^ 

SPLVVTLPLIUAIPSVIIGYIAIEPMLYGDFFKDVIFVNMAH^ 
GHUO£NFFVKVGOT^ 

SEQ ID 6813

TTG(X}GGCGGCGGTATCAGGTATTC6CATA6G(^ 

TTtttTCAGGCCOX^^ 

TCGTATCCCIAGTTGTGQjTCAGT^ 

AMXflTTTGOTAQCaftlC^^ 
SEQ ID 6814 

LAMVSGIRIGNIUVAAFLLNELD^ 

NVLVGIEGFFBIAHAEL . 
SEQ ID 6815

ATOmTCGAACAOOC^^ 

AACAAATCGAAAACGGCACACOX?TC 

GCCGCGCGAACGCAATATCGTTTG^^ 

TTGGCGGCGGTCGATGATTTGGATGCGGACGMTTGGC^^ 

CGGCAATGTCGTATGAAGACAACCAAGTCGGTGCGATO 

CGAO^TACCGACMGATTTTTGTCGTOG^ 

6TCGTGCGTTT(X:GCGC0GAGGACGACGTGGAA(^ 

ACGAGATGGT(^CGTGATCC(XK^ 

6CTCG0CGTCAACCT(^^ 

AACTCGGGCMCCAGACGATTA(XATGATTGTCXXj^^ 

TTTGGGGGAWXri^TGGGAGCGGTTTCGTC 

TCaxnra'MTGGAAAAATTCGGAO^ 

SEQ ID 6816

hsiehtppthdgetcqnhaerpsadfdrwsu:eilkpafe^ 

IiAAVDDLDADELAELADDLFHQVVYEALQTRDE 

VVRFRAEDDVEEAAQAFERmVTAPVTO 

NSGI^ITMIVRAMAMGQLTO^ 

SEQ ID 6817 

ATGTTCG(XX5GTCGTCTGAA^ 
CCGCCO^^ 

CXX^GCTTTTTGTATITrCCATOT 
SEQ ID 6818 
MFCmKNSPSITAKHFLKENIMKFINTCmGA^ 

SEQ ID 6819 

TTGCGTCGGCAATTCAGGC&^ 

TCAAAATTATAAAAATAACCAAT 
SEQ ID 6820 

LimQFRQl©EDYPQNSYGFG^^ 
SEQ ID 6821 

CGGCCTATC^TCTTTCTCCGACA^ 

ATTAAAmAAATCAC^AAAGCGACGMGCCGCAGACAG 
SEQ ID 6822 

RRMHLSPTNRD0VEVLQDIiNRPIiQIMPFFQIQ 
SEQ ID 6823 

ATGATTACTOC&CGCATTATO 

TGGCGGTGAACTTCAACTTTATCGCC^^ 

03TGCTGGTGTACaX3^^ 

SEQ ID 6824 

ICTLTHYLVIfiALLFGISA^ 

SEQ ID 6825 

TOACTI^GAATAAAACATCATCCATCCTO 
CTGTTTIOTTCACrAAAAAATG^ 

SEQ ID 6826 

FT*E*MIHP*jpfflASAQTKPDSTFFR^ 

SEQ ID 6827 

TTGAGGGTMTCTTCATCGTTTT6CCTGM 
TCAATATTTTA 

SEQ ID 6828 

IAVIPIVIiPEIiPTQSDIiSAVVFERIiIAACPISKIS^ 
T^GCGCGATGATTK^TGC^TGCAGGC^



AGTCGMTCTGCCGAAGAAAGO^GTTOAGACGGCCT^^ 

S S^LFS™Y(^VRTVTmPVHAMmVLTFCVSW 
L^IPADYNNIRDLGSHYTDniLPF^ 

S f^l^ccmaccMGCcc<aTTOT^ 

AGGCACAAAATMGCCGATTKMCGACOGCACTGTTTTTTGAT^^ 
SEQ ID 6832 

CGTACCGC 

THIFEYHGEKKGDLHKiroililAIGDKyEABIAKRKAADAPYR 
SEQ ID 6835 

SEQ ID 6836 

LRHKISRFHRPHCFLILLPCUCSVCRRIFSDGIT 

SEQ ID 6837 



SEQ ID 6838 

HQQIAISILEELNNTSTOIIASAVISTDGLPW 

SEQ ID 6839 

SEQ ID 6840 

PTALFFDFWIjP*ICLEE«IFRRHHIKNPQ 

GTCATT^GATC(XaTCAATTTGTCG6ACGAGCMTOTO 
TTTTATACCGCCGWACAGCAACCGCGIQ 



DPSGQSBUrPFPIinGSTOTLmGIEDLSmmbVRIWmSlffiV 

caaactcttgtttaaagaggtaacccgtcogaagctgtcaaj^^ 

CAATGCGTCOTCOKGC^ 
CATCGCGGGCGGTTCTGTGTTCTCGTGGAAOT 
TO*3AAATCGTTGCCGGACA(X!!A(KTCGAATA^ 
GCTGGTICTCTCCGTTCCC^ 
CTACCGCTACGACCAAAT 



SEIVAGHHWSGFAFALFFIiAEHPfflLIAALTSI^^ 

GTCCAGAACGGTC^WMGGCGTAATCATMGCTT^ 
8 5n!n8& 
SEQ ID 6847

TTGCTTG(X^TTTTOGACAGTCCGM 

AGGCGCAAAITGGGTGC6GATTTC(^?FGW 

CGGTACGTTTO3CAGTTTCAACGOT 

SEQ ID 6848 

U^FBQSBUiDFNFYINTG^ 

SEQ ID 6849 

TTGGATGAGGATACCAAAGTOT 
CCAACTCGTTGAAAGATTTGGAATOT^^ 
TCACAAATATCTTGCAAAACATMTCT^^ 
GTT 

SEQ ID 6850 

I^EmKVHLySTP^BRFNFMWEILSQGS^ 

V 

SEQ ID 6851 

ATGTTACAAATCGAAATCGACGGCAAAQGGTATCTGT^ 

TTGCCGCCAACTGCOTATGTGTCTCXn^ 

GGCGCAGGAAGGCGTGATt^G^^ 

CGTrATACCGAAGAAAAACGTTCCGTCGTO 

GTTTGCAGGAAATTGCGATGGTGAATCGTGGTG^ 

GACCAGCAAACCGTTCCGCTTCAAC(XX£GTAC^^ 

CGCGTGTTGCCGTTGGAAAACGAAGCCA!^^ 

(^GTOATGGACGTGGATTGGAAAA 

TGAAGAOTGTATCTGGCCAAAAAATTT^^ 

GOTA^ 

CGCCAGCCTGAAAAACGCTGAAAAAGCAGCGGTGATTTTGC^^ 
CTGGGCA!ITITOXX^^ 
AGCCTGAAATCGATACGGTTGACGGTOAAAAGCCT 
GCCGAITGCGCCGTTTACCGAAACATCAGGC^ 
GTATTGGGCAACCTGTTTGACCTGAAAGGTT^^ 
GGGAGGGCGTTCAGACGGCCTCAAAOCGCCTCGTCCGTG^ 
TGCCGCGCGTGTGMTCCGAACACATTGGCACGC^^ 
GAAMCGTGGTGCATCTGCCGCTGCATAC^^ 
SEQ ID 6852

HLQIEIIX^SVBQGATOEAAHKL^ 
RYTEEKRSWGKDMGPLVSAEEMSRCIHCTRCVRFTEEIAGIjQ 
RVLPLENEAINECWIjSDRDRFAYEGLYHESRLKNP 
QSIESIAimVLVVGANI^ 
LGILPQAANSVGADVLOTSGESVAEMANTP^ 
VLGNLFDI^?EYHDTAMLKDALDAES1^ 
ENVVHIiPIOTENAAIiGAIHGIIELAGA 
SEQ ID 6853 

TTCCCATWATCAATATCCTTO 
AGTTCTACAGTAATGGTTACGTTCTKIACCCGGCATO 

GGTACCGCGCAGCAATACGCCTACGTTGTOGCC^^ 
TOTCACCMCGTGGATGATACCTCCKOT 

SEQ ID 6854 

LPFXHILQIiSDNRRNHAGTH^ 
CTAQQYAYVVARLTFVQQFAEHPNAGTGGFLGPFQTDDF^ 

SEQ ID 6855 

TTGATAAATMFTTTTTATGGAGAT^^ 
GCGGGAATCXATTmTGAATTT^^ 

SEQ ID 6856 

LIHNFUffiSSEKHASIXn^ 

SEQ ID 6857 

AI^TAAAAACmTCCTCTGACCTra 

SEQ ID 6858 

KNKl^QSSDLHDQII^QNLSRFHI FQKRLEL 

SEQ ID 6859 

AGC^TAACAGAAGAAACCACGCCGGCACCCACGG^^ 

OTGGCATTACCATTTCCACACCTO^^ 
iOTCMTACGTACACTTCTGCTT^^ 

SEQ ID 6860 



VARLTFVQQPAEH 
SEQ ID 6861

CGGTTTCACAACAGGCATACTCCGATATGGCGGATAA^ 
GCGCGGCCTTCTGCAMGACTX^TGAAAAAACT^^ 

SEQ ID 6862 

LTKASVmmYKVKPLNLNDI 

SEQ ID 6863 

GTGGTTTTTTGGGTTTCfnO^ 
i^GGCAGCAGGMTGG?ITGTCCACGGCACGCI^^ 

GAACACGATGATGTMGGTACGCCTACTTGACGGGCC^ 
GTAATCATGTTTTTAACGTAGTCGGCGTGAOT 

SEQ ID 6864 

VVFWSFRPTISISSPTWMIPRSTRPVT^ 
EHDDVRYAYI/rGQQDVFAGLWffiAVGSR!^ 

SEQ ID 6865 

GTGGGGAGGGTOTAATGGTGGMGCTAAAATTOT 

TAGraTITITGAAAGATA^ 

CAAACTTTAITCGCATMTTGT^ 

SEQ ID 6866 

VGRVVKVEAKIFIIiYGAAHKfiKST^ 

SEQ ID 6867 

ATG<£AGGCCATA(£AAGTGGGCG^ 
GCGATCCCGGCGCAAATCCGCX^CTXXXaT^^ 

GGCAACTTGGGTACCX&CGGCTGCGTGGC 

ACGTGGTTACCAACGACGACGGTTCCATCG^ 

a3CCCAAAACGAAACCGMCTCTCCGGC^ 

SEQ ID 6868 

MGHSKWANIQHKKARQDAKRGMFTOLIKE^ 
GNLGTIXXIYAPNFVHQGyLVFEPGVDEDEIjMEAAIi^ 

SEQ ID 6869 

MCTTCGATTTTATTCGGCGTAAACGTCTGCCA^ 

CGTCCGATAACCGMCGCATCATGTCGC^TCGGCl^^ 

CGGCGGTTTOCGCGGCTTCTTTCTCGCC^^ 

AGGTOTCCCTGCGCGGCATAGGCTTCGTAAAGC^^ 

TGTTCAATGTCGOTAAAATCATGTT^^ 

CGCMTAAAA(^TGCMTCTCAAACTGATATG^^ 

ACCGTATCGGGAGAATCGAGCATTGTCCGG^ 

CGACTTCCGCCAACTCCCTTGCCGCGOT 

GGCGGCAAACCA6CCCATC6TGAAGAA6ACG<3GCAAAA^ 

SEQ ID 6870 

HFDFIRRKRLPPVAGGTVP^IjGFEVAVAATVHHAAI^^ 
RFSIiRGIGFVKLLAI^QVCMVLUjD^ 
GFTRHFTVLQPQKNLFGTIDQTRALVViiR 
GGKPAHREEIXSQKCNRQQDNPQFTA/HIGFLK^ 

SEQ ID 6871 

CCAA(XnX3GATGATACCTCGC^ 

ATGTAGCTGTCCMTGCGGTAGCCAGTTCGAAGATTTO^ 

TGGACAGCAGGTCGOGGATTTCCATTTCAAC^ 

GTGTOGOJGGTITO^GCA^ 

TGTGCGTAGTGGCGGGTTTCGGTTTCGTO 

TTTTAGCTAAAATAGTAGTCAAAGCAGCAGTCAGGGT^ 

SEQ ID 6872 

PTWMIPRSrRPVTTVPRPEHGEHVFDRQQEWFVHGTW 
WAGLRHRAVGSM^^ 
SEQ ID 6873 
Gl^XMTTTCCAT^ 
CGCTTGTCGGCATTTTTGA 

SEQ ID 6874 

VAVFHLDGAGQVDLSDFSRYSVYVEILAKPDTO 

SEQ ID 6875 

ATCGCTAJOTACC^TCAGGCGTGATP^^ 
AAAACATCTCGCAAArcGATGTGATTGACGAA 
AAAATATGTGGTTTGCAAaCCGACGAAG<^ 
GCGAAAGCCGGOTACMCTATATCCACGGCGAAATTTC 
TTGMTTCGMCTCTTCGCCCACCAC^^
CTCOTCGGO^A 

GGTACCAMTTATTTTGTATTTCCGGCCATGTO 

TCAMGCX^GTTATTCCCGGCGGTTCGTCCG 

TATCG^TOACGAAGACGTGTIXATG^ 

ATCGTOCGCCGCATCGTAGMGGCAAAGGCCGC^^ 

TCCGCAGCTmCCAAGCAT^^ 

SEQ ID 6876 

MAnQSGVIFDQVETANPDCWTLDBWKRGGYr 
AKAGYNYIHGEIPEGYQRFEAALEQARMGFJ^KNILGSDFEFELFAHHG^ 
GTKLPCTSGHVERKjNYEVPLGTPFAEV^^ 
IVRRIVBGKGRMMiDIiLDS^^ 

SEQ ID 6877 

TOTTGGATCTCG^^ 
GCGCGGGAATGACGGCATTTCAGTATT^ 

SEQ ID 6878 

LLDLDffiiLVCRHSRAGGNPSSMVSNVIiLNSE^ 

SEQ ID 6879 

GCCTOACGGGCTTOGTTACAGCA 

AACATTTCTGCCAGAGGTACTTCGGCACGGACTT^^ 

CCGGAGTTTCCACTTCAACTGCCAT^ 

ATGOTMGAACX^GMTACCAGACGGATACGTACGTCAACT 

ACACCJVXmMTTTCATCGATAAACI^ 

CAGCTOAACGGCTTOCGGATAGTTOCACGGTAAG 

ACCGGAAATAATGGTTTGACCGGATrCTTCGTC^ 

ACGGCMTATGGATTACCGGCTCGGGGMTTOCATACG^ 

aGCGOTACTTCTO^TTTCACT 

GTAAACGCGGATAAAGGTCAGCTGAOCGAGGT^^ 

ACACOTGAACCGGAGGAATATCGGTAGGAGCTC^ 

CCAAAGTACGTTGA(X3CAACGCGCCTA(XM 

TTCTTCGGCAGTTTCGA(X!AATTCGGCA66^ 

TOTCXX3CGCCTA(XGGAATGACGATAGGTACGGG 

GCGGAAC?TGGTATTTGTTGGCT1XXXXXX^TACGGOT 

GCTAAAGTCAACGTGCXECGGGGTGTC^ 

TCTTCTTCCATGTAGTCGGTAGTAGCCGCACCGTOT 

AAATACCGASKjTTGCGGTACAGGCTGA^ 

SEQ ID 6880 

AFTGFSVSSSYVGRSFRIHiBiaKVSSAA^^ 

MVRTEYQTDTYVNYRIAGM)TWQR^ 

TGNKGI/TGFFVCTNAERRWPSQAVQGDflHFI^ 



FFGSFDQFGRNM<KGOTIGFIPD^^ 
GKVNVPRGVDDVEAVLGELFRHTGPVGSNGSGGNOT 

SEQ ID 6881 

TTGCAGGTCAGTTO)AAGACTGTTCCCGAT 
CGATTTOCCC^TACTOCGOT 

SEQ ID 6882 

LQVSCKTVPDNSQWREKAKVG IMSCMP^FSIffrAQIBIiIYP 

SEQ ID 6883 

TTGCMCTGACCTGCAAATTTTCGTCTTO 

jffAGCGAIGTCCTG^ 
SEQ ID 6884 
LQLTeKFSSl^SSFYTLSVSmCIB^ 

SEQ ID 6885 

ATGmTCCGCAAAATCCTTAAA^ 
(XXX?CGAAACCATCGCTTTTGTTGCCGACT^ 
TTGTACCAAOTGCCCTGCGCTCTGCGCGGCGCT^ 
GAATGCATGGGCGCGTGCGGC(&WX!OOCOT 

SEQ ID 6886 

MLSAKSLKQIDIELAKYPADQRRSAIH^^ 
B(^C(mAPmvmfflSI^FMTEEAlKKIABia 

SEQ ID 6887 

GAAACGGAAGTGAGAGAATGCTTTGTTGGCTTCAG^ 
ATGGATTTCTCACCACGTTTGCGGGCXOT 
TCACTTCCACGATAGGTTTGGAGTTTGC^ 
mACCGTCAATCATCMTACGTTCATGMTTT^^ 
SEQ ID 6888

FWlWQyVHEFGQUJM'EFRIWQVV^ 

SEQ ID 6889 

ATCGMGTTCAAClGCOGftAAAT^ 
TATCCCOTCAGACGO^ 
TTCGGAGAAAGTTCCCGATTCrreMGTTC 
QOyOCCGCAGACOGGTOG^ 
ACftTCGCTAITCTTGCATC^^ 
TCACTTGCAAGACTGTTCCCGA 
SEQ ID 6890 

APADRSAKNNGOTQRTOT^^ 

SEQ ID 6891 

TTTAGGACGCTTAGCACOTATTTAGA^^ 

CGAATCAGTACGACGCTGTGCTCTTGCAGGTTC 

TAGGGOTAGTC^^ 

TGGCAT 

SEQ ID 6892 

FRTIjSTVFRTGIiF^^ 

WH 

SEQ ID 6893 

TTGAGCCGCGCGAAATCACACCGCGTATC(rr^^ 

TGATTTTGGAGCTGGACGGCGAACAAATCGTCCG^ 

CCGTTTGGACT&CGTTTCCATGATGGTO 

ACGCGCATCCTCAATCACTTGATGGGtt 

CXX5(XGCACGTATGCACGCCG^ 

CGAAT(XCGC6AAGGCAC6ATGCTT6ACTrTATC6ATQ^^ 

G(XAT(^GTOTCTCTCCCC5A(K 

TGGATTTCGACATCCCCCTTCGGCGTCM 

GAGGGCXSAGACCTACACCGCTCTCGAACACC^ 
AAGGCATGGACGAAATGGCAAAAG<^ 

SEQ ID 6894 

LSRAKSHRVSSVRRT^^ 



GIGWSPERAMQKGFK^GSGVEWDVR^ 
KEimVEHPKGBPGraiSDGANKPYRUaRAPGFA 

SEQ ID 6895 

TTGCGTACCAATTGGTTGATAGTTGGCATATTO 

SEQ ID 6896 

LRTNWLIVGIFSRPVELNICEHHVGKSGI IFLLQADSQA 

SEQ ID 6897 

AIGAGGCftCGGGSMCTOCCSAA^^ 
AAGTAGAGTCGTTGCTGATGTATGGTCAGATTGMGCGG^^ 
TACTGAA<»TT<^ 
AAAAAAATAAAACGGTACGGAAAA 

SEQ ID 6898 

MRHGYLPKNIQKFKAQHRDLVLQ^ . 
KKIKRYGK 

SEQ ID 6899 

AT6GCMGCATTCAAAACTTATACGA6A(X3GTCGTCG^ 
TCATGACCGCATTGCACGACCATGAAGATTTGCAT^^ 

GAAGCCTTCGATTTGTACGGCATCATGTPC^^ 
AAATGCGTTATGACGAAGAGCAAAAACCX^GTGAOT 

SEQ ID 6900 

MASIQNbYETVVGVLGDQAGKV^ 
EAFDLYGIMFNiraroi^ 

SEQ ID 6901 

TTGCCGACA(XATGTCG(X^ 
(^TCAGTCGGACGGCTGGAGAAAATACTGTCA^ 

SEQ ID 6902 

LPTPCRQERNOTIASRQSSITTR^^ 

SEQ ID 6903 

a5AAACAATACGACCOTGCXIATCCAAACT^^ 
CTACCAAACGTTGG(X!AACTACGGCAGGCAG 
SEQ ID 6904

VRGKKDVSGISQEHGAIAGLVR^ 

SEQ ID 6905 

GTGCCTCATACCGATGTCTOGATTOCCGA 

T(mCGGTCGTATTGTTT^ 

CACGTCTTTTTTGCCOmC^ 

TTTCAATCTOTGAffTG^ 

SEQ ID 6906 

VPETDVFDIJIVRPI^IIE^^ 
HVFFAPHNRDMflF^ 

SEQ ID 6907 

TATTTTTATTGCiAGCAGAC!^^ 
AATAC(nTmTCCATACTGACAAACCAI 

SEQ ID 6908 

YFYCKQTVKHNroKKDDFLVCR* IPTDIPHNQSIXSWRKYOQyVFSIL'IMH 

SEQ ID 6909 

ATGGAAAAAAGGTGCGCTCGMTGGGMTAGMGGCGTTTT^^ 

GATTGGCCTGCTGCGCCGTGGAAATGATra 
GCTGACCMTAAAATGGCGCCAGCCCTGCGTCGCGT^ 
GTTCTGCGCG6TGCCQACCGTGTCGTGC(X^^ 
CCATCGCGCGTGACGAG 

SEQ ID 6910 

VVRGADRVVFWJWVP^ , 

SEQ ID 6911 

ATGATTCGGACTGGGTC^TCAGACTOGTCM 

rcCTTACGGAGTATCAGGTTTATAAGCMTTCG^ 

GCTGATCGATATCMTCTCGAAGTGGGGGATGTCG^ 

GATTCAAACCATTTGCX^ATCCX^TCTTG& j 
SEQ ID 6912 

MIRTGSIRLGQSVSDGTQL^^ 
DSHHUaWLAEEGLG 

SEQ ID 6913 

TIOTAACCX&TTTGGGTGTAA 
GCX^OTATGCCGAAAMGACGCGCCTTACAAATGCGGOT 
GGTTGOGTTTATQCTGCOGTGGQCAGTOGT^^ 
GCGCTGGAATGGGAA 

SEQ ID 6914 

ALEffE 

SEQ ID 6915 

ATGGTTACACCCAAATCCGTTAACAAAATTTCTA^ 

TGTTA 
SEQ ID 6916 

MVTPKSVNKISTIISYFSEFQKPLK(^YRQ1^KNPINTIHL

SEQ ID 6917 

ATACOTTTTTOCATACTGACAAACCATCOT 
TTPTTATGCCATAAAGCCGCGCTCAT 

SEQ ID 6918 

BffFHTDKPSIiAAVRIK^^ 

SEQ ID 6919 

GTGCXIAGAAAAAGACTTGGGATTTGAAGTGG^ 

AAATCGCTGAGITPCAAACCGAGCAGGOGGTACA^ 

ACACGACATTGATCAGGTCAAG^^ 

CAACTATGCATGGTTTTG^ 

TCGAGTGTCTIKXXX^CATTGAMCGC^ 

GAA(?PT6GGCGGTTTCAACCGCTTTTTC^^ 

immcmACCaCGCTTTGCT 

CGTCCAGGOTTTATAAAATCCCGAAGGGATQC^^ 

□^QGATAATCCftCAATTCCTTGTO 

TTC0X£CGCAG(XXX3CTGTTTO 

SEQ ID 6920 

VPEKDLGFEVAVMTVHHAALELPSTOTHHVGIGFP^ 

QVCHVIiIjIiD^ 

TIIXPALVVLRQFRQIJARA^ 

QDNPQFVVHIGFLKAVWQIRQHFVSAVQSISVRSSA^ 
SEQ ID 6921

AIGOttSTCTGAAAGOTC^^ 
ATCAGTmTGCATTTmCGGCATOTCGCCG^ 

T(£ACraTCAAAGCTGCGC(XXXX&TGCOT 
SEQ ID 6922 

^SERFRRHJVVCLLOT 
QIAUfflEVERHAAVGTQVAAVFGKCW^ 

SEQ ID 6923 

TTGCTTCCGGGCGGCTGCCCCTGC^ 
TAATIAATGAGTITOmAGGAAAC^ 

SEQ ID 6924 

LLPGGCECVIiPTIWUlFGVFLOT 

SEQ ID 6925 

ATCATCACX5CCGAGTAAAATTGTAMTTAATGTTGCAGGG 

ACACCXTAACGCCGTGAAGCC^CTCTTTCTO 

TTTTACTGTTCTAACGTTTTGCCAAATATTC 

ATTTGCCGTATMCGGTTATTGCCCAGC^^ 

T(XmGCATT(mTAATGCACACXX5GAAC^ 

GCAGGCTGTOTtXXaATTTTATCCATTTC 

CACCGCTTCCGCCTCACGGATAT^^ 

ACGCCCAMTCAAMCGGGGATAAACAAGATCGGGTCTTTCC^^ 

SEQ ID 6926 

IITPSKIW*CCRGFKyPPRVTRl^IjQSKTAV*YFHRTQ^ 

ICRITVIAQPVliFDKGPESPL*TMPITVAYFGMKRIiFAKIPGIP 

HRFRLTDIFPOTiPHIGRQHLI^ 

SEQ ID 6927 

TTGCTGCTTCAGTATGGCTGCGTTACAGAAAGGATGGA^ 

TGTGGGTAAGGATGTTFTCGTTCGCA^ 
CCGCTTTGTTGATATTCCCTTTATCGGGCAGT^^ 
ATCTTCATTACOTGTCTGCC 
GCAGTGCATTGGGTCOGGTCCTTATCGGCTTTGT^ 
AAAAAGTCTCCGACTGMTGCAGTGTCGGTAGCAGTT^^ 
MCAMCACGGCATTGTTGCGGTTTACCATAGAGATGGTGATAA 
CCTATCTGCTACCCTCCCTGAM 
MTCMTCCGGCATACCGTAGCCTTATO 
GAAAAATTCGA(XTGATTTTGATGMTTCGACT^ 
TGTTTMTACCACGCACAGCCCGCATGCTP^^ 
ACTGCimG(^CGCCTTTOT 
GAACCTTCG(X?WCX3a^GU 
SEQ ID 6928 

IFITLSAWRG1J?PLVHHVGTI£NXSGR^ 
HftiGIVAVYHRDGDKVVYGANVYDGAY^ 
EKFDLII^STtfyWRAYSTNLLSAEFLKQV^ 
EPSAGAEVITDENKTVEYKYGRGI 

SEQ ID 6929 

CGGTATAGATCGGCCGCCTCGAACATCGCCGCCTGC^^ 

TCAGAC(mT(m(XXmCTACCGACOT 
AfTTCATAACAAATTCAM^^ 

SEQ ID 6930 

RYRSAASNIAACRAFRNFRRHCIANRRK 

SEQ ID 6931 

TTGAAAATAGAAAATATGMTTCGACCGCAAGTAAAACCCTGAM^ 

GCCAWAGGTATCGATTTGAGTTCGATTACTGra 

(rTGrTTK^TCGCTGA^ 

CTOTATTGCTGCTTCCTACC 

ipQGGTGCGGQiCTCGGATCGCTTGCCG^ 

TTACAGAAAGGATGGATATGGTGAACAC 

SEQ ID 6932 

LKIENMNSTASKTLKGLSLVFFAS^ 
LLLLLPTFW^TIiPLLTCFFNRKIHN^ 

SEQ ID 6933 

ATOTX^GCAACGCCATTTTTCTTTTOT 
SEQ ID 6934

MPSHAIFLPVLPGHIjCSVRIOT 

SEQ ID 6935 

GTGCCTTGCTGCAAAAACAGATTTCAGCGAATTATT^^ 
AAATTTTCXTTTTCAACGACAATT^ 

SEQ ID 6936 

VPCCKHRFQRI IGYLMICAMYEADWIiKHCI$P^^ 

SEQ ID 6937 

ATGMGCCGCACAGCAGTTTGMCAGTGATACGOT 

SEQ ID 6938 

MKPHSSLNSOTPFAGIiEVYFPGLQFEQCGIJFI^IC^nB 

SEQ ID 6939 

TTOTCGGCACTATGGAATCACT^ 

!IAATCGCGCATTTGCAA(jATTATTTGCCG^^ 

ATTTAATGATTTGTGCGAIICTACGAAGCCGACTGGOT 

GCCAAATCACAAATATACXX^AAATC 
GTTAGATGTACTCAAACACOTAMTATAa 

SEQ ID 6940 

LVGlMBSIffAIIKPmQAFQKAI^D^^ 

ASAVQWFHQPDAFIAHCKTGLKTNGLI^ 

GKVRLTYHPLFFIARHSAAGRQ 

SEQ ID 6941 

ATGAGAAAAAGCCXSOTAAGCCGCTATA 

GCTATGATGTATTGGATGCAGGCGMTTTAGCCATT^ 
TTTACGCMGTTTGACGGCATTCCCAAAGAGCA^^ 

SEQ ID 6942 

MRRSRLSRYKQl^EIiFV^ 

SEQ ID 6943

TOOTCTCGCCGCCGAAACC^ 
TTCTCTAACATAGCCACCCCTTCGTTCACGC^ 
5GCCATATTGTTCCATTCCTTCAATT6AGCC^^ 
AAATSCCCGACCTTCATATTGAAAGCTCCCAGCC^^ 
CCTX!TATAGGATTACCGTCTTTTCGACTO 
ACTCAATCTTTTGCTCAC^^ 
CTOT(XCC<nCTAGAAATA^ 
TCTITTGGTACATCTTTTMTGA 
MTCGGAACCGCCACCCGCGCACGC^ 
SEQ ID 6944 

KCPlTIliKAPSKSWlCCHASNVAQIQVKRmra amLFFIRTVKIMiliTQTFG^FP^ • 

LFAV*K*HCIiDRFTLTBLTCILFSHSCIFITFinj*y^ 

SEQ ID 6945 

GTGGTCAAT(HXX!GTTO 

SEQ ID 6946 

VVNGRI/3GREGIARNnnENGSEWHRFALR 

SEQ ID 6947 

ATCCCCGGATGCX5TGATTATTTTGCXXjCAAA(^ 
TGAKPCGTCAGAAATACCC6AACCGTCAITO 

SEQ ID 6948 

MPGOTILPQTIPPKTVFGRLDIiSFGIFARRGGOTSCYGSM 

SEQ ID 6949 

AHXaTAATT^CGGCAC^^ 
TTGCGGTTTTTTAT 

SEQ ID 6950 

MHNLTARRYGTU(ftISTRI^^ 

SEQ ID 6951 

ATGACGGTTCGGGTATTrcR^^ 
TGCGGCAAAATAATCAC(OTCCGG(X^TT^ 

SEQ ID 6952 

MTVRVFLTIfiRAVATVFSTPSGKNT^ 

SEQ ID 6953 

AJTCAAAACAAAATTTTACAAT^ 

ATCCGCAACGGCAGTGAATGOT 
MAA!K?KmXKX^ 
ATCTTATCCGCTCGACAMCGCCTO
GTTTTCAAGAT^^ 

RICGDKASFH)YQQPPARPFDMQELTAI^^ 

S |^^TA^^ 
TGCTTCATCACOTAGGGAACCGT^^ 

SEQ ID 6956 

LRIHAVTATLYGLPLY(&I^ 

CCAGCAATATGTTTTCGTAACGTGCCGGCT^^ 

CGCAATCTKACACGGCGCAACATTT6A 

TCGATACCTTGCAGACGCAGAATATCGTGCGGAIXXj^ 

CGTATOTACACCGTCCACGTCAG^ 

AAAC^GCTCGGCMCGCGCGGCAGACCGCOSGTAA 

GCACCAA(^GGGAATGCaTCGAAACC^ 

GTTTACKGTCAATCACTACCAAAGTGGACAAACC6OT 

TGAATCACGACTITOCMGAACGGCCGATGACAAC^^ 
CWXmACGGGATGCC«^ 
GCGOGCCAAGTCGt^^ 
GK^CAAOTAffX^^ 

C(mAACCCTTAC(m^(mK3GTC^ 

CGCATACCGGACMCTGTTTM^ 

GTTTGGACAGGTTCTCCATCATC(CT 

TTOTTAACTTCGGCATTGGCTT^ 

TCGTCGG(^AA(aTAACCGTATC^ 

GGATTTCGCTCMCAATGCACGGCCGACGGTTGT^^ 

GGTACfiATAATCGGTTCQCCGTTGGCAGGAGAC^^ 
CAAA6TCGGCATTAAATGCGGCACACA(XIAATCGGTGCAA^ 
jlHHJQGMGWCflCQGMW 
ATAAATCGmGAACAGTTOAACG^ 
TTTGACGGAAGCG^^ 
TTGCAACATACGTTTTT^^ 
AAACGGCCGCCGTCCAACXXSAACCMCGGACGCAAA^^ 
GTTTAGCX^ITTTTTTGATTTTGGTGTC^ 
TO^TCGAAGTCGTCCK^ 
AATAC(XX3TTCGATATCACGCAACGTCATGTT^ 
G6CaaOTrGGACAGGCTMCTOT 
OC^CAAGCCGTCGCO^^^ 
TCAAACTOTCTTC^ 
SEQ ID 6958

PWHTAOHLTDITOFI^GNIiiTliQ^^ 

*' »«~^«««mt»wn «^«MW«*<W«'W'»WJQ^^ 



TGMRTESCQOflGF^^ 

FPNFGIGFSQQGRFFFVfHFHIVHGMGlfrAFCRKSETGW 

DIf(3!VC^raGKQAM^I»mGHVVQTQYI)V^ 

I^MDFFQDIPYFRWyQFFRSF^ 




PGIBt »ic* * * **" *■ * — *> — ■ ~" *~ — ~ 

AEQAVALGFERTVVI^ 

SEQ ID 6959 

ATGGCTGCCGCGGCA(XX3ACACGTTTTGAA^ 

TTCCC 

SEQ ID 6960 

MAAAAATRPEPIiIM)PHQRliI»YSGLTKTGTAIJ 

SEQ ID 6962 
SEQ ID 6963

TTGGCMCTGTCCGCTTG^ 

C6ATATACAAfiGGOTGGCTTCA<K^^ . 
A 

SEQ ID 6964 

IATVRMWVALPIiPE™^ 
SEQ ID 6965 

GTAGCGTTCCAAATCGATATCCAAACC^ 

GTACGG(XXnTCACGTCGTCAGACTTCACAGTCAGC^ 

TA(XX5CCCAGAGGTTGTTGAGTAA(^^ 

AAATGCTTCCXCTGMCGGCCGTCATACAGOT 

GACTCTTTCGCACXOTCG^ 

TGTAGAGTTTGTTCAAGAACTCGCG^ 

GATGTTCATACGGGARGGTACGCCCMCGGff 

CGGCCCGCCAflTreraCCGGCTTGCAGACGGCG^ 

AC^CTCATCCGCTTCTTCGCGTTT^^ 

CAGATATTCGGTCGTGATTTCGCTGCCra 

TOTTCAAAT(XIAAACGGTAGCGTTTCAAC^ 

TAGGCATAC(mTGMCTATCTTTTACGTC 

ATCGCCGGCTTCTACTira5CA^ 

CGGGCMCGAC&TTCMTTCCTCMTGTG^ 

TOTATTTTGACCCAAAGCCAATTC^^ 

ATTGGMCGGGTGAATTTAACCAAATKn'AAATATCGACA^ 

(X3GCGGGCAACGATTGCAGTAGCAGAGT^ 

ATGCGOGGTTCGCGTCATCGTGTTCCAAGAACG^ 

TTTTTCACGACAGGTAACCAAATCGCCMTCAG^ 

ACTTTGCCGTCAATAAOSCGGCGGTM^ 

ATACACGTCCATAGTGGGTCGGATGCAOT^ 

GGTCTGATCCATAAACTGACTCMT 

CGTTOTTTACGGCACGTTO^ 

CGTTACGCAACTC(^(X^GTCGCAATCGAGGre 

TTGTTGGGCTICGGACAGTTTTTG^ 

TCTTC(K3TGGGCGGTTCXX:CC(^Cra 

TGATCTAAAGGGTCX3TMTTTCTTTTACGCCOT 

AG(XAATOTTTGCCCAGCAGGCTTTCX^ 

TCTTTATCCMGATATCGACOTGGCAGTTO 

GCTCATT(^TAGCCCAAAGCCTTC^ 

GGGMTGATGCGGGCGGAGMTAACMTTTGCCGGAAGAGTGTGTC^ 

ATCACAAAAGAACCGCTCGGGGTCATCAACGGMTT^^ 

C^GCAGAGGGGCAGCATAAGTGATTCCGCGCAATTGA^ 

GAAAATAGAATTAAATGCXXjCTTGCAGACCGT^ 

ACATTTKrCGCTTTt^ 

SEQ ID 6966 

VAFQIDIQTQ'ANLFDQYVEGLGHTGIDFW 

RCTA*TAVIQRDLSFTWEAQFimiFAWSQVQHFADFRLFR^ 

DVHTGRYAQRVQYIWHRTAVRHVRHV^ 

QIFGRDFAAFGQLHRAAVGFLTDNHTLDTVECVTO 

IAGFYFRTDVNDTGFVQTVLHT^ 

IGTGEFNQIVNIDTHFTGSRFVVMDTDH^ 

FFTONQIANQIAV*IQIGIittiCNHIAAFFIX3RQVIN^ 

GLIHKLTQIiAGAEEFFDGGRllGFRINQVVRHQ^*FR^ 

LIXjFGQFLFVCARIKFHTAYAG^ 

SQCFAQQAFLraQTGQACIGNimiFR 

GNMGGE*QFAGRVCLTFVMLEEHTGRWQIjGN^^ 

ENmCRLQTTOTFVKSIFQLQ^ 

SEQ ID 6967 

TACACACTCCGAAAAGTMTTTTCAATAAT^ 

SEQ ID 6968 

YTLRKVIFNNRPSiaiNGTHQSDni^ 

SEQ ID 6969 

TTGATTCATCAAGGGCGGTATATTACA03GGAAW 

GCGGTAATTTTCCCTCTTTTAC^ 

CAACAGCGGCTATCACG(X!MTCTCGGTATTTTG(^^ 

(XGTTTTTCCOTATCGTC^ 

TGGCGGATTTGAAACAGCTTGTCCAAT^ 

mTCTGATTGOXSAGATTGATTTAT^^ 

TTCATTTTTTCMCCGCATTGCCGCCG 
GGGMGTGGCGCATCCX^CGCAMTMTGCC^

SEQ ID 6970 

LIHQGRyiTREHEKMUiMSSN^ 
AFFRYRH^YEHLKNLLEKNVGKPDR^^ 
IJFSTAI^PFNVAtflYFIFERLPQFSm 
SEQ ID 6971 

CCGATAAGTTTCXXSCACC^ 
GMCTTCCGTATCATAGCGACAAACCGCCCGGCC^^ 

SEQ ID 6972 

LSENGKGNFAAAWVIPANLHSVIPTKVGIRNAKS^ 

SEQ ID 6973 

TTGAGAATTTGGGAAGCGCATCCGCCGTCCGCCTTG^ 

GCK&GAGCTACTGGCTGCCGOT 

CGTACCXjTCCGTGCMCCGCGCGGTGGGA^ 

MCCCT6GCAGGGCGTA61WCTTTGGAAGGCGG 

GGGCGGGCTGCTGGTGO^TCTCCC^ 
GTmATACCGOTX ^^ 

TGCCmGGGCT(mX^CATACCGTCm 

SEQ ID 6974

LRIWEAHPPSAItfSRRPQPYSffi^ 
RTVHATAKWDETAEIADDESHffiL^^ 
VLYRLFHETPPRVFEPETFESSCTCSRGKVSimMI^ 
SEQ ID 6975 

ATCAAACAATCA1G 

SEQ ID 6976

LMNQIRNRTILlLEAEimiiSKNFHYDVbaPEOT 

SEQ ID 6977 

TTTGATTTCGACTTTAGCGCCTGCTGC^^ 
AGACCCAAACCAGTMTTGCACGGACAACTO 
GACCTGCAACTGCAACAGCAGCAGCA6AAACACCGAA 
AQCCM 

SEQ ID 6978

FDFDFSACCFQLFIiDVFGFSFimTIiFKGFRCAV^ 
SB 

SEQ ID 6979 

TT(XX>AAATT(XiAAGC(&lOTA 

TCCGACmCGGCAAAAAAOX3TA(K^m 

AM(?PGGGAAlXXa5AACGCAAAATCTAAA^ 

SEQ ID 6980 

LRlWKRCKFEHQW^SYPmTF^ 

SEQ ID 6981 

TTGTTTTTGGATGTCTTCGGCTT^^ 

ATCACGCCGACTTTTT^ 
CGAATTTTTCTTCAmGCm 

SEQ ID 6982 

LFlMFGFSFRimiFHGFRCA^^ 

SEQ ID 6983 

TTCTGCGAAAATATGCtmCGGT 
AAACGATTTTCATTAAAGGGGCAITACTTGATTGT^^ 

SEQ ID 6984 

LOTJKRFGTVRHJWVMATETNIAKISW 

SEQ ID 6985 

ATGATTTGCCGAAAACOTATCGGACACACACCATAT^ 
(X3XC(7TOK!ACAAGGTCAA^^ 
(^TCTAGGCGAACATAAAATCXXKjGCGAAAGOT 
TTGGGGCXX3GTTTTTGTCCGCTTAGGAGAC^^ 

OmAGMCAACCTTCACA^ 
GCCGTCTGTCTGACCTGCGGCAAAAA^ 
ACTCKXXmCTATC^ 
O&TTCATttXXSC^^ 
SEQ ID 6986

LGPVPVRI/^PKLGLGIHGTNAPASVPGIRSHGCVRMK^ 
AVCLTCGKNGKHKMPLRSI»OT 
SEQ ID 6987 

GCOiACTCAG(^CCTGAGCA<X^^ 
CAGAAGCAGCGTAAACCAACGGACCAACCATT^ 
TGCATTCGCAOMG^ 
ATATTGAGACTCAA 
SEQ ID 6988 
SSFFAGPPLCQSSQTWUCARYRSLHNBQQFGQQIiFAAl^ 
CIRTKLSHTGNTWAIFSI^ 

SEQ ID 6989 

ATGOTCCCGA(XAATCCGCC£CCAAT^^ 

GCGTGCTGACCACGCGTATCGCATGGCT^ 

GATTCCCATCAfiCGTCCGCGCTATGT^^ 

GAOaGCTTTCCCTCATCAAACGCC^^ 

TCTTGGGC6C6CCC6A!PCCGCACACAAGCCGCA^ 

OTGCAAAGTAACGAAATCCTGOX^ 

AACGCA6CGGTATTTGCCGTCGGCGAC6ACGACCAM 

AACAAAACTACCOTCCGTtt^ 

CTACTCC(XXTOi^ACeKX5A 

CGCGTCATCGAACAAAGCCTGTTCXXX^GC^ 

CCGACC&CGACMCGCOTCTTC 

CTGCAACGCCG(XXXX5AAAGC(XXX&A^ 

GGTCTGACCGAGCACTACCGAACCCAAAAAGGCGACAACCM 

CCGAAAACATTTCAGACGATCCCGCCTTCCCCATCCTC^ 

CGCCGCXAAAGGCTTGGAATTCAACGCCGTC^^ 

GTCGCCATaCCCGC(XXm^AAC^ 

TGCAC^CCTGTCCGTCAAAAAACCCGCOT 

CATC(^CAAMC(^(^(^CCAAATTCGGa 

ACCAAGTTTGCGAAATTGGAAGCGATG |! 
SEQ ID 6990 

MFPDQSAPNLLQGLNPEQI^ 

DQLSLIKRIiLKSLNIAEEIIAPl^ 

KAAWAVGDDDQSIYRFRGANVGNMAUIEEFM 

RVIEQSLFRSGIPYKHGGLRFYERQE^ 

GLTEfflffiTQKGDNQDRLDN^ 

VAITRARKRLnTMAQQRMIJJGQTQFGIASRFVEKIPI^^ 

TKFARLEAH 
SEQ ID 6991 

TTGTTTGGCTTTTOTACAC^ 
SEQ ID 6992 

IJtjFSYTAALPLSATSKKTNMPSBGFRRHISR 
SEQ ID 6993 

ATGCGCTATTTTAATAGAAACGGAAGGATGCCTGACAACT^ 

TCCAAACCX3ATTTGGACGGACGCATCGACAGGA^ 

CCTATCGGTTGGGGAAAATATTTOft^ 

AMCCCGGATTGTCAGGCTTCAGACGGCAIT^ 
SEQ ID 6994 

MRYFNRNGi^DNLIIilSVlaAKPSPTOWCTLI 

KPGLSGFRRHFLFRRLPLPPRTLPATIS 

SEQ ID 6995 

GTTATTCACGCTTGATGTAW^C^ 
GCATCAAAGTTCTCTTTCAAATCAGCTO 
CGGCAACATTAGGGGTAA(XX^ACCTATTTTAGGGW^ 
GTTCAGATTGCCTGCTTTGATTTCAGCA^ 
CCAGTACCTTTAGGCAGMCGAOTGAACXIACGGATMC^^ 
CCAAGGCAATTGCTTCGTCGATT^^ 
SEQ ID 6996 
VIHA*CINANT*THSACTSNPFQVLTPSSSRFGFiro^ 
VQIACFDPSSQIFEEDDIRTSI^ 

SEQ ID 6997 

TTGGGCGACTATTATOGTGAAATATGCCGTC^ 

SEQ ID 6998 

LGDYYREICRIiKPSIXjIFVRliDVAESGRAAV 
SEQ ID 6999

TACMCA(mCCACGT^ 
GCACGGGTCWmCCTACH^ 
CTGCAAATC^GTAATCACAACCGGAATTG^ 
TGGGGGGGATGGATTGGCTTTGOTXX^^ 

SEQ ID 7000 

YKTWVQJffl*ASRTCYSTWSRIQVSSSQIRLPS?SNPFQ 
WGGWIGFACRNl^PNIMNPLCS 
SEQ ID 7001 

TTGTCGGTGTGATTGCCGCX^TTATCGGCAA 

TTTGGAAGAAAAACGCTG(mTCG(m^ 

TTGCCGGCAGTTACGGCTCCMTC(%XX^ 

CCXICAACCCTTTAATCGTCAACTTGAT^ 

TTGATTTTATATTTTTTGTAT^^ 

TTTTCGGTATCTTGCTGCTGTTGTGGGCAGA^ 

TCTATTGACTTGGGACGATGTTT^^ 

G6A6TGTTGGCGGAAAGTGTC6G06GTTTGGGCGTTAQ0GGCAC^^ 

CTATGTTCGGCGCATPTCTCXX^ 

TTCACCTGTGATTTTCGGCTCGG 

CTGGGATATTGG 

SEQ ID 7002 

MMKLGFKPIPIAIMVLCALVtiALPVPDGVK^ 

FGRKTLGIGYSLALSELI^APVTPSNTARGGGIIHPIMQSIM 

LILYFLYPPE2W5TFNAVQFAKDI& 

GVLABSVGGLGVSGTAAGVILVIiAYMyAHWASTTAH 

LGYW 

SEQ ID 7003 

ATGTCCGACCACOXCTCC&CACC^ 

AGGGTTTCCM(XXrmGCCGA 

AAACCGCATCAG(X^TTTGMCGM6TTACCCGCACGATO 

(ffiXXOTATACCAAATCGCCGCAGCATTO^ 

CCGACGGCAGGCGCGMCGGGGGMTCAGGAGTTTTT^ 

G(XGGACGGTGATTCGGGCGCGTTGCT 

OmCGACATCAAATOCTGG!^ 

TGGCGGACAACATCAMCTCGGCCXmCGOT 

GCTGAAGAAMTTGCCGAGATGGGCATTATGCCGTCTG^^ . 

CAAACO?imX^TCTCGCCXX3AAC^ 

AMAAATCTK^GCGAACCTGMGAACAAGCGC^^ 

CGAAGCCGTCACCGCAAGGCTCGAACAAATCnK^^ 

(^AAACMCCCGACAGATACX&TGAT^^ 

TTATGAGCCAAAfiCTCGTGGGTGGCGGCGTATCT^ 

TOCAGACGGCATCGAAGCCTGCGGCGGCGATACX^ 

GAATCOTCTCTGACCMCTCTC^^ 

ACGGCAAGCTCGGC(XX^A^ 

CC(^CTTTCCGCCGCCACCGGTO 

MCGCATGGACATGGGMCACK^GTCCCTO 

(X^3CCTTGGCAG6CGAAATCATCGAAATGC6CGAAA 

TCT(&TAOTXXmT(XXX^ 

AOGXXH'ACOGCCTCTAOCGOOGGCAGCA^ 

TGTTCGGGGAGGAAGCGGCAACCGCC 

SEQ ID 7004 

MSimDTARRHSLPIiARQLDNGKIiKP 
(&YTKSPQHLSVVAMGKAGGYEW 
PNDIKSLVRPFVFRKyiiDyGAyEAMRNIM 
QTLPISPBQRQLIiABSMGFDSYAAFS^ 

ESIiSDQLSALADIVIAAALSCAWADMPK^^ 

NAWTWEHQSLTI&RFIOT 

TAYRIjYRRQQHNTKLRDAAKTEVSDEI^ 

SEQ ID 7005 

ATGCAATACAGCftCACTGGCAGGACAi^^ 

GCGOGAjS 

n^GTITOJGCGGCGACCATOTCATTACC^^ 
MTAOftCXaama»T^^ 
CGCCCCCAAAGTCMTGAAGACAGTGPK^GA^ 
(XBACCCmCGCO^

MTCCGACAmCCGOT^^ 
SEQ ID 7006

^YSTIAGQTEISIiVSWGFl^ 

LSLGGDHnTLPIJiRAHARYF^RLMiIHP 

GTGTPVCGGLSSDRAIjKILR^ 
SEQ ID 7007 

GTGGCGAGGTTTGAACGGGAAACGGCGGCATT^ 
SEQ ID 7008 

VARFBRETAAFAAVFLSAGIRPHIGARFQTASKV^ 

SEQ ID 7009 

SEQ ID 7010 

MPSmiPRYSIXK^IKKQRQMPPFPVQTSPRSPK'CA 
SEQ ID 7011 

OTMTCTTCTCAACCTGG^ 
(mACGGTCXTTTCATTTftCACGAA 
GCGTGGGTCTATTAGCCCTCCCTC^ 
TTOCmTAGTCTTACGAC^ 

SEQ ID 7012 

VNLIJnAELQIiRGCFSTKYIjNRHA*^ 
raSLTTIADIHNFLYRHQNLSOT 

SEQ ID 7013 

ATGCCTATCOTACCATCCCEX^GOT 
TTTCACTGCAAAAACTGACTGMGCXXn*^ 
GGCACGGGAA6AGTGC6GCTATMGGGCGGTTATTGOTGGTTO 
(mmMGCCGAACTGATGGCGGTTTT^ 
GGCATCAGGTTTATTTG<n^TTGAGMGOT 
(mAAATGGCAGTCTTCGGGTGGGGAAAA^ 
TTCCATTTGGGCTCGCAQCTTGGGAACATCCGT^ 

GCTGCCGCATCCGACAATCATTACCGAGAGC^^ 

axmGCACCGCGTGTGTTGO^ 

ATAATCAGTATMTGTCGGGCTGTTGAGTTTGGCGCAACG^^ 

CATTATPGACGAATKXIAAGAACGTTITCCC^ 

TT(^TG^CCGAOTGCGCGCCGC^ 

CCGMGAAGAGCCXXXXnTm 

ACAMTTACOTm^ 

CCTGCCTCXX^GGCTATGTCTTTCTTAAAA 

SEQ ID 7014 

MPIIfflREVCHIHHWSIC^^ 

GSKAEIMVIiAHAGTRQTLIVCNGYKDREnRFAIWGEKLGH 

FHLGSQLGNHDVATG^ 

PEAPRVLHSMWETWTDI 

LNEPIAHRAVLIfllTCDSDCTIIHHI^^ 

PASQAMSFLKBLEAGLNGYTYLEDE 

SEQ ID 7015 

ACCTTCCCTTCTCAGCAATACATCAAAAAATAAOT 
TCACGCTTAGGCCAAACCACCmT^ 
TAACATTCTTTTTTTCACX^ 
SEQ ID 7016 
TFPSQQYIKK*PRNCaCRIDmKHSNKHYmSHSIB 

SEQ ID 7017 

ATGCCGTCTGAAGCOTATCCGGCgl^ 

GCATGAAATC^AAAGAGTGATGGTTTTTO 

SEQ ID 7018 

MPSEALSGVQTAFCFPTVYKLSIiFLDRTO 

SEQ ID 7019 

ATGCCGATTTTOCTGGATATATCC^ 

SEQ ID 7020 

MPILLDISRDGKGRIDTGTGADDFGFPDMHKT 

SEQ ID 7021 

AGGCTGTTOjGCAGATTCGGCAGCATTT^ 
AGCCG(XXX3MCAGG(X^AACAIT^^ 
TATTMTGAC(NX!AAGCAGCA^ 
AGATTGCGTTAGTTTAAAOGATTTGC^^ 
SEQ ID 7022

lUiFGRTCSILYIttCSQFLSAQ^ 
RLR*FKRPARFRlBCGSVFGRHrEYAIAENFTOT 

SEQ ID 7023 

TTGAGCTATTCGCCTCTMACTTAAGC^ 

TCTAC^GTMTGGTTACGTTCTCACCX^^ 

GGGTATGGCG6CCGCC3CTOTCTTTGCTC 

A(X!GCGCAGCAATACGOT?ACGTTGT^ 

TCAKMOn'GGATGATACCTCQCTCTAC^ 

(^TGTAGCTGTCCAAT^ 

GCTGGACAGCAGGTCGCGGATTTCCATTTC^^ 

ATGTGTTOCGG^^ 

CGTGTGCGTAGTG(KGGGTTTCGGTTO 

TTTTTTAGC 

SEQ ID 7024 

IiSYWPUIIiSDNRENRAGTHGTAJ^ANRKAQT 



AGQQVADPHFNQPQQLGIVDHVAFVHE^ 
FFS 

SEQ ID 7025 

ATGCCGTCTGAAGCGTTCAGAGGGCATIT^ 

SEQ ID 7026 

HPSEAFRRHFACSSSQIRQAFIiIKKHrFHADIiL 

SEQ ID 7027 

AT6AACCTTGATTPAA(X3GCTCAAARAGTrc 
CAGATCATGATGAACGCGTGTC 
GTACATCCTCCTAAATGACGTATTCCATAGAAAGGAAGAATT^ 
MGCCGCCXyUWSA^^ 

SEQ ID 7028 

MNLDLTAQKVRLSWKDIIWGYENKYLGWAD^ 
MPPKDGnPSAHPYESTttRLYSHWBHYLKNQGGQG 

SEQ ID 7029

AGCGATAACAGMGAMC£ACGCCGGCACCCACGG^(^^ 

(XCGGCATOACC^TTTCCACACCTTTTO 
TGCTCMTACGTACACTTCTGCTTT^ 

(m;GcccGccT(^cemx3TCCAGc^^ 

TCTACACGGCCGGTGACTACGGTAC(ECGGOT 

CCAGTTCGAAGATTTTTTCTTCGTAAGC^^ 

CATTTCAACCAGTTCCAACAGCTCGG^ 

GGGCCGTCGGCAfiCAGAACATACCAGGATTGCACCGTO^^ 

TTTCGTATTCTACGTGCGAGGTGTTAATGGTMTACC 

AGCGGCAGTCAGGGTGGTTTTACCATCGTCAAC^^ 

SEQ ID 7030 

SMUffiAGTHGTAAFANmQTFPHSDmQ^ 

vari/tfvqqfaehfnac?^ 

HFNQFQQI/alVraVAFVHBHDDVRYAYLTGQQDVFA^ 
SGSQGGFmmTOGANVYVRFATFEFFLSH 

SEQ ID 7031 

TTGCTGTGTTTTAATGGCGGTATTGAAAAACACATC^ 

TOCCGGCGA£TCTGtf^ 

CTACTGGGGCAAACCTGTTTTGGAAG^ 

SEQ ID 7032 

I^FNGGIEKHILraQ^ 

SEQ ID 7033 

AT GCCGTCTGAACG^ 

SEQ ID 7034 

MPSERTFCRSDGIFIDWYPTVEEAISRffliEDPS 

SEQ ID 7035 

TTTAAACFGGATAATCTTTTCX^ 

GCGACAI 

SEQ ID 7036 

FKLDNIJVmQLVMGFGM^ 

SEQ ID 7037 

A5^TTTCCGGCAAAATQ3AGATTCTTGCCAM 

TTATCGACCTGCGCCGC(XHBTGATGCAGCX^ 
GACCGTTACTACCAAATCACCMGTGCTO 
TC»CTGAAGGCMtt^^
CAJK^AOTAAMTTCADOGAGTTC 
AWtf!TCAGC(£CAAAGAAAtt^^ 
TGCMTCTCCMTCGTAAAATTCTTGTCTGA^ 

AGCCXSTCGGCGCATTGCGT^ ' 

GCCGACCGCT^TTGCCGTACAOCATCCGT^ 

GGGAAMCGQCMGMKXaC^ 

GAAATTCG(?rcCGCCTCCra 

TGCCTGCTGACCGATGCGCCCAACAGCG^ 

SEQ ID 7038 

MISGHEILAKBIEVLNAAAOT 

KFSRKEIDEYTKFVGHGAKGIAYIl^^ 
OJiTDAPNSVDD^LRBLSLRLRQKAVETKEA 

SEQ ID 7039 

ATGGGTTTCGGGATGTTOTCGTCAAT^ 

C66GCGCGGATTA 

SEQ ID 7040 

MGFQffFVl^nAVHRANLIALR^ 

ATAMTTGGGCCTGATTCCCGTAACCGACMCCGT 

C^TTTGGGCGCGCTCGGTTTCGGCTT^^ 

ftTGGGTPITJACAACCACGGCATO 

ATGATTATTTAATCTGCCTCGAAAAA 

GCTCGAAffECTG^^ 

raTCCMGTGTACAGCGGATTGATATACAAAGGC^ . 

SEQ ID 7042

HAAFPSEAESPVM^IiARRILFALDAEKAHHFT^^ 

IffiFNNHGIDAMIRKIEKSKVQGVLGINIGKNAW 

VKSVEMEIttTmiDKSS^ 

SEQ ID 7043 

ATGTCAMCATCGAACAACMGTTAAGAAMTT^ 

TGGAGTTGGTTATGGCTTTGGMGMGCCTKX3G^ 

SEQ ID 7044 

MSNIEQQVKKIIABQIiGWNEM)^^ 

SEQ ID 7045 

ATGCGTACCMCTATOCGGO^ 
GCSAAGGCAKBTCC^^ 
CGACCAACGA 
SEQ ID 7046 

MRm(X;LISEQYLCQ7VTVRGtfVERR^ 

CAGCCGMGGCTTOTamGC^ 
CAATMTTTTOTAACTTGTKH'T^ 

SEQ ID 7048

LRGAAEAWSGDDRUV^ 

SEQ ID 7049 

A!«mAAOTTTTGAACCCC^ 

CAGGOTAGGTCAG<?mxra^ 

TGTCGCCGGCGAAGT(X^CGGTT^ 

GGTTTGGACGATGTGGAAAACCTCGACAAAGACCGCAT^^ 

AAATCAACCXMOTTATCCCC(M 

TCCACCXBCWmC^^ 
GCGGCGCXjAAAAIICPACGCCGAAATCGTC^^ 

AGCTC(&CCAAATCCATGACX^^ 

ftCOTGAAGCCGGCTGCGATTTGG^ 

CTTCAAAQGC 

SEQ ID 7050

MA!KajJPCIQCWASARQPHMPPIP™ 
GLDDVENIjDKDRIGVNIGSGIGGLPSIEATGKAVIEGGARKINPF^ 
STWJDDPATASRPWD^
SSTKSMKMiLGAAGGV^ 
SEQ ID 7051

AACAGACG6TTTGCACCTGCCCCAAACCCM 

GCCGAGTGCOOTrGC^^ 

GOaAACTCAACTTCC^^ 

SEQ ID 7052 

FSHLENP*FKCTIJ , RLPLPI^ 



SEQ ID 7053 

ATGi^AACCTGCGGC^^ 

CCGCGTCCGACCAGCTTGTCAACCTGCT^ 

ATTT(K£GCAAAC^^ 

(^CAGCAGCCGTOXSmAAAACGCCGCTAOTCG^ 

GC6ATTATCTTT(XSTGTATGTCCCGACCACGC^^ 

GATTTCGCTGGGTAIXKTCATCCCT^ 

S ^AAEGC^ 
DSSRSFKTmVPFPQSGIWTIAFVSGQVSNAVKA^ 

SEQ ID 7055 

TT(XXrarrTCG(K£ 
CGGC«3CTTaC5&C6GCAC(m^ 

SEQ ID 7056 

LPPSAAGSVII»Ii)SDKtfRTNRTLYAP^ 

SEQ ID 7057 

TOCTATGTTOCCTGCCGCCATM 
TCGGTCAGCTGCGGAGCAACTCGAT^ 

SEQ ID 7058 

LWLPAAIIKSLKVQTVLYKTCRRI^^ 

SEQ ID 7059 

TTGAAAAATMC(^TC(X&CACOGCATC 

SEQ ID 7060 

LKNNGLDTASTEKPAAH^^ 

SEQ ID 7061

ATGAC(XCGTTATTTA6AAAAGCC6TTTGGCTGCTGTTT^ 

AGCGGTTATGTGTTTTGGGAACX^GCATTGTGTGCGGC^ 

TTCTGTT^GGCATATCAGTTGTGGTTGCAGT^^ 

GTGCGCGGGTTCGGAMTl^ 

AA 

SEQ ID 7062 

l^LFRKAVWIiLFAVS^ 
VRGFGNCAEPMLMWAl 

SEQ ID 7063 

TOAAAAAAGAAia^TATOSGrGm^ 
AAACGGCTTGT 

SEQ ID 7064 

LKKESISWPRKRHRAAVRCYAVS^^ 

SEQ ID 7065 

TTGATAT^aSTPEGAAACGGTATK^ 

CGTACCACGCCTn^GTTTGQXTT^ 

ACTGCAGCTTCACGTCGCCATACCOT 

CATGCCTGCCATGGCATAAATCCTGTTCC^ 

TTCACAGGCTCGGl^GGAAAATTTCAGGTTT^ 

Cq^GTmAAACAAATOTOGCM 

GGCATAGTTGGQCGAATTGTTATACCAAAACGGCATAAOT 

TAGATATCGGCCK^mCtt 

OTTOACTCCTGCCCTTC^^ 

AAAAGCGGATGCCGTCGGCMCAGTtt 

SEQ ID 7066 

UOlIjaiYSDAVPVmQ^ 
HACHGINPVPRTRFGTAIiSVGLVDM 
GIWMVIPKRHKVHRIWAYAQQPAQIVFVQV^ 
KSGCRRQQSCQKKIAHISCIHIjRPHCRHIGFQTAFYSGPG 

SEQ ID 7067 

ATGACECACACCGCATCGAAAACW^^ 

ACACCACCGAATTOUO^ 
TCCATCATTATG(X^CAGGTCGCCGC^^
TCAAGGACTCGTCGTCCTGCCCGAAGTCGGC^^ 

SEQ ID 7068 

MTHTASRTPKIjWAVIAAAAFILLITIC^ 

SEQ ID 7069 

GCTTGCGGAGCAACTCGATTTCTTTGTC^^ 

CCGCTOTGGTMCC^ 

ATCGGCAATGGCGGTACOXn'AOT 

SEQ ID 7070 

ACGATRFLCHiFflHFFAVFQFE^ 
SEQ ID 7071 

CTGQGGCGTATTGCAJ&CATO 

^TTCGGGGCTGATGATTGCCGTCGGGCTG^ 

CQ3GACTGGTC^CGCAGGC6GTTCGGCAGGACMT^^ 

TCAWU^CGCCTTCAAAACCC^ 

SEQ ID 7072 

VGRIATIVRRACRPPRRVQGIKIffiCPPARU^ 
AIALLILPVSWWLAGGNNGGNIM 

SEQ ID 7073 

GMGCGG^GCGCACGCCCMTGAGGCTTCGTGGG 

ACGCCCGCCATC(XX3CCGAA^ 

CAGTCGTGGTTTCTTTTTCCAT^ 

GAGAGMGAMCGGCGTGC^TGTGCCGTTTTCTT^ 

GAATAmATTOTTGTGCCATTraTO^ 

MTCGOTACCGTGCTTATTm^ 

TraCC(XK:CGCCCGCGCTGCGGAA(^GAAGAGM 

SEQ ID 7074

EAVAHAQ*GFVGPEAGVFQASPIVVAVA6VQGQAGRIOT 
ERRlK^ECAWLMFRLIiGmKMIA 
FTRRPRCC^EEKRREEKRREKKRREEKKREVFCRPYAYQ 
SEQ ID 7075 
TTGAAAACCAATATGCCGACAATGGGGGCGG^ 
CTTTGACGGACMTGTTACGCGTTTGGCGT^ 
CCGCAGCAGCTOGATAAGGGC^^ 
TTGGACAAAAACXIATTTGCGCGGGCTGTTG^ 
CTGTAUGGAAmTOGGACAG^ 

CTGCGMGCTC©?rGCG^ 
CGCGCGTtntXjGTGCGTGCGTTCGATC^ 
GTATTCCGTAITCCGCTACMCXXXLAAMCGGGCT 
CTGM^ACTACAACCACAAGCAGA^ 
SEQ ID 7076 

LKTNMPTMGAEMNTRNMKYI^ 
IOTfDLRGLIfiVREHKP^ 

LRMLGAGFVHQSNGQSRPESRSWNIUYAMAGMEWGKLTO 
LIDYNRKQNGIGIGLMFKDWDGI 

SEQ ID 7077 

pre 

AGGCXSGTTCGGCAfiGACMTTCCTGT 
TTCTCCCEGTCTC^^ 
MCCCCAAOTACATO^GCT^^ 
ACATCCATCGCCATCATCGGACTGGCAAACATC^^ 
TGCTGJmACATCTTC^ 
CACGCGmCCTCGCCACCCTGTT^^ 

SEQ ID 7078 

IJ>S«X:SHSAPVPAGFPSIAroRSPPKSH 

NPKLHPAAPELFRLIttiPHRLS 

HMjPRHPVRTGDAHPR3KRIPRLVHRSHR0YTIR 

SEQ ID 7079 

ATGCHOTGATTTACATCTTC^ 

SEQ ID 7080 

MVLIYIFSPKTDLNFHFAAALGFTWI^^ 
SEQ ID 7082

IA^KSSQCrmBQM^^ 

^i^gjiaotmtoattto 

J^IACftGCCCGI^TGCKM<K3GRAIGCCGTCftiftCTTGCGCS& 

SEQ ID 7084

KHVPIF*NPGQPQ*IIiI"YQU *IKWSNFYPOTVKTPSAIiIiQIQPEHIiF<aftVKLAQ 

AQCCGTAGTCACCGMTTGTGTAATCiO^TGCCGCCGATGTAC^ 



CATCAAAATAAAAAACATTCATATTTTGGGCOTAATC^ 
TTTCTGAMGAAMGGGGATTGATGATTraATTCGGGCGGCGGAACAGGTTAAGGO 

OCGMGCAIXCAGG^^ 

TCGATATTTTGAAGGCQ j 

SimFIRAAEQVR3^^ 
UAEKHIYFIENRMVRIMAHASYAIAKDK^^ 

ATGATGTGOmCACTATITOm^ 
FAVXHFQMRmm*smaua>P*PRBFRP^ 

CAGfflX»TAAA<CT3Ca^ 
AAAAGTAAAAGCACTGGCT 

SEQ ID 7092 

MANSAQAWCRAI^SVKQRAHHASIATAFRTAVimKAVEAGDKAA^ 

^TOCGPT^^^GGWCGAITIOTTCTP^ 

TGTAATOGCQGMTTCGOr 
SEQ ID 7094 
GEVFGQEFIDDVACVGGHEAFLMGN^
DGNIIX&LVEF(£RHLTGNG^^ 

SEQ ID 

AAAAAAACGCCGTGQGCCTCAAGCAGGCAGCSC^ 
CTTTAAAAAAGCCAACAATCTGMGCTGACTTA 

ATGCC(mXOTATTTCTCCGGC^ 
CAAGCK^TCCTGC(m 
G^GTGTTGAAGTCTAIIPCCffPCCGGACG^^ 
OGCGTTCCGMGAAGTGAAAAACAACGTCGGC^ 

ATCTTCCCGMCGAGCAGGATATGAMGACGGTTTO 
SEQ ID 7096 

VF^GVFFFSNLFHQBSARMKKSVIAVLAALSIAACGG 
IJflQIEAGAYQKVNKDLIPNYKNIDPEIi^ 
EVLKSIRPDVKRFSPSIIBELARCa^ 
IFPNEQDMKDGPVMPQMSA^ 

SEQ ID 7097 

TOGACGGGGATTCGACGATTAAGAGGTO'TTCGC^ 

SEQ ID 7098 

HX5DSTIKRFFAICRSLDGGFDAGVSDGIGRAGAV 

SEQ ID 7099 

TOCGCCGACAGGCTGAAAATMCAATOTTACCGCAmra 

TGCCGGGTATCGAAOTGAACGCGAGGTCGAAAACTGGC^^ 

TCAMTCGGCGGAGACGAAGGCGMGATATGCTGTC(^ 

GTCCACATOTTATCQATTTCCT^ 

CCGttTTGAAAAAATTCGATTCGGCAGGCATG^ 

AMCGCCCTCGACAGC&TTGACGGCAACOT 

SEQ ID 7100 

LRRQAENNNFYMIITLIGIKLKQT^ 

VHILIDFLDBQGYLTBSIEDIIiWfl'PLEWMUJEAMIjKQALTA 

PFPPPVLPRPRPSPILTEHSPTCtfLSAV 

SEQ ID 7101 

ATGCCGTOCGAACGATGTin^CX^Tm 
GTACGGCCHTOCCCCGC^ 

SEQ ID 7102 

MPSERCSKILYVflEIEIFIPSPPRPGFQIAOTSGL^^ 

SEQ ID 7103 

CTCTTGCGGTGCAGGCGGTTCGATTTGTTCOT 

CATTCTTCGGCMCGGGCGGGTTCCMGT^^ 

GGCATTGGACG(£GG1ATCTTTGGGT1^^ 

GCATTTCGGGCATTCGCGACCGTCGAC?^^ 

GCACA0GCAACGAM(X3(XXOTTTTG0CGAAOT 

CTTCCACTTGTTTGATGAAC^^ 

TTGGGCGAAGTGTTCGGTCAGGAATTTATO 

OTCGCTCATTTCGGGOK^^ 

GCCGACGGTMTATC(&(X3GTGGTTTGGTC^^ 

CITrcGGGCGTGCOTACAOGGAAGTCGGGCGG 

CAATGTAAIK&CGGATTTCGGTCAAC^^ 

GGTJM^GCATG6TGC6GTO3GT(X>^ 

ACOmCTTTGCCT^ 

OTTATGGCTGTCGAGATOCAOCGTC^^ 

TCGGATTTTTTTCCACAfiCAACGGCG^^ 

AGCACGGCGTTTTTGOTAArcTCGTGGA^ 

GCCGOTTOGGGACGA(^ 
SEQ ID 7104 
LLROUTOIOTAAFFLRA^ 
AFGAFATVEFDIJGFRFGDAFGSFFGGFVDVAOT^ 
LGEVFGQEFIDEVACVGGHEAFIiLKGNVFAW 
ADGNIDGGLVEFGGRHI/IWTLPNHLIEFELVCTQEGFD 
GTHGAVGGEAQFTHSILIiHGGCGKRR^^ 
SDFFPQQRREVETDQIVQSAAGIJif^^ 
AAFGDEVADVSVGEHDFRIAAQ 
SEQ ID 7105

ATGGTCAGATGGGTTACGGGGCATTTGGAGGTAGTC^ 
GTTTCTTGATTGTMTAftATTTCT^^ 
SEQ ID 7106
^VTOmEVVIALVPFLRLVK^^ 

SEQ ID 7107

ATGGATTCTTTTTTCAAACCGGCAGTTT^ 
TTGCXXaAAGACGAACAGCCCGTTTO 

cmGcmx^mmx3GGCGG(m 

TTCATGCAGCACATCTTCAAAC6C6CCATO 

SEQ ID 7108

MDSFFKPAWAVLWLMFAVRPALADELTN^^ 
FMQHIFKRAMGINIiPRTSAEQA^ 
SEQ ID 7109

SEQ ID 7110

MYYFGHCADKSPFCPLSGFWIGCT^ 

SEQ ID 7111 

rGTGGCTGMTGAAAAACTCGCCGACTTCGTGCGCXiAA^ 




GTTCGCCC6TTTCACCACCGTG6CAGGCGAG(^^ 
ACGOTCTtOTCTT^^ 

CCGAAGCCCTGCACCMGTTACCATCGTGATGAGCGAC^^ 
TTGGGTGAAATTCCATTTCCGCAGCCAGCMGGGATTAAA^^ 
CGCGGCGAGTTCCCGAAATGGACGATGTACATCCAAGTCAT^^ 
TTGMGTG(XX)GAATTCGAGTTGAACCGCMTCXXX^ 
GCMGCCCGTTTGTTCAATTACGCCGACGCGCMCGTO 
CGCGCCGAOXmCTAC<XXa^ 
ACTACCGCCAAGACGATGACGACTACTK^GC^ 
CTTCATCAAATACCGCCATATCCGCMCTGCT^^ 
GCACTGGGTCAGGCCGGTTTGCTG 
SEQ ID 7112 

MNNGAFVADNQNSLTAGPRCTIiLTQ^ 

TPVFFLRDPRKFPDLNKAVKRDPRTNMRSATNl^ 

RGEFPKVPI^QVMPEADAEKVPYHPFD™ 

RAIXMGSIiPHYEPNSFGQWQ^ 

ALGQAGLL 

SEQ ID 7113 

GTraTCGTGGGTTTC^TCX^OTAAAAa 
GTAATTTCGTCQCCGGG€ATGTG(^T6AGCGCGTG6ATGAT^ 
(X3GTTTCTTCTTTGCTGTACACGCGCM 
ACC06CTTOTTCAAGCAGCATACCCM 
SEQ ID 7114 
™gfvve*krahqlvadqygqlalv^ 

TRFFKQHTQVFGRBAGIEILQlFDEVGDDBXa 

SEQ ID 7115



GAGGAAGCCGCAGACAGTACAAA 

SEQ ID 7116

LRIIMLQNP^PDSRLRGNIXXRFFVFPITC^ 

SEQ ID 7117 

TTGTTAATTCACTATMTCAGGAAAGACAAAAAAC^^ 
TTCCCGCGCAGTCG 
SEQ ID 7118



SEQ ID 7119

ATGCCGTCTGAAACGGTCAAACGACGTTTCAGA 

AMCCGGCCTGACCCAGTGCGGGATCGGTCGOT 

TATGGO&TATTTGATGMCn^ 

OXCTAGTTGCCGTCGG^ 

AATTGAACAAACXMGCTTGa^Tm 

CTCGAATTCGCCCAOTCAATCAGCGGATAG^ 

CATTTC(MGAACTCGCCGCGC^ 

GGAAATGGAATTTCACCCAAAAACGTTCG^ 
CGCAGGAAGMCACGGX^
CGGTG(?I£AMCGGGCGMCATCTCGGTO 
GTGCATACGGCG?TCGGGGATGACTTC<X^GCAC(^ 
ACGGGCGOGCCGTTGTI^TGGTCAGATGGGTT 
AMGCGTATTCGTAAOCGGTTTCTT^ 
SEQ ID 7120

MPSETVKRRPRRHFYARLFPVSDGAGQDFFNFH^^ 

VT^VVPVCRVAVDFQRRFGEVGLLLPLAEAVGFWR 

LSFAHFNQRIWFRPNFGQVKMIRHFFGIGFRHDLIOT 

NLVQGFGQQRPEVPVVCGGTKIGARIAFDGPVQVGBFARVAQEEHGRW 

VHTAFGDDFAHEVGEFnQPQILRQQGAARAGGQ^^ 

SEQ ID 7121 

TTGCAACTTCAAATCMTCCGCAAACCGGMTC^^ 

CmTCTCAAGAAACCGTTTTATCCG^ 
SEQ ID 7122 

LQIfllHPQTCIPSFPRSRESEKVRTEm 
SEQ ID 7123 

TTOTMTCCACTATAAACGCAAAAIATC 

SEQ ID 7124 
LLIHYKRKI SRHSRTIPT6SNRHENFSASFPRKHBSGT 

SEQ ID 7125 

AT6AACCGGGKS3AAACCCArTTTATCC^ 
GCCGTCTCGGGCGTATCOT 
SEQ ID 7126 

MNRVETHFIQYAADGCAACRPFREHFCGRRRGVM 

SEQ ID 7127 

ATGACGGAGGGGAACGGOGCACGGACATTTGCCGG^ 

TCAGACGACCTGTAATTM(X3GCTGCTTA^ 

AAGTTttXmATTCCCCCC^ 

GGCGGCTTGCTAraGGMTATCTGMTCG^ 

GCGa^TGACGGAG(X3GAACG(m^CG(^ 

TGTTTTTCAGAC6ACCTGTAAT 

SEQ ID 7128 

MTEGNGARTFAGIFYVSinLFSMRYSRK^ 
GGLLLGISESSFPWCVffiSRNPNAARIYRRNRNPTDPDSRL^^ 

SEQ ID 7129 

ATGACGGAQGGGAACQGCGCACGGACATTTGCCGGMT^^ 

TCAGACGACCTGTAATTMCXSGCTGCTTATCK^ 

AGTTCXXXmTTCCCCCC^^ 

SEQ ID 7130 

tfTEGNGARTFAGIFYVSIIIIiFSMRYSRKKK^^ 

SEQ ID 7131

TGTTCGGTACGGTTGMGG3TT^ 
GAAAATGAIHXXiCGAAGGGTTGTO 
GTGGGCTTGGCGTTTATGGAGGTAAACATCGCC^^ 
GGCGTGCGGAATTTGCCGGAGGTTTGGTGTTGATTG(XA^ 
SEQ ID 7132 

MSLYALUjVAIiGMSMDAFAVALAKGMVRMPPRKIM 
VGLAFHEVNIAFAAAVIGMAATVMVTIGLTAGKAFGVI^^ 

SEQ ID 7133 

A(HTO^TACGCTGGTArc^^ 

GGCCCGTGTCGGGCGTATCCTTATTTATAGAATATGACGM 

CAGTTTGTGGCAGCCTTTGC^^ 

ACCACCAGCGTMCCTGC&ACAGGGCGGCAAT^^ 

AGGCAAGGTTTTTGTTGGCCGGCGGATAAATGCGff 

ACCCGACACGGTAGGMTGCCTTTCCCACC^^ 

CGCACGTTGCCGCGCAAAAACAAAACCGGCGGCGCGGTCAGC^^ 



gctcoeaaactcx^^ 

SEQ ID 7134 

SSflftVSAAGHCGKAAVQFQFQQCXSIQlCGRH^ 

qfvaafavgvvyrtgyrhhfaphfcrqaggnqrtc 
trhgrnafptqrpaeipgnphrlrrgmaa 
CGamTCGGTTTCTCrGA^
SEQ ID 7136



SEQ ID 7138 

MMSENKMPSBUiPVSDGIIiLHIiILYRYTCVKIiIiKIKy 



ATTTCCGACAGCAGGTCGCGGATO 

miOTPHIQQHHCRAM^IAS^ 

AGGTi 



ACAAmTGTCGGCGTTTACAACCGA?AT(XCGCCCCCG^ 

SEQ ID 7142 



GEIBAHBAAGRGVQS 

IWLTFVDDASTYAYFIVEEECKNEGYEKCKAKKIMJGKDERQCTSTRDyTGPI^ ■AurLSiEStumur uru mbbmi^l. . w — 

SMUIGIHLSKranBWIWW 
SOraVKKHFTUlAGVOTiUraRXV^^ 

SEQ ID 7143



CATTGAATCAAACCCAAATQCGA 

SEQ ID 7144 

LRKmLFSIAiaNHTDRESSWKCB^^ 

CCGAimACCCm^ 
GGCJAGTTCOGOTX^AACC^^
GCCTCGACCATTTCTTTTAACGC^^ 
AMGCTGGATGGGCGTGAGCGGATTGWGATTTC^^ 
GCCGTTGCCGTTGTCTTCGGGCAA^ 

AACGGCCTTTTTCGTCAMCACCAC^^ 
AAfiCTGCTCGG^TATGGTTGAACAGCTO 
AGJ&TGGGTTCGACGAAACGGCGGGCAAAATACi^^ 
MCTCAAIITCGGCATATTTCGCCCGCGCC^^ 

ATGCTITITTCGATTTT^^ 
CMCTGTACGGGAA0GGCGTTGCT6ACGGCATO 
CGTGCXXJTTGATAAACTGCGCGGAAATGCCGAA^ 
TCTTTCAACAGCMTATGACATAACGTGCCAAAAC^^ 
CCC?rcMTCCGTACAGCAGGACGACGGCGCATATG<^^ 

SEQ ID 7146 

RlSPYRFWQDDSDTRPmiilAQADAAAVFS!© 
GKFRLKPARAGFVQG^IAD*6VQIIJFQFERRGARVIAECIj^ 
AVAWFGQYRRIAQQDFGIVRRGIF^ 

QPSLRVQYMYAFQTPNRTGLI^LFPMFFVGKIUiVELVRII^ 
RAVIKLRGNAEQEQAGQTC^ 

SEQ ID 7147 

TTGAAAGACATCGTGTATTCCTTATGGGAGGTT 
ATCMTATTTTATTTTCMTAGCTTTACACAA 

SEQ ID 7148 

LKDIWSLWEVKIIVQTGRQTCSADYFFSVPIAPI^ 

SEQ ID 7149 

ATGTCTTTCAAACTCXX5TTACCTCGCATCAOT 

Cmm(H , AaGGCCTC(^GCC6TTC^^ 

GAAAATCCGTCCGTTTACCGAAGMCAGGTCAAA^ 

• GTGGACAATATG(XGACCAGTCAGGGGCACAAAGGGC^ 
MGAACA6GCTGGC6<XXXnTrc6TGAATCCGGTGGAAATGGCGW 
TGAAGACGGCGCGGTTTCGTTTAAAMCATCACTACCGC^ 
CAGGAACGAAAAGGCGTGCGCGCGTTCA!KWAC^^ 
TCATTGMGATCCGAAACGCGACAAAG«X^ 
CGGCAGCGTGTGGGAGCTGGATAAGGCAGTTAC^^ 
GTTTCCGMTC^GCGCGCACGATGCCGGAACT^^ 

SEQ ID 7150 

MSFKLRYIASVIAISSIiI^^ 
VDNMPTSQGHKGQFGGRNSPTALNAALLGSQFWK 
QEWtGVRAFMDMSCIACMNGVNLGGTT^ 
VSESARTMPELPLTAPMESKPDWK 

SEQ ID 7151 

TTTGTTCCCGATC&TGTTTAGAGGT^^ 

CGACGGTMCGCGGTAGCGGTTGGTCAGQ^TGGAAAOT 

CT(XX»GCTTMGGTAAAGTTGAGCOT 

IK£GC(X£GGTCGCGGCMTG(XX^ 

SEQ ID 7152 

FVPKJV*RFPTRI04PVLRSQCVDLERLGQF*S^ 
LPA*GKmYAPAQRLLELVGQWLBAAADGKPPAVOT 

SEQ ID 7153 

ATGAGCAGA(XXXftA(X!CGCCGTATTCGGCA(X^ 
QCCCGGCGCGCATG(XXn i GCATTACG^ 
MGTGCGGACATTTTGCACCTGCCGCGCXXX^ 

A(?IOTXjTTCKXXX^AACGG^ 
GAOTCOSTCCTiaCTCTCGC^ 

CAAATACGGCTGGCTGGTCGAAGTGTGC 

SEQ ID 7154

MSRmAVFGSVPHSQMPVLAYREGKWQPTEW^ 

PTIiIGTOPVIGKAGSPSETALLYILASPVGDYFKVGSPVKILVETEffi 
DSVLTVAKDLGYTVSEWJF^ 
SEQ ID 7155

TTGATTTTTWATCGGCTGTTGCl^^ 
TTTT(7rGCATATCGGGAAAAC^ 

SEQ ID 7156 

LIFLSAVAFDSLNLPKFKFKSS^ 

SEQ ID 7157 

ATCMCAATCCATTGGTGAA^ 
CGCCAAAGTATC&AGATGTTCCTTC^^ 
TGAGATAAGACTCTCTGMGGTGATT^^ 
TCCAGTCMCMGGTGAGGGTATATCTAAGGTTAC(K^ 
AAATAATCGAAGCCAGAAACGGTCCTGACGGTTO 
TGT(X^GCAAATCA<^^ 
GGGCAGGTAGGTTTGGGATTAACMCGGi^^ 
ACTA»(XXrrT(aGGCTCAAGTAACAGGCW 
CGGCGGCTTTTTCGGCCCGCAGGGTC 
TCGGGCGGTGCMGCGTTTCCGCATCAMCG^ 
TOSACMCTTCAGCAACGCCGCCC^ 
CACMCAACOTACATGCCGGAAAGTGATAAAAMGACACX^^ 
TATAAAGTCGAAGCCTGCTCnrKXy^^ 
CACAGG(H^CAAAGTATGTTCCTCCAAGGC^ 
CTG(ftCCGG<^CGCCTC^ 
ATTGATGCXATGATTGATGACMCGGOT 
TGCAGGGCGGTTTTTA0Q6GCCTAACGCCGAASW 
AGCAGTCAGCGCGACCGTGGTATTCGGTGCGAAA^ 
SEQ ID 7158 

MfflJPLVNQAAmFVFLLSACI/^^ 

SSQQGBGISKVTDYHDFKm^ 

GHVGL6LTTEITVNFEKKTLSGKLIKNKMVIM 

SGGASVSASIK»TGTSSGNSm>TmDAVa 

YKVHftCCSNLITCIiKYGIiLTR^ 

IDMIDDNGFKGTMTGNDGFAPDQNSSTGTYKW 

SEQ ID 7159

TTGTCGTCATTCCCGCGCAGGCGGGMTCCA^ 
TTTCTACATCGAATCCGCTTTT^ 

SEQ ID 7160

LSSFPRRRESRLTOETOVKRFLQFYVPDSRMGDNDSGISTSNP 

SEQ ID 7161 

ATGCTmGAATAAGC(XOTAATAAAAGCa 

TCTTCGACGMCACGGAACAGGTGGCAAGCAGC^^ 

GC&GCCATTTCACGTCXKGA!^^ 

TTGCCC£CCGGCGCGGCGCACGCGTCCJ^^ 

GA(XH3TrTACCGGCACGGCTTCTTCC^ 

GCGCAACGTCATCGGCGGATGGGATTGCA^ 

TTTTTGCAGGAAGCCGCCAATTTTTC^ 

CGGCGTGGGGCGCGTTGCGCGTGTMTGCAGTTGGT^^ 

CAAATAACGCTGGCAGCCETAGGCGATAl^^ 

GCCmATGCTGTCGGCGGCAA^ 

SEQ ID 7162 

MLLNKKVIKAILVFVR*KHPRFDRFGI 
mCRLKAQTGDIVIiHTL^ 
AQRHRR)K^RGG0VVPVFRIM7FQVSHPPQRQVVFRHIW 
QITLAAVGDILQRAVFIjRRQLKMRRADV^ 

SEQ ID 7163 

TO©3GAAAGATTGC(XXAACTTGGAAAGT^ 
CGGTATTTTGGlTrATTTAC 

SEQ ID 7164 

l/XIAATWKVPSPQPSAAQC&ERI^ 

SEQ ID 7165 

CAGCTTCGCCCCTGCTTCAATAGACCG^ 

CCGTCCGCCGAACAK&CaA^ 
CGCGGATTTTGCGTTCGATMTGTCC^ 
TTTGOTTGMGCTGTTGCAAAT^^ 
TCGCXJGGTGTCCAAACCGATGTCO^TC^ 
TA(PTGAGGCAGCCGTGTTTCGGCGTAT(XIAA^ 
GCGCAGOTTTCGGGCTGTCXJCACG^ 
AT 
SEQ ID 7166

QLRPCPHRPSRiU 
FAIiKLLQIGNHRRRTDFHQRl 



SEQ ID 7167 

TTGCCGCAACTCGGAGAA' 
GTCffTATGAAAMTAAAAAACAGOTGCACAAGCTGftT 

SEQ ID 7168 



AMCCGCCATOECGMGCGCG^ 

CfflraCTCCCATCKMCTCCGTC 

CGCWCAARSCCATCAAAAflTTCCCCC^WTCCAAO^^ 

AQGCRGAfiCTGGCAAMOXXAAAWS3ASGG03CGCAfiCT^^ 

CATOXTCAAGACOOTeGC^^ 

GMCTGCttEAACAMTCTACCGCGCCT^ 

AACTBCTCGGCTTTAAAMTHCGCCGAATTGTC^ 

AffiCCTCa^CTttAAGCCTTaK 

ATWTGAATTOaACAAAACtmmCC^^ 



CG«aCGCTGCAACT«rc^^ 

TGGCACAMTGTCCGCCCAOMGAAACCGGCGAGCXOTXXXG 

CCTCTTCGATATGATGATTTACAGTGAAAGCGACGAATGCCGTC^^ 

GCCMCAGCTTCGGCCACATCTTCGCCGGCGGCTATTCCGC^ 

OACAG«^AACGOTCre<3(^GAMT^^ 

CTICGACAMGCGGCT 

S !^HAraMI»^^ 
RFKTIKNSPEFATIiSPAQKTKIiDHDI^ 

ELRBOIYEAVVTRASELSHDGKPDOTANiraTLEHAIiKTAKIMFK^^ 

BflOOTPVGKVIAGliFAQIlQUiY^ 

!^TQVDEIiCTSGINGVEWDA^ 



SxMauCICBGlffl^^ 

GWXWATGAAAAATAAAAAACAGCCTGCACMGCKGTATOAGTTTG^ 
mAGTTIGCATAAACCTrAACCCCAACAGCChCCCCG^ 

S ^KKPLPGPPPEGEERIAADFAVAGOJ^J3If 

CaGAKXXXXXmTTCGTACAKXjm^ 
ATATTCAGTAAAGOCATAATOTTCTCTTTCOT^ 

S ^WFQPGWPli^^ 
PDAARFVHRFKHIGSQI^LVraCT^ 

SEQ ID 7176 

^VSVWMDSSFPRSRESBPVGTBmiKRFLEI 

^ATCmAAWreaGMCATTTGTTATCACACGGC 
GTCGTATTATAACCCTGCTCTATATTCAG 

SEQ ID 7178 

lfl^TmT^PRGCRRF7QPl^ARQ^QIRGTVBQ8RIITLLIIQ 

S !^cggcS^ 

ACACTTGGCCCGATGCCGTCCCGATACGCC^^ 

OKCATCGAAAGCMGCGGCGGTTTTTGGTTGCCAGCCXX^ 

aBGTOXSGCGG^ 

eKGGOXXyroGTCGACATCM 

KCOTGCGCCGCGACAWATT^^ 

TaXK«W^TCCCCGCCATTC^^ 






CK^TCATCGTCCGCTOGCCTCCC^ 
T 

SEQ ID 7180 

LRPVRABNLPAKNQKRFFATKOT 

AVAMVWATKEIiIlCLSGSIIJlSAT^ 

CAPYI PMQRYLENVQAEKLFITPAARHVTRVPYDDKAyRIIW 

SEQ ID 7181

gtg<x:aaaaamcottttg^ 

tcgotatmtacacmttatatgc^ 

cgaagmtacxsgcgccgacagcatccaggtgctcgmgg^^ 

GTATTGGACAACGCCATGjACGAAGCACTCG^ 
ACCCGAAAGAAGGGCXJTTCCGCCGCCGAAGTO 
CMC(XXjCTGTCCGACT(XX3TAA(£ 

GGCAaaccGra^^ 

TGACCXaCGAGOGC(aCGGCAAGCAW 

CGAGAAAGACGGCATGAGCtJKXIAATGCGCAA 

CGCCAAGTGATGACGCGCACCATCAACAGCTACATO 

AACTGCCCGACCCCAAATTCTCATCCCAAACCAAAG^ 

CGAAGCCAAMTOTCACXra^ 

CPCGCOGACTGCCMGAAAAAGACCCTG^ 

MGGTAAAATTTTGAACGTCGAAAAAGCACGTTTTC 

ACTACGCTACCAC(XX2ATCATCATCATGA(X6ATGCC6^^ 

TACATCGOXaG(XX^^ 

GCACGCGTCGCCCATTGATTTGACG 

CGGTTCATCAAMTCACGCGCAAGCT^ 

TGGTCGGCGAGGGCGCCMGCTCTACAAAGGCGAGMCGAG 

A6GTTTCGGCGAGA3«AA^ 

ATGGGCGACGAGGTCGAACCXXXSCCGCGCX^TO 

SEQ ID 7182 

VAXKRPCHFAGKFRARTGRKCM^ 

VLDNAIDEALACTCDKIT^ 

GTTVRFLAGTETPGHIEYSFDIIiAKRIRELSFLNNGVDIE^ 

RQVMITlTIKSnEANWAKKAKVETAGDDMREGLTCVLSVKL 

LADCQEKDPALSELYLVEGDSAGGSAMQGRDR^ 

mQPPLYKAKY(aCQERYLKDELHKDQWLLGU 

RFIKITRKI^GNVMVSyiEPKFMSKAYQTLTQTAAAIiKGLVGBGA 

MSDEVEPRBAFIENHAIiIAQNIDA 

SEQ ID 7183 

AAATGCGGAACATTT(^TCACAC(m 

SEQ ID 7184

KCGTFVITRHIPRGCRRFFQPNQARQTEIYQIRGTVMQRRIITI^ 
SEQ ID 7185 
TOCITOX5ATAAAT(mXAATCTCAAAT^^ 

SEQ ID 7186 

LLPINAAISNPVIPAQAGIRTFNAKAIYRE 

SEQ ID 7187 

TTGAAGGT(X3GGATTCCCGCCTTC6CGGGM 

SEQ ID 7188 

MTGFEIAAFIGSNRSRSAVIPAKVGIRPFEF^ 

SEQ ID 7189 

TTGGGAAACCACGATGGATCCCGCCGTGCGCCGC^^ 
TTTATCGAAAACAATGCGCTGATTGCG(^AAATATCG^ 
AACCGCCTTTCG(&GCAAAACCATGATCAGCA^ 
GACGCGC^AACC(mXX^TCGCCATCGA^ 

(XECAAAttOTCMCGtmTC^ 

CAGTCGGAAACGGAAAAAGCCGTTCOTATCX^ 

(XX^TGC(XXX^G(XX3AAATC0<^^ 

GTCGmCCGCCMJ^^ 

(^TC«K^TCCOTACAAAACCAAC^ 

GOCCAAACAAGTGGAAAACGCXX3CCGAAAT6^^ 

AAAGCCCTCAAACTCGGCACGCTCGCGCTO^ 

AAACrKECnOT^^ 
DAQTARIAIDIYGSIA^^
QSmKAVmYNSCAGLLAROlLNRI^ 
mAmGAAGIIPAVLHYFRK^^ 
KALKLGTLALIiEI^^ 

GTCCTATTATMCCCTC£OT 

GMGGOTMTTCG^ 

TACGTTG«X3GC^^ 

GCGTCTTTATGCAGCAIXCGCAAATCGTT^^ 

CCfGAAAGAACCGGGCCOTTATOTAOT 

TGGTCGAAACCGCCATTTCCGAAATACAGCAGGGCX^ 

CAGGTC^CCTCTCCAACAATTma 

CCAA 

SEQ ID 7192 

• ^cGTmTRMPRGCRRF^^ 
YVGGEPRLGLEQTDGIPWHS 

QVNLSMIiTEEPKSRDTVEiaSTPAEETC 

SEQ ID 7193 

TOGCCTCCATGTATTCAATCGT^^ 
AATGCGG6CA!OTATATATA^ 
SEQ ID 7194 

LASMYSIVEIYIFVLAEIVYARYTIIiI^ , 

SEQ ID 7195

ATGTCCGAAAAMTAttTCC^^ 

ca ■ i - ■ 

SEQ ID 7196

MPEKIHPEDIiNUlFGKFSESKATADKn 

SEQ ID 7197 

TOIAGACGCGCTOT^ 

SEQ ID 7198 

HSMPEHPm<H3IX$IVSCT^^ 

SEQ ID 7199 

TTGATGCAAAACCCTGATTTAGTCGGACAAMTGCC^^ 

CCOTACGGCOT^^ 

SEQ ID 7200 

U^NPDLVGQl^PMESV^ 

SEQ ID 7201 

ATGCAAATTGAC1TFTGCGGG0GGTTTTAOT 

CGAAATTT 

SEQ ID 7202 

HQIDPC^RFYRPYIYKSQIFKHISXIYTRRKHIJOTSIIJFPKP 

SEQ ID 7203 

TOX^TTTACCi^PTt^^ 
GGCGTT6TTTGCAGCATCXXj(X3AAGAAOT 

SEQ ID 7204 

LRVYRFSASVI^PAWSPKPIHSC^ 

SEQ ID 7205 

ATGAGCAACTGGAAAC^ 

TAAAGCAGGCGOCAGftATTSAEMXJQAATC^^ 

ATCOTGCAAATGGATACGGAACGGCAAGACC^^ 

ATCATGGTCT6CAACG(KIATCAAG(^^ 

CCATACGCGGCMGCTGGCAMTTGGGAGCGGTTTATTCACGAAA^ 

CAACGGGCGGACGGGACGaTAmAACAGCCT^ 

CTTOAGGOTAACCGAACGGCAGGA!^^ 

AGACACGTCAACACATACGGACACACGCACAAGGAATCT 

GQGACJGGCCTCTAAGTACCTGAAAGAGCTO^^ 

AACAGCTTTACTTCATPC 

SEQ ID 7206 

MSN^DIPYNDLPPLPPKQDIESKTILKRCIAARASIARLKQAAELIPN^ 

IMVC!NAnHPYETAIRKTGGTALKG(M 
IiGVTHWESWimiiDGV^



SEQ ID 7207

ATGGCTAAAAIAAAATCnTO 

SEQ ID 7208

MAKnCIiINKMLKIiNP0RASAYRIEAlJffiFBK&^ 

SEQ ID 7209

GGCTTTOCGGTGTITrCGAGATTTC^^ 
ATCCCTTTGMCXKXjCGCCCTTO 

GTGTFCCAAGTOTFGCAAAG&AGCG^^ 

AGCCACACCAGCGCGACGGCGGAmTGCGGAa 

CTATC(£TGC£AAACGAOT 

SEQ ID 7210 

GFSGVFEISLNA*LSAPKSNAVSERQASPKMFFIFRRQFQ^ 
VFQVLQRSVQDGTOmYHFFAVKimSL*TH^ 

SEQ ID 7211 

ATGCCGCCTGAMGGCTTCAGAC(KjCATT^^ 

GGTTTCGGTTTTTTTTTGGCTGT^^ 

SEQ ID 7212 

MPPERLQTAPPTQFPPPPIIPKJTVUiK^ 

SEQ ID 7213 

GTGGTTCCGGTATTTGAGGCTTTGATTAGGGAT^ 

MTATMTTTGGTTCAAACGGMTACCGGCATTTTM 

ATTAACAAAAACCCftTACGGCGTTGCC^^ 

SEQ ID 7214 

VVPVFEAHRDADFQHFLSYNNE^^ 

SEQ ID 7215 

GAAAGCGGGAATCTAGAACG j 

SEQ ID 7216 

MPSEAFQAAFAAFGHLKIPSFPRRRESEYIRTETCIPSFPRKRESRT j 

SEQ ID 7217 

TTGATCAAGTCCGCGCTAAAACA^ 
TTAACTTTTGGCGCGGCGGACGGGGCTCGAA^ 

SEQ ID 7218 

.IJKSALKKIVQRKTGWEWQire 

SEQ ID 7219 

GGCTTCCGCCACCGAATCCCGGTTGT^^ 
TT(^ATTCGGGCAGCGGAACACT^ 

SEQ ID 7220 

GFRmPVVGJ'FAWAFQPRLmFLNIKPAFDRIiAVAGAIGIRAAEHFGD * 

SEQ ID 7221 

CACTTCTTCGTTCMCACCA^ 

GACGAAGGCAMTTCGCAGCCCCAGCGTTTCM 



GTTQGACAATCGCGTCGGTGTAGCCGGAMTGATACGGGOT 

CACCGCCGTTTCAMCGACAGCCTC^^ 

AjOTATTCCAACTGTTC^^ 

SEQ ID 7222 

HFFVQHQRQYWmONAVGMiIjBIIGVRRGIDIGVDFVRM 

DEGKFAAPAFQRLfQRMH^ 
HRRmQPRARGRLEKAim}EFAFQQIAAM4RLQ 

SEQ ID 7223 

GTGGCGGATCGGAGGGGCAGGCGGTCGGCAAACCGTGCGTO 

GCGTCTACAATATGCGGATTGTATGTTTAGG 
GCTGAWKMOXOTrc 

TOGGACACGATGTCAAGGOGATGGTTATCGC^^ 
AAGCaXOTmMCGACGO^ 

fpTGGGCTGCAlKCACTTTOOC^^ 
GATTGCTCGATACCGGCMCAACAATCCCGACTACCG^ 

GTOTTGGGT 
SEQ ID 7224

VADRRGRRSANRAPGG^ 

LGHIMftMVIACNTIMVA(2UURQK^ 

LGCTHFPIiLKPIiIGREAHN\^VDSAITTA 

SEQ ID 7225 

ATGGTCGATAACCGTGCCTTTTTC^^ 



OTCGAGGTTGAATTOCCT^^ 
GCGGTATCEGCAGTCGGCTTCTTC^ 

OXXX&CGTTGATGAC&X^^ 

OTTTCGCCTTTTTT^ 

GAA(mTGAGTOGCCTOmCA^ 

TGTAAAGCGGATTAGGCATTTTTGC^ 

GCGCGTAOTATCGCACATTTGCCTCCACAAAGOT 

CAGGAAGCCGCCATGTTAATCTGCAACCOT 

TCACCAAAGACMCCGCCTCTCCTA 

SEQ ID 7226 

MVDNRAFFDGFOTEPIiRFHIGSSHFFVQHQRQYRHTR^ 
AVSAVGFFQLFGNWGHREAGRRDEGKFAAPAFQRL^ 

ARIIAHLPPQSFTARILYFSIX3ISKHARISIRYNRARHFFS 

GGTCTTCCKGTTTAAACGCACCGCTGGTCAOT 
GACCTACGCCATCGTCGAATCTTATCCQTO 
AACA(XXnTOCCTGATCGMTG^ 
AAC6AGGACGCGAAAGCCTGCCGCTA 

SEQ ID 7228 

LWACPSDPPHKSCKAPMSDFSPVSRFLADEMTLDLGMWS 

NSVCLIEWPQQGGEFTPPADITATLTHGGGGRKCLLTAHTERGRESLPIj j 

SEQ ID 7229

GTGGCGGTCATOTGCGGTGGTGCG(XjTCAAMGGGOT 

CGGCA!TPTTTCTGTATmCCTGT^^ 

CKXATGCTGGGCTACCTGCTOTGGGCOT 

ATOGGHTCGAffHTTCATTGCOCMGOT 

TGCCGTTCAATTGQGCGTTTGOCCTATCGGG^ 

CGTGCTGCTGATGCAGGACATOCCX^ 

ACGCTGGG<£TGCTGTTTTTCGTCGGC^^ 

CTTATCTGACTGAGCTGGAAGGTTTGTCT 

TTTGCTCX3GCTTCTKTI^ 

TTTGCCATTGCATOAAMTGMGCATTCGGTCGG 

GAAGTC^CTCGATCTGC^CAGCATGTTGGTGGAAGC^ 
CCGTATTTCGCGCTCGACTTGGACATCGC^ 
tfSAAAATGGTGGTGGTTACGCTCAACAATAT^ 
MC(rmACCGATATGGGTGC(^GAAGCan^ 

gc^tatccgccacagccgttatgccgot^^ 

catacgccgtcggcaaaacagt(xx3ca(^ 

cggtgacgtgttggtggtcgcaggcaaaaaaaamgaaa™ 

SEQ ID 7230 

VAVIVRWCGSKGRILSELYAAPVS^^ 

I6IiEFSLPKI^AMRRIiVFGICGLQW3VM 

TOLLFFVGSKIMSRWFRMVAKRKS 

FAIAFKMKHSVGDSIiKTiaiYIiAQGGE^ 

PYFALDU>IARVQVARSAGEPVSFGDAKRREVIi^ 

ANIRHSRyAALEGLFVGSDDEAGFGENGETVRHAFPLAAEA^ 

SEQ ID 7231 

ATGTTMTCTGCAACCCCTO 

ACOOTCTCCTATTCGAAATra 

CGCCXiACAACGACCTGCGCGTCATC^^ 

GCCGCOGAGAAAACACAGGMCAGACCGCCTOWGCGCATOT 

ACATCGGCCOrimCCGAACAGATC^C^ 

OTACCAAAATCCACCTTAACGTCCAC^^ 
SEQ ID 7232

MIJCNPYEWIHOTSSGKIFRPGD 

AAEKTQEQTASGASY^ 

VTRIHLITVHVNHDRTDAHRLyF™ 

SEQ ID 7233 

ATGCCTOTGACCGCCCATACCGMC^ 

CGACAACC£GGGCAGGTTGGTCGTC^ 
AACACGCCGimCC(mX^CTCGTCA^ 
GGATGGATGCCGACGATCCGATGATGGCGCFGCT^ 
CGGGCGCAGACCCGTCATCATGCTCGATCCGGGACAC^^ 

CCGACGCTTTCACaGCCCCTCCGCG^ 
CGGCGTACCGACCAGCGGCMCCGCMCGTCGATACCGOTT^ 
AACCATCTGaCAAAGGCAGG<^ACGAAGC^ 
GCAGCGAATCCTTOGTCGGCAGTGCGCCC^ 

SEQ ID 7234 

MPADRPYRTRTRKPAAMTKI^^ 
NTPTTVRLVIDIiKQPTHAQWALPPVGGFKDRLVVDLYra 
QLfALGYNVFMTRNE0VFIPLGVRV 
NHLHKGRVDEANFAVIitAPOT 

SEQ ID 7235 

ATGACCGCCACC(XXXXXX5AmAGACCGATO 

GCGCCCCGG(XXjGCGC^ACCGTCGTC^ 

O^CCCXXrmTCCGCQC^^ 

GACTGCATCCATATGCACX^OTTGCCGACATO . 
ACGMCGCAAACAttGCEAAG^ 
CGACOTttTCGCCAAAGOCATGGCAAAAGCG^ 
CGCCGCGCCCAACGCGATATGAAATACGG(^GCG^ 
SEQ ID 7236

WffATAADLDRLLPQTQCRECGYEGCLPyARAMliRGE 

mQRIMYGSDSEKAAAIiEyiiKQYKAKQEAAQRTAS *" 1 

SEQ ID 7237 

TTGGTATCTCAATTTATCCATACCTTTCATCAT 
TTCAATAOGGATGTATT^ 
CGACCAGGATAGACGGCATATCGGGTGCGOGCAATAC^^ 
GCTGTCGCGCAGTGTGGCGGTTTG^ 

ATATAAACCCCCCTTCCGCGCG<^^ 

CTGCmCCGCCCG(mGACGGC^ 

TCGGCTTCCGGAGAG(XXjCGCAGGGTT^^ 

CGACGGGCGGAAGC(mA(^CTOTGCGTGG^ 

AACTTTTTGGGAAAGCCCGTGCAATACGGTATT6ATC 

GTCAGGCGGGTGTAGGTGTGCGACGGCCATATCCG^^ 

GGATGATTTGTCTTCGTGTCAGTTTGGT^ 

GCAGGCGGGGTAAATTCGCCTCCCTGTTGCG^ 

SEQ ID 7238 

LVSQFIHTFHHKKAVVK&SRFMPAFIRQi^ 
AVAQCGGIJfflVl^im^ 
LQTARAIX^GVFTAVSRIEHDDGSAPVFAPPGSGRGVLGGW 
NFLGKPVQYGIDVCVLYIDDQPARVVEREVLVFERGGAF^ 



SEQ ID 7239 

ATGAmTGGGTGTATTAATTTTTTTTCTAA^^ 
CAI^TTATCTATTTTA!ITCATATO 
SEQ ID 7240 

SEQ ID 7241 

ATGCXATCXGTAAGGCGATTACCGACATO 

TTCGGACGGCATTTTCGCGTT(X3^ 

CCTGGAAATACGCTTCAAACGGCAAACCCX^ 

GTCAI^TAAGT(^TTTCGCCGCT 

Om^GTITGCCGAAATCGGCGAT^^ 

CCTCAAGGCTGGCTTTTGCCATACCCAT^ 

TGCCAATGCGtmOK^ 

AMCXGTCCAAGCCATCCCAATGTTTGCC 
SEQ ID 7242

MPSVRRLPTSSTVWRKTOT^ 

VNISDFAGNAGRQVGQQEGGSIPDFFDGNVAAQ 

CQCGQAVCGNFVCGVEIffAADAVEEVAA^ 

SEQ ID 7243 

GTGCGCGGOTCGGCTTTTCAGACGGttTAOT 

SEQ ID 7244 
VRGPGFSIXjIFDVMIKQI/roiTTPSKDRHTT 

SEQ ID 7245 

TTGAATGTGTGCCAAG^ 
ACCCGACCGATTCAAATTTTGCAAAAAA 
GTTGAAACCCTGCCCTTTAGGGCG6 

SEQ ID 7246 

LNVCQVYKGEVEEKNQAASSNST^^ 

SEQ ID 7247 

ATGTTTGTTGACCCCCGCTTCTT 
TTACTGAA 

SEQ ID 7248 

MFVDPRFLPKHPCTELESTARFRHSKTRFOTLRRGiTXHVra 

SEQ ID 7249

TTGTCGGATGAAGGCAGGCATAAAOTACTTCCGC^^ 
GCGAACAAGAATAOTGACCGAATTGTCCGATCGGCT^^ 
AAACACTTATAATC^TACGATACGCAGGCATTGCTTAM 
TCCGOTTTGGACAGCC^^ 
CTATMGCGGGACAGACTACArcCC^ 
TAATAATTTGGCCTATCK^ 
OCCGCCTAIKMXXjAGGCTGTCGTAG 
ATATTTTTCGAAAmCTCCATT 
SEQ ID 7250 

LSDEGWKPTSAPNRVF!MKGMDK^ 
S6LDSPKPFDWLDFLKTTHSLISENLLEKGSG6VVRRDSW 

PAYAEAVVAYYETOTGLFKKYFISAYHfWNKYGP^ j 
SEQ ID 7251

■ ATGCCGTCTGAAGamTCAGACGGCATTO 

SEQ ID 7252 
MPSmSDGILSSSPSAVPVPSVSYRPPFRPCRFQ 

SEQ ID 7253 

TTGCTTGGCCTATGTTTGTTGAOT 

CGATTGCCACGTTACTGMTAATTTTTATGGCAGAttra 

ACTGGGTTTTGMGCACAAGGTTATGCT^ 

AACGGAGAAACTGTTTTGCTGGCTGATCC^^ ' 
TGCmAAAATACTGftTTTTGm 

SEQ ID 7254 

LLGLCLLTPASYQOTHVRSWKARITOSG 
NGETVHADPSLGHV^^ 
SEQ ID 7255 

ATGGGCATCAAAGTCGCCATCAACCXX^ 

TTCAAAOCAACX&CCATCTGACCAAft 

CAACOTGCCGMCTGCCGTGGAAAGAACTC^ 

ATTTCCGCACCGGGCGGCGACGATGTCGATGCAAOT 

CGGTTGCCAMGTCTTGAGCGAAAGCGTCGGCATCG^ 

CAGCGGCGTGGAAAACAOEATTC^ 

TCCTTGGTAGATTTGAGCTTCC^^ 

CATTGGTTTCCATGGAOTCAACCA 

CAGCTXKXIAMTGCTtSAACACCGC^ 

SEQ ID 7256 

MGIKVAINGYGRIOTQVLRAIYDYQIQDQI^ 

ISAPGGDDVmTmGVNDSVLTM^^ 

SLVDLSFQAARDTTVEEINALMH^ 

SEQ ID 7257 

ATGAATACmATATACACTTTTOGCCA^ 

GG(XK:(^TT(K3AACAGGCCTACGCCGa^CC^ 

CATCAACTG6TTCAACTGGOTACCTO 

GCTTTCCGCGACATTTACGACGCACGTCCGAGC^^ 

TGTOAAACGCGGCTACCXXXITGGATACCGGC^ 

CAGCGGCACGATTGCTATCGMGCCGCTTGG^ 

cGcaxxmsAAGcxx^cccaxx^ 

OTTCAGOGTTGCCG(X}GC^ 
GTTGGGTACGKMTTGAAAAAACA^
AACATCGACTGCOOTOT 

SEQ ID 7258 

MmYTLFATCPRCUiETVL^^ 
AFKDIYDARPSVDKAAPDVRIHAFLDERNVEI^ 
RRAEAQTRFVRAPBGSBN^^ 
NIDCRIiFLIEMVQGSNR 

SEQ ID 7259 

TTGACGtOTTCTTGOTATTT^ 

TTGCCGTCGAO&CT^ 

OCGAGAOGGCTTTC&TCAGTGC^ 

GCCTTimTTCGG<mGACCAMa 

TGGTCGTTGGTCAGTGCGTGGATGGTGCSTC^^ 

TCATGTCGGCGGTCAGGACGCTGTCGTTCACGCOT 

TTCTTTGCIK^1^C(XX^ 

TTGCCGCOSTCGTGGGATACGTCGGCT^^ 

^OTIXXIDIWMCQBDPmAPBWOT 



^ _____ _ _^ _ AVDGDFDAHGIiPIiC 

CTATTOTACTGTCT^ 

SEQ ID 7262 

MFSSFRKHFSFLTAILPGCRHFQ^ 

SEQ ID 7263

TCCGCCCaXTTGACCGCACCCCCTa 
AGCA(^TMCGCGCATTCCCCGTATTGACCACG^ 

GTTGAACQCAATATCXXSGCGTAACCGCCAAC^^ 
CCTATCGGCAAAf^CTCAAAACCCGCCACA 

SEQ ID 7264 

SARU)RTPCLCHPARCROT^ 

VERNIGI^QHAFRIRHGTLQIQTLQHIQHOTDQIKKHHA * PTVQNLH 

SEQ ID 7265 

ATGCGTATT(£OTN3CT^ 

CGTATCGGAACGGCGTGTTTCCTTGGWTr^ 

GCTGCOTAACGGCAOTATCG<^TTGCG(^ 

GCATATTTGAAGCTGCACGAAATGGGAC^CGCGOT 

ATGGCGAATC6ATGTTTQQ3TTGCAAOT 

(X^GCCGTTTCGGTTCGGAGCTGCTGCCGTTT^ 

SEQ ID 7266 

MRIPLLAPDNYAFPDPAYAIAR^ 
AYIiKLHEMGHAHSFECHYPDENGKTRU 

S ATAT(^im^ 
TAAACAATACTGTCAGGTTTCACTTGTT^ 
CTWAC03GCAGCACCGCGA(XGCGT^^ 

SEQ ID 7268 

IWFOlSAKCVL*LnaCRSFSEAKMAKFAHI*niATi™ 

SEQ ID 7269 

ATGMTAAAAACCGCAAATTACTGCTTGCCGCACTO 

AWGACCGAG^m^CGCTGC^ 
CAGOTCAGTATGJ^ 

TTTACGGCGGACATTACiArcGGCAGCCGQACATT^^ 
SEQ ID 7270 

FTADITIGSRTFQTAFTAE 

SEQ ID 7271 

GT6ATAGl!GQGCAAGCAjSGCTGAAW 

CXX^CCQCGTTTGAAGATTTTGGAffl 

CCGCG'IOT^C(XAGTraA(»GG(^ 

GGCGAGGl^(^TOXmATCCCGA^ 

AGGCCAAGGGCAAACGT 

SEQ ID 7272 

vrvGKQAmYYSvYYFrcnra 

GEVTBFHNPEIEAI^DKIAEENGYRIVDHALYMYGW 
SEQ ID 7273

TTGCAGGTAAAATATCAAAC(^TTAATTTMATCA 

SEQ ID 7274 

LQVKYQTINI^FTEYSAAFTKOTSN^ 

SEQ ID 7275 QQsmxm ^^ 
CCCTKEXX^ACTCATAAACC^^ 

CGGACGCBCAGTCGtm^ 
SEQ ID 7276 

VIDnAVAVFIXSDFVI^FDFGIVEFRY^ 

VAEFPHNIQYPCKIIDMIROT 
SEQ ID 7277 

CCGGCGtmGGTOAGGO^GCG^CC^ 
ACACOrCGOTC^ 

ATGCTGCCTGCCAACGGTGCGCTTTTGC^^ 
GTTCGGCAGGGGGMGCGGTGCGAACGTTTTGATGGAGG 
. GGAAMGGGCGCG^TCrTCCTC^ 

GTTGCACGTTATGGAAAACTATGGCTCGGATTGGG 
SEQ ID 7278 

MIQDTRPLIRVVAGILLDSIXjIIYLLGSRPK 

MLPMGALLRSLSVPmYGSLKTGLYGENSmYRVLPLGSA(^ 

VARYGKLWLGLGAHAWRDETIGKHHE 

SEQ ID 7279

ATGMCCAGACMGCCGCGATCTGACCCGCATCAGCC^ 

ACGTCCmTGCTimCTTCAGCCAC 

CCCGAAAATCCGCGTGGGCAAAATCGCCGG^ 

CGCGACCTGCCCGACGACGTTGCCGCAGGCGATGTCT^ 

TCCTGAAAAGCMCAMGGCATCAACAMCGCCK^ 

CAGCTTTGTGAMTCaJCCGMGACCTGCACATO 

GAAAACCTTGATGAMTCATCCTTGCCGGCGACGGCATT 

GCGAGTTGCGCCGCTTC^^ 

ACGGAGTATCCCGAAGCXXsTCAGCACCAACTTAGCGAT^ 
AAATCAGCCGCaCMTATTACCCTGCCGA^^ 
CGACACGGCGTTGAACGAAGTCGAAACGATGCTGGTG^^ 
GAAGTGCTGCGCGTCAAA 

SEQ ID 7280 

MNQTSWMRISHNTKIVAT^ 
H)LPBDVMGIMjVnjIH5LLTLTVESVK 
ENLDBIIMGDGIWARGDMVWGHAAVPAI^KRKIRRA^ 
TOTEAVSTNLAIAGGAVSVARAVHAKMVALTK^ 
EVLKVK 
SEQ ID 7281

GTGAACAAAACCCTCATCCTCGCCCTTT^ 

OXIGCGCaTTGCCGCCCTGCGCCCC^ 

CCGCAACGGCATCATCAAAGAACGCAGCAACCTGACCGra 

GCAGACAAACAA 
SEQ ID 7282 

VNKTLIIALSAfcF^ACS^ 

ADKQ 
SEQ ID 7283 

ATGATTTACGGCATCGGTACA(^CATCGTTT(X^ 

AGGCGGGCAAACCCGTCMCTACCTTGCCAAAC(K^ 

GTTGCCGAAAAA 
SEQ ID 7284 

MIYGI GTDI VSLKRI IRLJKKFGQAFAGRI LTPEELLEF PQAGRPVNYLAKRFAAKEAFAKAVGTGIRGAVSFCNIG IGHDAUJKPEFFTGPALSM^EQGISRVSLSMADKaJTVLAFA 
VAEK 

SEQ ID 7285 mmmn( ^ 

CGTTCTTTGATGA7GCCGTTG(^ 

GGCGCAGGGCGGCAACGGCGCGCGGTTCGAGTTO 

GGCGAGGA!FGAGGG1TrTGTTCA0GGGAG 






GEDBGFVH5RTPLCKSGCLV 

GGCAA 

SEQ ID 7288 

EFFOAENPFCn<QIVFGSF*T^ 

SEQ ID 7290

TGAAAACGTOGGCGCMTCGTTATTGTCGATto 



TTCAAAAl\*n i iaiwu iuuvvu * w . , m ATGTTTGTTCCGA(^TTGCCC(^ 



M^M^ESVKNGNLTAI^^ 

GQKPQQWJKDNWEERGKPPEGIGGGWGSAH > 

GTTAACCGCTTCTTCGAAGTCGTAGAMTMTCGGCATTGGTAAT 
VNRFFEWEIIGIGH 

S !^Xc<^mAG^ 



TJAATATGCCCBAGTTTTTCACTGATAGCGAATCQ^ 
SEQ ID 7296 



ISSKFBO^CTSGMQGitPITTERHiKMNMKGITIiFA^ 



(XOT^CCATCTICrACMTCmKCa3CC 
SEQ ID 7298 



GATCTG<?rCGATCTTCACACCCA!3^^ 
SEQ ID 7300

MPSETVCLPliRQQrcPEHM^ 

DKQGHAmV($DFGEF©FF^ 

DVVDVDTQKHNVSPSLRFPV 

SEQ ID 7301

ATGATACCGCTCJa^^ 

<aCCCTCAAGCAC(^^ 
TTCGCCGOCAACTraGaTCGGCGTCAA^ 

atccccccagcggcac^ 

SEQ ID 7302

MIPLKIAIAGANGRMGRVLVEAVNNHPDTVLSGAIiE^ 

FAWSVGVm^THIUJTVMVLHEGYDIEIlEGffi 

GAVEMVWVNGKTGLYI^DVLGLHNR 

SEQ ID 7303

(JKOTGCCCAGCATTCATOT 
ACAAAGTCAACGACAATCI^ 
CGCCGTCGAACCTGTCGCCGCGT^^ 

SEQ ID 7304 

VI»PSIHYEM)SATDFTGIiPVQGSKNGKITKTTVAP 

SEQ ID 7305

ATGCTTCTGGGTGTCMC^TCGACCACATCGC^ 

TGCACCTGCGCGAAG^^ 

TAAAGTGATGCCGGAAGACGTGT^ 

GACGCGGGCATACGCGTGTCCTTGTTTATC^ 

ACGCCGMCAAATCAGGCAGTTCGAGCGCATCCAAM^ 

MTCCTC(mTCCGCGAACTG^ 

SEQ ID 7306

MIJ/3VNIIfflATVRm<n?TYPSPVEW^ 
DAGIRVSIJIDADDRQIQAARDVGAP 
SEQ ID 7307 

GCACACAGTCGGTCAACGCGCMAGCACGGC^ 

GCTG(XX^6CATTCATTAT6AAGCAGATTCCGCXACC(^ 

AAAGTCAACGACMTCTGACCGTGGGCTTO 

CCGTCGAACCTGTCGCCGCGTGGAAACT(^^ 

(X^^TGCTGCMGCMGVCCTTCTMTCCT 

SEQ ID 7308

FVHRPPFYGNR2®PFHTEKmLOT 
KVNDNLTVGLGVYVPFGSATEYETO 

SEQ ID 7309 

TTGTCATGCTGCACAAGCAACTTTATT^ 

SEQ ID 7310 

LSCCTSNFII^IRSLSAKCRUINIFSDGILyGG 

SEQ ID 7311 

ATGCTGCAAGCAACACOTCTMTCCTACTGCCG^ 
AC^CCGCGOGCG0GTGGGCGTGAACTAC(^TTC(^ 
ACCGCTCGGWACACGGCGAATGAAAAAO 
ACGCCmCAGCCGOTCAATM^ 

CTGGTTCTCCGCCGC&ATGAAATA^ 
6ACA6CAAAGGTGCX5TCTTGC6CAC^^ 

SEQ ID 7312 

MLQATPSNPTAAAQIKAK3HADVKGSDWGVGYQIA 

TRHSRFNKAI&FFEKEKNIANGK^ 
DSKGASCARFKNHADIIGLQYTYKFK 

SEQ ID 7313 

TTGTnCATTFGTTCTCGTAA^ 

CTACGGAGAACCAAGATAMGATATC(^TMTACCTTGGAGCTO 
GACC^GTTTTATCATAGAT^AGT^^ 

SEQ ID 7314 

imPFVmFRQIOTPNVTDWVK^ 

SEQ ID 7315 

5^3CGCACGTITCAAAAACCACGOC^ 

SEQ ID 7316 

Ii^KPRRHHRPAIHLQlQINVTPFEYKNAV 
SEQ ID 7317

ATGGATTATTTTCCCGAAGCAGCACGTA' 
GACTAAGGTTAAAATATAATGGTTCGTTAACTGAT 

SEQ ID 7318 

MDYFPEftMOTKK&inVIlRTSiraKnfi^ 

GCAGC<KSCGCAAACGGCAGAGCGAGCTCCGCGGCGTATTGGTTCC^^ 

AroGCGaaGCWxawaaGG^^ 

O&ACAMACCTACCC&CTGAW^^ 



GACCGCAATJVCCGCG 
SEQ ID 7320 

^E^VWHEPVFHIASSRSRI^IiRVM^ * 

ram 

S otcotatoLagaatatatacagaao^ 

GAACTOTCGAGGT^^^ 

SEQ ID 7322 

SRYSGmQNLLHIKMVGTI^ 
ELFEVFFQOTQVGQM(HIAGKFFPRLSLPN6Q 

S ^mtSatat^^ 

GAAAGCGTTGCCK^GCAGGCACGGTTAACCGGCATCGGCGGG 

GCGGCTGCCGGAAATTTTAC3GGATACGCA6GGAGCTGOT 

AAACGGGCGGGCATTCCGACCCTGCATTATGTC^ 

(XX&GCTTTATCTCGATGCGGGCGGACff 

CCCCCTATTCGCC^^ 

GCC(mC(K^G<mCGAAGCGGCGTT^ 

TCCTGGTAAC6AG(X^ACGGC2GACTTTG^ 

TOTCGGOTGCCGAATATCCTGTTGG 

GCC(XK3CW3a^6(^TTTCGG« 

SEQ ID 7324

KRAGIPTLHYVSPSVWAWRRERVGRIVHQVNRVIiCLFPMEPQ 

AAmTKRRIAEVLQRPEFAGL^^ 

AALQQDFGAIiHiUjKKiyiADLAARAVIiEEAK 

SEQ ID 7325

ATGGTAGGGACGCTCGGGCMCAGGACGGCGGCAGGCAGCT^ 

ATGGAAACTACTTCGCGATGGCGGTAAAGCAGCGTGTl^^ 

TCATCAGCCTGTTGMGMCGACGGTTGCTC^^ 

MGHLAGKFFPRLSLPNGQYSGLTRIRTRFRSRRQYR 

ACOaC^CAGGGTTTCaGACGG^ 

SEQ ID 7328 

GJTCIGRRFMCRPVC^SIGTOKAGCFAPAYM^ 

S ^^A^ 

TCAACGAACGCGCGCAACATGCC<^CGCCATCG^ 

GCCCGACGAATCGGTAGCAOTOTtSTTTCGGGAAGTGA 

AAGCATTTCGGACACGCCG(X^TACMTGGCGTGT^^ 

CGGTCGGGCGCACATTGGATTTGOTGCCOT 

AGTATTTTCGCACGCGCAGGCGTTGGCGCAGT^^ 

ATCACGAAACCGGTGCAAGCGGCAGCGACAAAACCK^^ 
GTTTGAGAGCCGTCCGAGCAAATCCGTTT^^ 
'JTTGTCAAAGCX^TCGGTTCGTATC^^ 
SEQ ID 7330 

KHFGHAAHTHKPTIDIX:FKQVmQADYLVAPVENST^ 
DDGTVAAIAGRTAAEIYGLIMVABCOT
FVKAIGSYPTAVL 

SEQ ID 7331 

TTGTTTATCOTCGGCACGGCGTTCGGGC^ 

TMTTGCTTCX^GTACCGAACAGCTCOT 

GCAGATGTTTGCCCTTATCXaGCATCATTCTG^ 

TCGCCGGTCCR3CCCG(m^ 

(XOTjATGGGTTATCTGTTTTTTCAGGC^ 

GTTTGCACTC^CATCATCACGATGA^ 

COTraAACTCGCCGCOyi^^ 

TGTCTTATTTCAAAGMGAGGGCGGCAGCGCGAACGCCGT^ 

CTCTTGCOK^TTGCa^OT 

ATGCACGGCAAATTACCGTTCAG^^ 

(XXmCATGCCCACGCCATCGG^ 

CGGTAGCACGCCTGTTTCGGGAAGTGATGAGCGAGTGC^^ 

SEQ ID 7332 

LFMFCTATCQVAGGAVSDIKGRKPVALTGLI^ 
SPVLPGLVQYFIiPI^AVGGKIGRDVFGLVAGRFKRV^^ 
PSQLAAVIiFPGIiPPFWbPVWWSVGTQGL 
MHGKLPFRHNYVPNYERTPHPAPQRHRHHIOIBAPAQRTRATC 
TRRAYNGVSDHRRLLQTG 
SEQ ID 7333

ATGAAMOGCACGAAATAAGCAAAGGTTCGGTCAGCCT^ 

TCCACCGCATTATOGCGCCGGCGAGGTGCGGTTGMC^ 

GCCGTCTGAAAGGCGTGCOX^ 

TTCGGCOTATCGMCAGTTGCGCC^^ 

TCAAACTGCACGAAGCTATCCXn'AACGACCACCCC^^ 

ACMGGCGAAAAGATGGTACGCGTCAGCGAGGACGGGCAGTCGGCG^ 

GTGCGGGCAACXXITGAAAACGGGCCGTACGCACCAAAl^^ 

MTTGGGTTTGMGCGCATGTTTTTG^ 

GTTGGAAAACGGGACGAAAATG I 

SEQ ID 7334 

HKTHEISRGSVSLIGVABffiAGQRLDNYL 

fgvieqlrrarpeakylelvhrldk^ 
vi&tlktgrthqirvhlqsqot 

SEQ ID 7335 

TOGCACACATTCTATT^ 
ATCCAAAACCGGATTTTCTT 

SEQ ID 7336 

LAHILUIVCIFyiiKQQASVRDVVMFHW 

SEQ ID 7337

TTTTGCAGGCGGGTTt&GTCAAAAC^^ 
TGCXGACGATTGGTTTTCGAM 

SEQ ID 7338 

MNIIRALLIIIiGCIiAAGmVFIiAGIKL^ 

SEQ ID 7339 

TCCCTCGGTGCTCMGG(^TTMTGCTGTAACCG^ 
Aj^TTGCGGCGCAGCGGGTTGTGGGCGGCGACG^ 
CTTTACCCAGACAGGCGGCGGTAMGCGGATGG^ 
OSAGTTCCTGCCGCGCAra 

CGAGTTCGGMTCTMTTCX^GCX^GCCATTTTG^ 
GGAACGCTOKaAAATCATC(XX36TAAT^ 

SEQ ID 7340 

SI/3AQGIKAWAVNISDPAGNAGRQVGQQEGGSIPDFP!X3NVAAQH^ 
GVPAAHHRTGGFCQCGQAVCGNFVC^^ 
GTLGNHAGNQNFPALQKTH 
SEQ ID 7341 

SEQ ID 7342 

SAEQVGDIFHLFFSKTVPAQSGGNCP^^ 

SEQ ID 7343 

TTGGGGCTCCGCTGCCGTCTTACTTCCG^ 
TAASPA 
SEQ ID 7344

LGLRCMiTSftPKRQ^ 

SEQ ID 7345 

ATCGATGGAATAG(XGCGAAfiATACTGTATOT 

OTCCACTTCCXXXX^ 

ATTGTGTTCCAAAATGTCTTC^ 

TTCACG(OTAAGGCAGCAAC^^ 

CglCTTTAAACTCGACAre 

GTTCAMTCGGCGATACAGAA(X3 

TCTTCGCGOtXXSAAAGA^ 

CTGCAAQCAGCAGGGCMACGGCAGCGCAAT^ 

SEQ ID 7346 

IMIAAKILYPFAKHPRVALELEVFGVFLPVCI^ 
IVPQNVFraSFGRFFmFAAQNPDK^ 



SEQ ID 7347 

ATGCC(m?imGCGGTTTa3G<^ 

AA&OCMGTCICCGCXXXrP^^ 

(H^GCCGCTGAACGCt^TA 

SEQ ID 7348 

MPSEAVStXSISFGFAPPSDMUILRPCRK^ 

SEQ ID 7349 

TTGAAAAICCGCAGK&TGCACGTT^^ 
GGGCGCCGGCTTCTTCTIOTATC^ 
TACGTTCTGCGCCTGCCGGCTGG(yyO 

SEQ ID 7350 

UCIRRDARSGKPCABGSDKTCYKDRCGGIjPAAFRTIiRSEKH 

SEQ ID 7351 

TTGAATAACATOTTTTCAT^^ 
TGAGT 

SEQ ID 7352

LNNILFMITSWTGGIRARRSGSAFR^ 

SEQ ID 7353 

(HGMCATGAftAftGAMGra 
GCAATATCTACAAAGGCATCATTACCCGCATO^^ 
CffiCTACGAAGGCGG(XXXK%GCG^^ 
CTCGCGffiACGCTATCTO^^ 

ACGGCATGAGCATCATCGCGCGTACCGCCGGCATCGGCCGC^^ 

(^CCCCTACCTGCTCm 

'PTCATGAGCTACG'KIAT^^ 

(XX^GCCCTCCGGCG«XmTC^ 

GGAAGCTQCOGAAQftASTCGCCCGACaA^^ 

GACGCACTCAAAAAAGAO^GCCC(m , ACAGATG^ 

CGCOT(XX3CCGGCACCGGCGm 

CGATGTCGCaCOTCCT^ 

CC(^CCCGCCGTCAMGGCGTGCGCCAC^ 

AACCmGCXETACCOXS^^ 

(^GTCCGC(^GCGGC<^^ 

AC6AAGCGGAAATCCAAAGCCGCMCGTACAGCCT6CCG(^ 

CCAAACeGCACOTCAAAC^^ 

CGCGMC<XX&CA(£CGCGATGCCA^ 

TTCACGACA(X)GCC!GAC^AAGTCCGTPC^^ 

-TCTGAMCCGCAACCm^^ 
TCCAAGCCXsGAOGOGGTTTQCGCCGTTO^^ 
SEQ ID 7354 



lAGRYLVLMPNNPRGCEVSR^ 
FMSYVKPGM3RLKLYEIHTPLFSRFQIERQIESAF 
DftLKKDRARVQMGKLSRFGLIjELSRQR^ 
KRIRTDDVEEDGEPSYKRTAEPEBDESAKPFGGEKAKAARPEPAV^^ 



RHUERDAKERRERNNQR^ 

SETATFTAAAEQARETAQTOLVUETDPAALKAWMQPEVQA^^ 
SEQ ID 7355

TTGTATGACXMCTGACTO 

TCGGCGTGCGCCTGCT6ATTGCCGCGCTC 

GACCCTGCTGCTTCAGTTTGTCGGGTTGAA^ 

CGTGCCTACCACTGGATATGCGGCGCGGCXSGCA 

GCGCGGGCTTTTGTGCCGCTATGCGTCCGAC^^ 

TTTGGCGCAAMjTTATACCGTGGACTGGAGCGT^^ 

GCCAACGCGTCGGGACTGTTGA!PTTCGC^^ 

(XACTTTOKXXSCCGGC^ 

SEQ ID 7356

LYDHLTLPIiTHFYQILALIIVffiSSFIAAK^^ 
RAYHWICGAAAFAGVALI24AGGAEEGGEVGWFGC^ 
A^IaUSIfFVVGVLIAVL^^ 
SEQ ID 7357 
ATGGTCATACAAAAGTATAQC^ 
AAAOTACATCCCGTCATTCXXmAAAAC^ 

SEQ ID 7358 

MVIQKYSGLTKIRTRRNARTVIPATF^^ 

SEQ ID 7359 

GCCTTTAAAGAMTTCACAAAACCGGTGAGAATCG^ 

AAACOTCACTCCGGTTTG^ 

GAACACATCGATGGaCGGTCGAACATATTGACCTO 

TCGAAAOXnCCATMTC^^ 

TGTCGTCCTGATCCTGTTT^^ 

ACCCAAACCGGTTGCGCCGACCAAGCCGTATO 

AAACCCAMGCCGTAGCCAGTCCGACXXX^CAAAGTTTO 

TCMGCCGATGGACGTGMGAAMTCAGCATAAACGCATTTTGC^ 

CQCCAOX3GTTCX3a^Tffre 

ATATOCTTTGACCTCCTAA | 
SEQ ID 7360 

AFraiHKT60mGINQIDKERADHRHNQiaW 
EHIWjTVEHIDLECGGKDVADNHAAKQT^ 
TQTGCADQAVLRmRSPRT*RAVSARQmTG 
WJGFGDVHTQDFDFIiNQNLAI^^ 
SEQ ID 7361

TTGACTCGCCTTGCCGTACTATCTGTACTGT^^ 

SEQ ID 7362 

LTRIAVLSVIiSVASSPCPDI^IHYTTim^aV 

SEQ ID 7363 

TTGACCTTCGG(XAAAAGGATGGCTTC«^ 
GG6A& 
SEQ ID 7364 

LTFGQKIXjFGNLMGGNPDLGn'AlPKDYLKVR^ 

SEQ ID 7365 

TTGTTTGGAGCAAACACGATGAAACAGCCTGTTT^ 

TATTGG<mCAAACACCTGTTG^ 

CATCCCTGCOXXX5AAGCG(7^ 

ACCGCGCC6CT6CACAAAGGCATCATCMC6CCGCGCG 

GCAAAGGCCTGCGCGTCXKXX^GTAACGACCCACCTO 

ATTCGGCATCAAAMTCCCAAMTCCTAGTC 

GMGGGATAMCCTCGCCGGCCCGTATCCGGCGGACAC^ 

ACAGCTT0GQGCAG(XXX3TGAACA^ 

TCCCGTGGAGACC(XX£TCGAGATC 

SEQ ID 7366 

U^inMKQPVFAW^ 

TAPLHKGIINAARASTGFFSGHTEYLAEKSGTGQVVMML^ 
EGINLAGPYPADTLFQPnfliEGAMVLAMYHD^ 

SEQ ID 7367 

ATGGGAATACACATATTTGAAATAACAGAAAMG 

CCGAAOTGGTTGGGACAOX^TT^ 
TGAAGTCGGACGCTTTGAirKX^^ 
SEQ ID 7368

MGIHIPEITEKAGAKPAAASVAVHPCCRTYRFNFANAS^ 

SEQ ID 7369 

TTG(XXX3AAGGTCAAAAATCA(XXX;ra 

AAAGCGTTACGGTCGCAAACGGCGTCGTTACCGCC^^ 
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ATCGACACCAAGCAOCTCCCGTCAACCTGC 



CGCGATAAATCAACTGCCAAA 
RDKSTAX 

CTaWSSCGA^CTAAATAAGOT 
AATCCCAT 

TU(jRVRnim t GIiSPSVSTSmiS**RCIA**KRP«tSH 

SEQ ID 7374

LADYY£SLSPRIFDTKILFIjYIiSNIYHSCIFT 

8 KJ52!™ 



ATCAACGAGAACmACCGAATGOTO^^ 

irrTOri » TTCCGCCTtnTXKGGCGCAGCATCAMGAAATGCTCGGCGGCCGC^^ 
(^^GCTCirTAAAGTCCGCGACTGCGACAACCACCATTTCC^^ 



TGATCCGGGC&3CCCTG<^^ 

SEQ ID 7378 

VramMRHMQGLEAQSmFHRKGRPM 

GGCGGCATACGCGG9GATG9GGCT(^CGTTCAAC(XXSrGC^ 

TCAACATCAGT6AAAATCTTTTTTTAACCGGT 

SEQ ID 7382 

VIBCHPPKHGFYLTGAQSaPSPVHPPICIBPPPGTOYHPPFHISHlIiFMG 

TTGGCTGATTATTACTCAT 
SEQ ID 7384
SEQ ID 7385

ATGMTCCaGCftXXmAA^ 

GCCCGTATGTGCA(XKX^TTTAGCCTACGCC6CCGMOT 

TACOATTCCATCCAOC^^ 

TTGGAAAGAAAGAATMTAAAACTTTTCGC^^ 

ACGATTimACTCAACGACAAATTCm 

TGATGCTGACGCAGCAGTTACGGTTO 

GCGCCCGGCCmCOTGGACGCC^^ 

SEQ ID 7386

HNPARKKPSLIJSSLLFSSIi^ 
LERKHNKTFGGl^UnKYQ]^ 
APGiyrUJAGYRYHNWGRLENTRFKT^ 

SEQ ID 7387 

TOTATTTTCTACTAAAAACAGAATOCCT?^^ 
TCTMTOATCTCGGOTGGATATGMGCATTGTOT 
GATTGGCAAATGGTTTGGGTATGGATGATGTAATCn^ 
ATTAGCTAATAAATCAAATATATCTTG^^ 
TAGGCGCGTCGACGGCCATATTTAGCATAATO 
AAGTAAACGCCAAAGCACGTTTATACTG^ 
ACCGGCAT6CGGGCTGCCTGAAAAAATCATTGCGACATCA 
ACACGATAGCCGCGTTTTTTGGCTTCOTAGGCA^ 
TACCGATATAGTTAATTTTCGGATATTT^ 
SEQ ID 7388 

IR**imLIK(WMPPMIENSLVIVIQI^KifriJOT 
TGMm*KNHCDim*(3WnOT*C*SNLVINC^ 

SEQ ID 7389 

ATGTGGGTCATGTGTGTATTGCCTATCGGTTT^ 
TTTAQGTGCGCGGAAAGGGAAAAGCCGCTTCGCGCX^^ 

SEQ ID 7390

MWWOLPIGFSKiIPSRMPSEKRNGFKSre j, 

SEQ ID 7391

TOATTTTCACTCATTTTTAAAATACTTTCAGCAA^ 

AAAGATTCTAAGGCAGCTAATTCACGAACGGTC 

TACOSATATCACCTGTTTTCGGACGTMTGG 

AACGTGTGGATTWTTTOGCCAGCAGCCAG 

ATAAATATCACTC(KOTACGTATCTGAGGAM 

MTTGTGTOTATAGAGTOGCGCTACATTTTCCATAACAA 

AATCTTTOTGCCATACTAAAACCTTCACAAG 

AATTGATGACG(XKH3AAG^ 

AAAGGTCAATATAGGTAAGTGGTGTGTGCAT 

SEQ ID 7392 

FIFTHF*NTFSNSPCQ*RRHGVTNLLLLG^ 
NW^ASSQFffl^C*SNSIflGN^^ 

SEQ ID 7393 

GTGCGGACAGGTCC(^TTCCCGCC^^ 
TTAAAAATGCCGTC 

SEQ ID 7394

WTGPDSRLRGHDEFWXX^^ 

SEQ ID 7395

TGTCMCGT^^
ATATCGAAACGCTOGGACAAATGTCGTGC^ 
CATCCAGTTCCGCOGCCTGAACGCCAGCA^ 
GACCTFITCCAA(^GCCGiCGAAGACCT 
TGTCCGGCAAAATCCACATCGGTTT^ 
a&CACGCCGCCGCGTATTGACGGACGCA^ 
GTGTCCTGCTGGATTACGCA^ 
TCGAAGACAAAATCAACCGCTTCGCCGAC^ 
CCAAATCGOGCTOGTGCGCAGTATGMGGGTCTO 
ACCATTC&GGGAOTTTT^^ 
TCCTGCTCCGCCGCGAACAAGCCTACCTCGGOT 
AGACMCQOTATATGCGCCTGAC^^ 

(XXXXXKX3ACATCGAMCCTTAAAACT(XX^ 
AGCCG(^TTOCXMXX3TTA^^ 
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^wnMTyPKMfWV^ 



tiklffagq: 



SRISGVTPAAVMJUVHLKBGFKDfcR 

SEQ ID 7398 



YVHGIFIWPFPVTGIIRGnAPAAEWADFJI^^ 

S TOAAATTC^TTCCCGMMaCCGC^^ 

SEQ ID 7402 

LKFVIPI>nTAISKPVIPAOMIQTPDAAGIYRK 



CAGAAACCTGCTCGCGCAGCAGGO.T1V1T1' 
SEQ ID 7404 



CGCTCGCGTCCACXXXCTCTTO 

ISoMCGAmcx^^ 



rtTfirGGCGMjCACCAAGTCTlTCAGA 



TGCcTCAACCamXG^ 



GACCffiTCGCGCOGTCGTCCAA^ 
TIMCGGTICCXaCftC 




JPARGF FTVTRDKG 
SEQ ID 7407

roTTTATGACC(flCCTGATO 

CanCCTCCCGOXX^^ 

TITOCCGGTTTGAAGGCAGTGGGGGAMGCGT^^ 

GTATTCAGCICGTCXSGCMCCGTOACG^ 

TGACCGACATCCAAAAAGCCGAT^^ 

TTTGCCCTATATCTGTCMGTGGCITCGGGCAGG^ 

GAAGGGCATATCGCGGC^TGMGGCIGAAAGGCGGCGTT^^ 

GTTTGCACATTCOTACCGAAT^^ 

SEQ ID 7408 

WSSSAmGDAEKVPYTEDMRPGDTA^ 
EGHIAAMMGGVAGVHLFNLGSGRAYSV^ 

SEQ ID 7409 

ATGGTTCAAAATGGGCATTTGTGTAAACGGTTTTC^^ 

CC6(XGTGCAGATGGTGTC6G<XXn , GTC^^ 

GATGGGCGCAAAWSATACCGGCAMCTCGCACTT^^ 

CC6CTGCAAAGCCCC6AAGCG(KjCAAG6^TGAATAOT 

GCTATGCCGAAGCC^TACCATCCGCGCAGGGCGGGAAGTO 

CAGGGCGGTTTGCACGAAA^ 

SEQ ID 7410 

MVQNGHI£KRFSQSAAMVAYRW^^ 

QGGLHMGRRTFGHSMIVDPWGDVLD^EGEGIVT 

SEQ ID 7411 

TTGCGGCATATTCGCC^ 

TOACCTCAAMGGCA&^CTC 

CAGCATCGGCGGTGCGGACTTCMTACTGATTGCC^ 

A^TCTTCGGTCATCGAAGCCTTCCGCCGCTGC^^ 

TC(X3CCACAGAAAACTCATCATCAC(X^ 

ATATGCCCX^COTjACCGC^^ 

ATCTGCCTK5CCAACAMGACAGCCTGCTG(^ 
TOTGACXXKmCGACC^^ 

SEQ ID 7412 

UIHIPPIGYHGGTLRKPFT^ 

KSSVIEAFRRCLPGRLHRCIAAHPIAGSDRNGAQAAQFGIJRHRKUITPHGGE 
DFTRIASGHPAWADICIANKDSLI^LVQGI^ 
SEQ ID 7413

SEQ ID 7414 

KAVWKLPEIQKH^NRTMPACAGKTDFRFP 

SEQ ID 7415 

acctacgcgggcaatctcgmtcgctgaccgac^ 

at(hxx3tgatgcacttggcggcgg^ 

gcagcaaatgccgtctgaaaaacgcgaagcctkx^^ 

tccagcccctactcox^^ 

ccgaaaaactcattottoat^ 

TCAOEtf^Ca^GTG^ 
SEQ ID 7416 

TYAGNLESLTOIADNPRYAPEQVD^ 
SSPYSASKAAADHLVI^ 

SEQ ID 7417 

TTOTMTCCOTATATCGTCATTCO^^ 
AAAACGGTTTPTCCAACCCTGAGAC^ 
SEQ ID 7418 
UiIRnVIPAKTKKIKNBniOHHSRAGGHIiGI^^ 

SEQ ID 7419 

ATGGCTCimCCTGAOT^ 

ATTCTCGMCCGCTGCCCG^^ 

GC6AA6GCAA6GTCX3GCGACCAACACA0CG^ 

CCAACCOnXX^CAACTGATGAC^ 

(^AGCGAMT(£ACMCAT^ 

TCACCGTCCGCGTCGAAAA(X5CCAAAACCTGCGAC^^ 
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GGGCAGGCGGTGATGTCGAAGGACGAAAraCCGTCCGCAT^^ 
ACCGTTCC 

SEQ ID 7420 

MAVNWEKTAEQIiBDIDG^ 

ILEPLPADKI IAALPKHQPAFWNEAAI&IMTTDT^^ 

QSEIMADPRYAQLKKLIiCSIiALELAQmRM 

GQAVMSKDElTVRIKIflRGQAAAS^^ 

SEQ ID 7421 

GATGCCGCGCAAGAGTI^TTCACGCTGGTTTI^^ 

ACGACCGAACCTGCOGGTACGCGGCCTTGG^ 

CTTCXJACGATTTCGGMCGCGCGCCGATGAAGC^ 

GAffTTQOQCGCAAGMOOSAC^^ 

CGGGCAACCGCACCCGGMCTGCGCGGAAGCC^ 

TGTCTTGGATGCGGAAGGACAGCMCACGGCT^^ 

GACaTTGCOSGTATTGTTCA 
SEQ ID 7422 

DAAQKFVHAGFGAGLRVHAFDDIX3TVKAVAAVFGRQAAGro 

DI^ARAIXmPSIDHRAFV^^ 

QHGFFDFGGNGGRSim^ 

SEQ ID 7423 

TTCOTTCXXAAATTTATACTCTATTTGTTAAAAT^ 

TCGCGMGACGGM£TT(XX^ 
GTTGGGCAAACCGCCATTGGTTTTTGTCX^^ 
TCGGGCGAATAOGACTACCTGATTAAAGTAOT^^ 
TGGAGGAAGTCAAAGAAMTCCGGTTTTGGATTTGGAT 

SEQ ID 7424

IaLSQIYlW/KlNNKYKVIQMKEIiDKIDFRiro 
SGEYDYLIKVRLPIMSAYRI^ILGNILLQIiPAASES 

SEQ ID 7425

ATCCAAACCGCAGGO&AAAAAACATCCTOT 
CCTACGCGGGCMTCTCGAATCGCTGACCGAC^ 
TGCCG^l^CTKSCGGCGG^^ 
CAGCAAATGCCGTCTGAAAAACGCGAAGCCTTCX^ 

ccAGCccmcrcmxrrccma^^ 

OSAAAAACTaTTCCTTTGA 

CAGGTCGTTACCGAAGGTCTTGTCGGCSAAAOOT s ™ s s s Kn 

CGGCCGGCGTGGCACGGTATGAAGATTTAATTACTTTCGTAC^ 
AACTTTOAATCCOXX^CCGa^^ 
SEQ ID 7426 



Q(^pSEKREAFRFHHISTDEVYGDIiHGTDDLFTmPyAPSSTO 
QWTBGWmNIGGHNEKTNlimmCAIiLEEI^ 

SEQ ID 7427 

AT(&7ECC(mraXX£CA^^ 
CATTGTCCGTATTGCTGTGGGGTTTCGGCTAtt 



TACGGCGCGGmGCATGGCTTTGOXGm 

<XXaT(X2G<mK^^ 

GTTTTTTACGTCCAAACGGTT^ 

(^»IXJGGCGGCTTTCGCGTTTGCGGCX3G^ 

CCGGATOGGGCTGATTGCGGTCGGCGCGTOT 

CGCGCTCGSTOTACGGGCMTTCGATTO^ 



SEQ ID 7428 

MIPFCGKYIMKQGEHHK^^ 

sGiLGHiFm<&v(mra 

LSAAFAFMGVIFTVQVYRWWYKPVLKEPMLW 
YTHSIBTSSVLFAI1M1LWAWKYIPWLIRPRSDGRPG 

SEQ ID 7429 

ATGAGTGCGMCATCCTTGCCATCGCCMTCAGM 

ATCCGCAGG(mTGCGAC(^CGG<KA 
CGGATACGGCGTGTT03(n^ 
TTTATOTGATCGACTGTCCGCXnTO 
1XUTTGCGA0CGTGCGCAAAATCCGTCAGGC(K 
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CAGCCATTTCGGGGATTTGCTTTTTGAM 
TATCTTGCmOX^ 
SEQ ID 7430 

YLALADELAARVSGK 

SEQ ID 7431 

A'EOGAACOTIAAGAROG^ 
ATCTGACCGCCCTGCXXJGACGAGGAAA^ 

ACTGGGCGG(mT6AAGG(KOTATATCCGCAGa 
CATCCTGAGCAAGCGT 

SEQ ID 7432 

MERKERLRAGIAAMGPDISE^ 
VVSCfflVEAVSDVRADWSRAF^^ 

SEQ ID 7433

ATGCCGTCTGAAGCGCGCTCAACGCTTGCTCAG^ 

!KXXXXn"GCG6ATATACGCCCTTCAl^ 

(X^CGT(XX5AAACCGCCTCCAC6C6CCCC6A 

CACATaSGACGGCAOUTC^ 

ACAATCATTTTTTCCTCGTCGCGCA 

CGGCAATGCCTGCA(X£AGGCGTTCTTO 

CCGTTCCGTOCTATTTmCC^ 

SEQ ID 7434
HPSBARSTLAQDDDMAPMQAM^ 



PFRPIFSCGTPgAAPARASGVSAASA 

SEQ ID 7435 

ATGGTATTTGGACAACAAAACCCGGCGGCAAAACGCATAAAACGGCA^ 

CATCGCAAAGGAAGAAAGAAAAGATGAAAGGCATC^^ 

(amATTAcaxrnxsTCGGT^ 

CGCCIXXIAATATGCCGTTCMCCCAGCCXX^ 

AATCATTTACGCAAACATTCAAACAGGCCGCCGC^ 

CTTATCX^TCGAAGAGAMCCGCMCAGCCCAAATO 

TTGGAMTTTCCGACCTCAACCGGA^ 

CCTTCOTCCAAAJOTTGO^ 

AACCGCCTOCGGGCAATACCTGCTGCGCCTGATO 

SEQ ID 7436

HVFGQQNPAAKJUKRQLPFCTLRYGQ 

RLQYAVQPSPD6IiA(^^ 
liEISDLNKMYLEDGSLSVQIIjGRGFASJLTO 

SEQ ID 7437

ATGCCGTCTGAAACCGCSn'ATCGGGTTT^ 
ACACCAGCCACACGGCAAACGCATA'TTCCGACACC^^ 
CAGAAACAGCGCG 
SEQ ID 7438 
KPSmYKVSDGISVPS?PF^^ 

SEQ ID 7439

ATCAAACXX3TTCAAACGACA!PTCCX3C^ 
TGCATTTTGCCGAACTGGAAACCCATTTT^ 
GATGATGGCMGCCCCGAATGCGGTAACGCGGCGTO 
GCA 

SEQ ID 7440

i^LKRHSALVGLSREHH^^ 

A 

SEQ ID 7441 

A^TATATATAAACGGTTGTTTGTGCAACTTTTTO 

ATGCGACCCTGTTTGCGGGQCTGGTGTTTT^^ 
CGGCCGGCGTCGCGGTTGGGACGGCACTGTTTGG^^ 
TCGCOTACXK^TGTTTCTGTCGGGW 
CGTTTGCCG^GCTGGTGra 

SEQ ID 7442
MIYIKGCIOTSGGRQGGARyNAACFGRGGTAHFVNEK^ 
SRYALFI^GVAACMVmPKLLl^SAAL^ 

SEQ ID 7443 

ATGGACATC^TCGACACCXKXCT^XCGAOT 

MCGAACTTTCGTC^^ 
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CGTGGCCGT(X3ATATGC(XX^GGTTC 

TTGGGCGAC(£(£{X&AAGTO 

TACTTTCGCCCAAAGACCTO 

SEQ ID 7444 

MDIIDTALPDVKLLKPQVFT^ 
tmAEVmCTDYYNPETEQVLM 

SEQ ID 7445 

ATCTATCTGACC(MCATRCCGACTACG 
TG&TGAAAGTGGTTftCCGCGCTGGTCAAA^^ 
CCTCG^CCGATGCAGCTGGTCGAGTGT^^ 
GGTTTCACGCTGCMGACCTGCT^^ 
SEQ ID 7446 

— > -nLVBMFHHBMTPS^ 



GFTLQDLLNKPTYDLLXESKIPIAVR 

SEQ ID 7447 

TTGCACAMCAACXX5T1TATATATATCATMCGAAAA 

ACCGCCACCGCCO^^ 
CCTCGGCATGCTCCAGTTCCAAGGCGCGAOT 
GGCMCCTCCTCTTCTACCTGACCATC 
ACAACGGCAGCGAATGGCTCGACAGCGGCCTGATGCGQ^^ 

ATCGTCAAAAGCCMCAGCCACCTCGOC^^ 
(X^CCGCCGCCMGCm^TCTCCCAM 

QCOTCGTCATCCTGCTG^ 
SEQ ID 7448 



RPPHPHFUIRHQPRTGSPRRTO^^ 

^ATCTO&G^^ 
MtfXXTnTCGGCCGC«^ 

(XIMCACGGCGMTGGATAGGGATGACCGTCTTCGTCGT^^ 
GGTTTGGGCGTTTTATGGCTGMCC^GCATOATTra 
CTATGCTGGCGGGGCTGACGATGTGCATGCTCATCGGC^^ 
CAAACTGCTGCCGCTGAAATCCACACTGATGTCGCG^ 
GAGCAGAATATCGTCAAAATCCG(XAAATCAA^ 
CCCACCGCAAMTC(m^CACCACCGA 
CGACCTGCAACAAACCGCCGCCOTATCAACGGCAGAC^ 
TTCCTCTGGCTCAGCACCAATATG{£ 
AAACACGGGAACACGGC 
SEQ ID 7450 



EQNMVKMRQBJAIOWKSR^^ 

FIM^TNmQEISALVILL^RTRRmi)AHERQHLRQSLLETmG 
CA 

SEQ ID 7452 

MPARIAPGRNIIMRQTKCRPNAGIjGGILM^ 

SEQ ID 7453 

(XXXXX?PTGGTCACAGAGAAG(^ 

(»GG(XMCTT!EAra 

TAACCAGCGCAttyACGATACAGCCGA^ 

GAmCTGCCGATCCGCAAAAA^ 

ACGATCCCCACGCCGGAMTCGGGCTGGAAGAC^ 

GCX2ACGCCATGCCGCCCGTAATGTGCGMTCGCCGATA 

GCGTTCTGTAGCGGGCGCACCGCCGCCAAAACT^^ 

ATAAAACGCACTTTTTCCTICCACAGCTT^ 

GCAOaTGtmHXX*^^ 

GGCAATCMaCGCftfi^^ 

nxyUU^mCAGCAO^ " 
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GTGATCAATGCGCXXaAGTATCAT 

SEQ ID 7454 

PTRLQYALCKEKHPGNEERQAQPRSGGIPVLQIQRERRI^ 

QGKFTA* *FFQHQIDDD* YRSNRimDVRPFEVVGENARra 

DYCRSAKKRQRQ*EFAVGLTO 

AFCSGRTAARlMHIflAF^ 

OffiTOTSGAKAHQ 



ATCCX26TCTGAAGCATCAGCX3GTT^^ 

CQ^CGCGGGTTTGGGCGGCATTT^ 

ACGTGAAAAAC(^CTTCATTGCACX?GCCGAGTATGCOT 

GACCGCGCCGA171TTGATGCGTTCGGGCGGCGOT 

TTGGAAATGCCCTAGGTO^ 

CGGTATGGATCGTffiTOTTATCGGTACGGTT^ 

ttTCC(XHXTACCGTm^ 

SEQ ID 7456 

MPSEASAVQTASRALPAAAHKA6AYRARSEYP 

DRADIDAFGRRGQPQPOTPAYAVQ^^ 
HPRYRLIICYCNLHL 

SEQ ID 7458 

MI^SSRYFSTEPDi^IILPljTTDVHIUASRBI^ 

SEQ ID 7459 

TTGGGAAAGCAATTATATCAGGAAAAG 

SEQ ID 7460 

LGKQLYQERQTAFIJENCXJSGOT'QGSLIF I 1 
SEQ ID 7461

ATAAGCAtXXKK^CAATCAGCGCGGiTCAGCACGCCGTTG^ 

TGCCGAGCGACATACC^CGGACGAGGGCATGACXjACCGTGT^ 

GCOGATGGWXXJGGTGATTTCGATOQCGATGGGG^ 

CTAACGCTGCCCGCMGCTGCGAMCGATGACGGG 

ATTCCGCGGCGTT^ 

MTOGCAAG(XK3GTACA(X3GCAAGCG1^ 

ADGRGDFDGDtWGHRPGRERTO 



SEQ ID 7463 

ATGMC^TTATCGATGCKAICATC^OT 
ACTTGTTTTCCGGCAGTTT^ 
CATCGAAOTAAGAAAATTGAAAGTAAAGATTCCG^ 
TGGGAAGAGAAAGACATTATCTATAATTGTCGGC^^ 

WEEKDIIYHCRRK 

GGCGTTTCGACMCAATATGGCAGTACAAATOT 
GCGGACGOTAaTACAGGTOTXX^ 

GGOTCTTCMGCCTGATTO 
G(^CCGCCAMTCCGCCAAA(XX^TTO 
GTACTTCTTTAMCTTCTCTTO 
CAGCGTCGCATAAAAATCTT^^ 

SEQ ID 7466



GFFKRDFOTMaVMM 
QHRIKIIiITHFPI*IC*HH 
SEQ ID 7467

ATGACX&TAAGGAMCAAAC^^ 

CCGATTOTOTACCm^ 

GTTGGGCACATACGCCAA^^ 

AAACGCAAAACCATCGGGCTGOGT^^ 

CCGACCCTTATGAAATCCGCGAGC6TTO 

SEQ ID 7468 

MriKKQMAQFFAVHE^ 
KRK?I(MIPDHAVAQALLEELGOT^ 

SEQ ID 7469 

ATGAGCACCTTOTCCGGCA^ 
AAAAAACCCGTTACCTCA6AGACCGCCGCG(XXX^ 
CATCACXXECATCGGTITCMCOTT^^ 
AAAGTGGGTGCGCAAAGCCACCTGAAGOCX^TGTG^ 

SEQ ID 7470 

MSTFFRQTAQAHTAKHIGRFP^ 
KVGAQSHLKAMCLNLLKAANRLSAPAAA 

SEQ ID 7471 

TOGQCMSGGTTCACGGAAT^^ 
TGGGCA(X3G^CC(XM^ 
TTTCAATITCATGGCCATTATCAACG^ 
AACCCGGCGCGATTAAGAAATGCAAAACTCArc^^ 

SEQ ID 7472 

LAMVYGIDYOtfJABCYLKIKKQI 

NPARIiENAKLITYHL 

SEQ ID 7473 

TTGGGGCGAAAACAGCTGAAAAOTGTGTTTGGff 
(MGACACAAAAACTTAGATAGGTATAAAACCAACCGCCAT 

SEQ ID 7474

LGRKQLKTCVWVSAVEGRAFL^^ 

ACGATAAMCCATTTGGGCMCCTTCAAGGCAAACCATCC^ 

CCCACXJCTGCCAGTOTGGTCTGA^ 

GAAMCGTCAGCGGAATGCTGGCAAACCGCCA^ 

TAGCACAQGAACGCAAAAGGGTCTTCTACATCGGTT^^ 

TOGGATTTGCAGGACACAGCCGTACCTTOT 

GTTAAAGCX3TGGGATGAAj^GGGTFTTACCGTCC^^ 

^GAAACACTATATCGGAGGATGACGGTACGCGAA 

CCCCGTCAACCTOCCTAC^ 

SEQ ID 7476 

MQNSSPTTYNPMHISLFSGCGGLDIXjFE^ 

EWSC3fliANHHNGAVQNLLKMFIXX 

VKAWDEQGFTVQASGRQCQIflPQ^ 

SEQ ID 7477 

ATG*KXX3ACCAACTCATTCTTGT^^ 

AAGCCGTCATTAOTTCAACAMGACGGCAACAAACGC^^ 

O^TCAAAGCCAKX^CGCCGa 

GAmTTGCCCAOTAGGO&C^ 

GCACCCTa3A ^^ 

ACTCGTCTTCaCC«^ 
AATTCGGGCATTATCAGCCXXaACCGATTCTT^^ 

SEQ ID 7478 

MSDQLILVLNCVSSSLKGAVIDR^ 
NPAHI SGILAAQEHFPGIiP WGVMDTSFHQ^ 
TDPOTSYPTFHAGMDVAQ^ 
NSGIISPTDSSPAVLVVPTNEELMIACOTAEIAGIL 

SEQ ID 7479

TTGCATCGATTCATTCCCCGAT^^ 

GCGCAGGAGMCAGCATACAGAACAATACAAACATTTTTT^^ 
TT6CXXXX3CCCCGGCTCAAAGGGAAC^ 
SEQ ID 7480

LHRFIPRYSAGLSPAISLPNKEQKSA(& 
AQERSIQNmmFFHVFPIiR^^ 
SEQ ID 7481 

CCTTTTGGAOGCGOGCACX^fCGGCG^^ 
CtmGCOGATGCCC^^ 
TCAAAAAACTCCTCAMGACAAAGAAATTTCCGM 
AGAAGACCTGATGGCGGTT 

SEQ ID 7482

rsvevlkeotjocvr gvkpf aavgdu

RVPMPiCLTEERRKDLIKVVRGEAEEGRVSIRNVRRDAN^ 

SEQ ID 7483 

TTCGGCAAAGCGTATGCCGTATTTCCTAGCGTCAT^^ 
GOSTGGGAATGACGGCGGTG^ 
SEQ ID 7484 

LGKAYAVFPSVIPTQAGIHICSHAG SDKKQETEYRLRIPACVGSTAVRLFGFMVI LNgMPSETADSNVRPT 
SEQ ID 7485

TTTTTTAACCGCAGGCAGCACC6GTTTGGCGGGGOT 
TCCACAGACTTGAAOXX^ 
AGAAGOXSCAGGAGAACAG^^ 
CGGC6TTGCCCXXXXXXX3GCTCA/AGGG^ 

SEQ ID 7486 

FFNWlQHRF(XiAF8CGRAI^ 

RRCPAPAQRERFPKVMKH . 
SEQ ID 7487 

TTGMCGCCGMtXGCGAAAGGCAGATATGAAGAGCAGCACGCA 

GTTTTCTCCC(X!CTATAATGG^ 

(KX3TC(^CXXX5AAGACGMGTTOGTIOT 

TTTMTCGGCAGATTCTGCMGGTATCGMGAGGCGGAAGOT 

ACAAACmTTGCCGMGGCGTATC 

CAATTTCCTGCTCTGG^ 

TTCGGAC(£ACTTCCGA(XAACTGCCTA^ 

SEQ ID 7488 

LHAlffRKADMKSSTQTILEHTAIP^ 

FNRQILQGIEIMALTANNTGI^ 

FGRTSBQLPIGQQRH 

SEQ ID 7489 

AAATGMTATCCCAAAGTTTCMGCCGTTCCra^ 
GTTTTCTGAMGGGTGTTTGGCAGTGACCTAC^^ 

SEQ ID 7490

K* ISQSFKPFLRKPDRTPYRCRPAI PDIPSGQSKHPRISGGFLKGCIAVTYFRKRP.PHYH 

SEQ ID 7491

ATGCITrCAGCGCTTATCTCTO 

TKXIAACGCCCACTGCAGATAGGGACCAAACTGTC 

GAGCCGACATCGAGGTGCCAAACTCCGCCGTCGATAT^^ 

CACTATGTCCTGCTOC(X^ 

GCTTTGGGAGGAGACCGCCOCAGTCAAACTG^ 

ACTGGCGTCTCTGCTTCC^^ , 

SEQ ID 7492 

HTiSALISSELSYPAMQLALQmQRFVHSGPLVLGA^ 
HYmSHLPDLSVSQLSYLLPIiHYQSDFRPDim 

SEQ ID 7493 

AT6CTGAAACAA(^GGTAA!!PAACCGCTO 
GGGAGTATGCCCGTATKSCCGGTTTGTGC^ 
TGTTGfmxmTTTTGGCT^^ 
CTCCTATCCCTGCGCCCGCATCC^ 

TACCGTGTTAATCGGTT^ 
GTGTTCGAC0GTACC6ACAGCCT6ATTGCW 
SEQ ID 7494 

LVSLRPHPDDAfcP^^ 
VFDKHJSLIAVISVVAAIMS7LH 

SEQ ID 7495 

ATGTTCCXIAACCGGCAATTTGGTCGACGAAATT^ 

CGGGCAAAGAGTTGCAGGACGACATCMCAACX^TGCCXKX^ 
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CTACGCTGCCTCTCGGTGCTSXGCTO^ 
TITO 

IflHAMHGIASVAimVLGTLV^ 
SEQ ID 7497

SEQ ID 7498 

SCPWSVCGTVRFKLKLSGFSWKRGIGCFVSVIHEHH 

ACCCTCCGGGGCTO^^ 
AGAAAACCCGGATTTGCC 

RKPGFA 

CCCGIXaTCCCGAASfiATTCCGCGTATTCGCGCTGGCAGO 

CCCGTTTTOAACAACG&TTOAAGCACGTTCGA^ 
GGCTOKAACTCAT^^ 
C^CTCGGCMTCCC^TATOmCQCC^ 
0CCCG&CTTCGOCXaX3TCCCCTOCCTGAAGTTCGCC^ 

GGGCATTTATCGGCiCACTGCGC 
(^GLPSAIAMQKGKTIYIAHKETLWSGALFM 



QIKFTOIAKTVAHCLAQIffaoroiEGIiIAQI^TRAQAMFIfflM 

TCTOKS3CGTACTCQGCAG0CCCGAOCCTBiCGGCAAACAGA^^ 

AAAATCCCTTCAGACGGCArcTGCACGGTCAAMTCTGGtt 
ACGGCGTAATHTOCQQCMCOGAAGTACAAATOGAAWTI 

SEQ ID 7504

QIICIGNASSSTSMVISDKSERAIMJ^^



KIPSX5ICrVKIWQKia<OTIA^ 

^ rrGCCTAAffiTCTTayilOTM 

TCGCACTICTtaTACCKXaGCACACTn 
TCTTCCGOGCAGGACGACTCGACCAffreAGCTATTAC^ 

SEQ ID 7506 



SSAQDDSTSHiIiFIi 

MMaWTTTGAAG^^ 
aa«(X£ACAATCGACGAGGGMT^ 

jCICtlCGBCMMGCBGCKETOftMCMnOG^ 

CGAlTWGGGTACGAAAGGGCTCGAAGGOGAAGASAGCCTTG<X^TO 

fYWTCGGAGGCAGCGATGACAGGCTGAAACCCGTGATOAGGCTT^ 

~ "™ - — COGTCtoOGGTAAATTGAOCCGQGTCATCGQWGGATCTCTAT 

ITATCAATACCGTTGCCGAWXGG<XGGMCCATCCCTTM^ 



TlXaTGTGCMraTCAAACGTGCAAWTTCACTTATATCGAG 
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PFGGSDDRIJCPVMRliSTM^ 
LMCNIKRAKFmE 
SEQ ID 7509 

tcccgcacoxe^^ 

CCCTACTCAACCCCC^^ 
^GCTGTTGCTG^^ 

TGGAAATAGGCGTGCTGGCAGTI<^CCCCTTCACC<^^ 

TCAGCCACGOCGGCAGCAGGC^CAGGTTGTGGCTGTAOG^^ 

CGGGGGCGi ^ 

CCG0CCAC ^^ 

CACCGCGATACCACGCOTt^CCGCCTCTGtt 
GCAGGCAGGGTCGGCATTAGAAACAAAATGGACGGAAATC^^ 

GCAAToanrra^^ 

ACGGCAGAGCAGTTGCAGCGTATCTTCGCCAAAACGGOT 
AAAAGGATGTCGCAOTATCCTC^ 
ACM 
SEQ ID 7510 
^AAFQTAFAVKRHQPFGRGIVYGRAAFQFADra 
GFW^KGVGTOKTAASVIjRAIiRRHRDDKVAGNQAI^ 



AGKTCIRNKMIXjNLPSER^ 

TAEQLQRIFAKTVPAVIEQHPVQTIiMHTLKTO *QNW*Q 
SEQ ID 7511

AGTAA(X^AACTTGAAATCCCTACTTTO > 
SEQ ID 7512

SKQT*HPYFDKAyCFWS*9CLLCFRIKSIQ 
SEQ ID 7513 

GTGGGCGGTTTGGAAGCGGAAACCGTGATGCTGGGACGC^^ 
TGTTGGCACTGACCGMITCTTGCGTAAAGAGCG^ 
GGAGTTCGGCCmCTGCCGCaTGTTCGCa 
mTGGGCAGGTGGCTTGAAAAO^ 

TGGCGGCGAAAGGGCTGGCGAAGOCTTACGAAGAGCCTTCAGACGGOC^ 

OG(XGCCGCACTGTTGGCACGCMTGCCAACCGC^ 

TTGCCCGAMTGGAAAAACTCXmTC 

CGAIATTGAAAAC^an^^ 

cGceACATmarcccGAax?^^ 

CMSGGGAACX^ACATTAAGAGGTATGCGTCCGOTGCGAOT 
GAAMTGGGTTTGCCTGMGAAGACTTCAACTCTT^ 
GACGGCA(XXjTACGCCAAGGTTCGTTGGCACGCGTTG 
CGGACTATGGTCAAGGCTCMGCCGCGACTGGGCGGCGAAGGGC^ 
CGTCTTGCCGCTCCAATTCAAACCCGGCACCAACCGCtt 
CGTAAAAACGGAGAAACOTCG^ 
ACGCGGCT 
SEQ ID 7514 
VWLEAETVMims^^ 
mOGUCTAVYPRVliKPDI^RHMM 
LPEMEKLO^IVAFACTT^^ 
RDmPMSOTCTAQKAPSPLYDWRPMSTY^ 
KSVRQGSLARVEPEGQTMRMWBAIETYMNRKQPLIIIAGADYG^ 
RKNGETVItfPVTCRPDTAE 

SEQ ID 7515 

AAAACAAACTCAITOCTTTGTTTGT 
SEQ ID 7516 
I/^AFCVSGI^YHHHPMBfFVTC^ 

SEQ ID 7517 

A'KXXCGGACACGCGCACAAGCGC^ 

TTTTGGTCAGCCTGCACGMTTC(^ 

ATGGTCCCTCGtt^ 

CmTCGTCGCC(XX^rcCGCTG^ 

l FTG<XXXXXX£ACCG^^ 

TQACO}GCauU«XX:C^GGCA^ 
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CTmATmTCGGCAAAOTMTCAO 

TTTTAGOTTOjTCAGCATCAGCCT^^ 

TffrCCAAAACATCGGTTTGCGCTI^^ 

SEQ ID 7518 

MPGHAHKRGHLSAHCADTWTOI^ 
AIVMGPLTNIiAMVLLYGIjSFSFGVTEIiRPYVG^ 
TTVAGGVBKGSPAHCAGIiKPGDItLTAAIXn^ 
LKFFGKIJSGNASVSHISGPLTIADIAGQSA^ 
SEQ ID 7519 

GTTTGmAmX^GCTTTCCAA 

SEQ ID 7520 

LCFRIKSIQSSPmVCFIiPSCBBFLSFAKHKKSKQ^ 

SEQ ID 7521 

ATGACTGCCMCCAAAGTTACCGCAAACCGCTG^ 

GCATTITGTCTGAGAACTTGGTCMCCGCGC6GA 

G(nK»TCTGCCAC(^TATTCTGGGGC^ 

ATCGTCGACCACTCTTTGGCG^ 

CC6CATTTGAAAATCTGGAC6TGATTCCGGCGGG 

CGGCAOGGACTCXKATACGOCGCACGTT^^ 

SEQ ID 7522 

tfTANQSYRKPIiPGTDIiEYYD^^ 
IVDHSIAVECGGYDPDAFRKH^ 

SEQ ID 7523 

GTCCGGCGTATGCGAGTCCGTGCCGACGCAGGTATCGGGGAA^^ 

ATCACGTCCACATTTTCAAATGC^ 

CCA(XXXmA6AGTGGTCGACGATAAQCTGCGl^^ 

(XXXAGJATATCGTGGCACACCA(X^ 

TTGACmGTTCTCAGACAA^ 

TGCGGTAACTTTGGTTGGCAGWUffGTO 

SEQ ID 7524 

VRRlffiVRADAGIGBSHAVFDI^^ 
I$»IVAHHPRGIPWCro^ 

SEQ ID 7525 

TTGTCTTTGTTTGTTGATTO 
GTGGAGGCAAACGGGAraMCCGATGACCCCCT^^ 

SEQ ID 7526 

LSLFVDFGFPIC*RSMRRYSTSQIHSC*K^^ 

SEQ ID 7527 

atgaaactgaaacagattgcctccgcact^ 

tath»actacct^^ 

<^<kttctgot^^ 

CAATACT1TAATCIAGGCGACACTCM 

TCGACATmCATaCCATT^GAGGGCAAATO 

CATTTGGACATGGCTGACACGAAGCGAOT 

GACATCCAAACCAACGAAGACAAMCCAGGCA 

AACTGGAAAMCTGCTGACCATGAAGCCCGGCAAATGGTACG^ 

OTACAGCCGCTGCCGAACGCCGGAACCAAAACC^ 

GTCGTG(X3CCGCGMTTGCGCCAMTGGAATCCGC^^ 

CTGGOTACGATATTTACGGAAAAGCCITC^^ 
ACCGCGTCMT'ITOX3GCTGGCG$ 
CAMGGCCTGCTGTACAMGGCACCGTCGGCTG^ 
CT<mTACTACTCCGCC^ 

ccrrcnTGA^ 

AGCCAACGTCKXKjCCGAGCTGCT^^ 

GAAAACGGTMCAACAMTCGGTTTACTCGGAAAACXX^ 

TCAGCTACGCCTACCCGCTGAAGAAAAAACCGGMGACGAA 

SEQ ID 7528 

MKLKQIASMJflJLGISPIWA^ 

QYFNQATLNQAVAG^^ 

DlQTNEDKTRQflHTVHEGGR^ 

VVMtEUtQIffiSAPYD^^ 
I^GYDnGKAFDKKASTSVKQYKTTOGGG 
LQYYSATHNQTWPFPLSKTFTU4LGGEVGIAGGYGRTKEIP 
ENGNNKSVYSEHMKSTFT^ 
SEQ ID 7529

CGAC&CGAGCTGACGACAGCC^^ 

ATCCACATCAT 
SEQ ID 7530

RHELmMQHLCTGSRHHSSVS^^ 
SEQ ID 7531 

ATGGAAGATTTAJACATMTACTCGCFTTG© 

CTCCCGTGTOGGCAATreCCACC^ 

ATCGTTTGCAGGCGGCGTGGTCGGTGCATTATCK^ 

TTGCCTniATTGTTTTGCTCGG^ 

GCTGT6AAATTGTTGATAGM 
SEQ ID 7532

MEDLYIIIAIfiLVAMIAfiPIDAIAfiGGGLITLPALIJ^ 

IiDGSKEGKAEMSFFLFGMTAPIjI/jF^^ 

AVKLIjIDERNPLYQHIVSHP 
SEQ ID 7533

TCGGTMTCTGTTTTGTTAMGAGCGTTGOGAAAra 

GC^GCGGCGMGAACCGAACTATACGCCCGCAGGGGAAAACAGTCM 
SEQ ID 7534 

SWCFVKERCEIIKYPFRIiSKISIJJI^ 
SEQ ID 7535

ATGCCGTCTGAAGCCCTTCmCGGCA^^ 

ACGTCGAWCCTTCCAAAAAAK^GCTra 

GCAAAMCTGCMCGO^GGCTTGGATTTGGAAA(^ 

AAACAGGCGCAGTTTGAAGAAGACTACAA(£TCCG<^^ 

TGCAGGACGTGA!mACGTCAACAC(m 
SEQ ID 7536 

MPSEAl^AFRGNIRRSFTMIRLTRAFAAMJGIXXTTGAHAD^ 
KQAQFEEDYNLRRlffiEFASLQQNANM 

SEQ ID 7537

AGATATCTCTCGATATOTOjACATO 

mTTTTGGGGAJ^TTCOT^TC 
SEQ ID 7538 

RYLSIFPTFRAILFSSS&ASAAAKNRTIR^^ 
SEQ ID 7539 

ATGACTGMACTACTCAAACCCCGAOC^^ 

GCGGTTACX5ACATTCTAMTTTGGCA(^AAAATO 

AGCCATCCC(&CA(XX&A{TO^ 
AAGAGAC(X3TCGG0GGTCATTTC{^ 
CmACTGCCC(XXraAmCCGGTA^ 
AMCGCTACCGCAACGCCX&CGAAGCCGACA!!^^ 
AA6T(KXMCGCG(?ITI^a 
CGTTTCCTACCAAAAATTCG^ 
CCGAGCGC^CTACCmJGCCCTC^^ 
SEQ ID 7540 
JfFETTQTPTFKPKKSVAIiS 
SHPDSEARDIADKIiIASIKjSILLYW^ 
KRYRNADEADIRERIGRKBIVIGFGHPVYTISD^ 
PSANYAGPEWiAFVKim 

SEQ ID 7541 

GTCGGCAAACTGTTTCTTCATCCT^^ 

TAGTCOCCTTCATCMTATTGCGGATC^ 

CGGGGGCTCAIITCGTGTTCT^ 

GTCTTOGGCCCAATCCMTTCAAAATAOT 
SEQ ID 7542 

VGntflflPLSAFUHJ^ 

VFR1WPIQNILFCQLYH 
SEQ ID 7543 

ATGATmCCGCTTTCAGACGGC^^ 

SEQ ID 7544 

MTfRFQTAFQQIRIDRLKNKTHKNTKEKYHD 

SEQ ID 7545 

TTGTTCTGTCAGCTTTACCATMTGT^^ 

SEQ ID 7546 

LFCCLYHNVCPLGSPQKKNKALQSGHPLIFIRLIA 
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GGTAAO , m „„ 

ccmcottamk^^ 
53525^^ 

GGttTCGGCAAATTCMCGCCGTTCCCAAAGTGGACGGACAAGT^ 
GIGKFNAVAKVDGQVAVEAVIMCAKEVV 

GCCTCTGCK&TGGACA^ 




ATCAAAAZOGGATAAATTGTTTCAAAAA 

SEQ ID 7554 

GHRSL*AGAGQBMGPC*PTIPSGBHLTRHHIK| 

SEQ ID 7556 

IADPAGBAAGECGGHGMLSDGSDGILCMPSEAVSGQER 

CATCMTCGATCGGCAATCCTGCGGTCTGTTTCCGATACT^ 
TCTGACCAMCCGCCCGC^^ 

FDQSARNGSRHISAAEKINIHYFYPIQ 
TGGGGGATGAGTTTTTGGATGTCTTTCGCTTCG^ 
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T"IOTTPAGTTTTAAAGGTGCAATAAAAGM 
GATCAAGTGATTGCGGTCTGCGATGAAATCGACMAOT 

DQVIAVCDEIDKIj 

SEQ ID 7563

ATGATTGGGATGCXXIGACGCGTCGGATGGCTGT^^ 

SEQ ID 7564 

MIQffDASIXSCVIiPSBCIXi^ 

SEQ ID 7566 

HDRI^SHSDGKTQPSDASGIPIIKHIWEOiSy 

SEQ ID 7567 

CAGGCTTCAAACTTCCCTATTTTOT 

GACKTCGCCGCCAATATCCCAAAAATCGATC 

CMCGO^GCCGTTOXmTATW 

QRSRCHIFYYPNFH 

S ATGG^^ 

ACATC 
SEQ ID 7570 

MGKIILLIFKTl^RKTADRSEIiCGK™ j 
SEQ ID 7571

GTC(^CACGCCGTCATCAACGGCCACACCACCATC^ 
TATCGOMCSGC^^ 

TTITACCGCAGAACAGATTTtXGCCGTCAAAGACCT 
GCCGTATTT(MGACTTCTTCGCACM 

^iffitfATO^ 
DCVVQ^FANN^ 
AVFQDFFAQSTRGIIR 

S |tG<^^ 

SEQ ID 7574 
VGIEILVFGLIDS ILKKPSERPSDGILFARn 

AGGTCGGCAAGCAGOTTTGCATGGTCTGAAAC^^ 



GGCOTOTCAGACGGCATACGGTAT^ 
TO^mCOTACCACTCX^ 

GGLQTAYGMHPMRAFGDKFGFVGIiHIADDV^^ 

S a^m^ ' 

TCGTCAATCXICCTGATTTTGGGCGAGACCGGM 

GCTT6(XXX!GGGGATGGGGCTGAATGCCTATTTCA{^^ 

AGOTTmAAAGTCAGGGAAATO^^ 

CATCAGCATTCTGACGOT 

TTTACCGTCAGCATGGTCAGCGTGATT^^ 

GCC1K5AAACGCGCACTGCTTGCAGACTCT 

CCTGACGOXXnTACOG^ 

CaaGTCCGAGGO^^ 

' — — ATTGAAATTCTGGTATTTGGGC 
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RSAPI>IIM)DHTEAAPAFLTIVFHPPTre 
CGGCGAt^TGCfmCCCCGAMOOT^^ 

(^■TTCC0CGCCGGAGAGAAAGAAA!KK3TGCGTTCCRTI " " "" ™ r ™ mrm 

TCCTCMTCCCAACCCCGCWTCGXAACC^ «—~ 

SEQ ID 7580

^TOIiAFGAKHKASDUn^ISPm™ 

PRGMVLVTCPTGSSOTTLWUaiWINETQPM 

VmGBKBWRSfflSESLTAVISQH^^ 

SEQ ID 7582 

MPSETPKAFQTASVFFKAVCCLIDASFCIW 

SEQ ID 7583

TTGTAT 

SEQ ID 7584 

LYSGUWPCATOQrVQEVRQGEmYBFKTOli 

OCGTCCGGCAGttCAGGAAGATCGGGAGGOCAT^ 

GAGATTGCCMGTCTTTGGGCATTCTGACCG^ 

COHIGMIMCMGCCGMIGGB^ 

OTAACITCCCCGAGOSAMTaTCMOT^ 

GCQACOSACCAGGCCATma^^ 

AAAAGGA(XXX3TCGATCCTACTCCGGCAAGGGAAOTAGAAGCG^ 
CTGACAGCWXGGATITOGATMCCAGTCTGTOCTIGACGK 

SEQ ID 7586 



MBFVYDVAESAVSPAVIKVIGUfflGGCIIAIW^^ — — „,„,..„„„^,.^ Trm , m , 

rtakslgILTVAVVTRPF^^ 

M^ssPiaJwruxsaRGVLVHim 

LTAADFD8QSVU3DFEIPAILSRCBHSDK 
TOSTOSCmOTCCTGTn^ 

rwMffliiQOCTTCOCGaT^ 

GMGCGmtKOTCATM(JAAAAC«^CGra 

GTCGl^TAAACAGCATATTGGCACCGCGMTGGCITCT^ 



^™°LI™VIEIRCCQM^ 
IDQBTACTIQSYIVQQRTGNQLVGRHTRTV^^ 

AAAQADYFNYRKAIXnffCHVVNKFHSKTPDRPVQRTV 

TCTGCGGCTTCGTCGCCITOTOC 

SEQ ID 7590 

tgmkeaabiihaakotsensgijmrysvaspcrtsctvcgpvals 

S !™^Iaaccto^aca™3acg^ 
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TAOTACAGCCAAGGCAGCGCAGGTTTGGTTI^^ 

TGATTATCGTCGGGCCTACXXjGTTCGGGCMATCX^CCA 

CAAACCGCGCCGCTCaTC™ 

AACAGCACAACCAAATACIK^TTGACATCGCCOT 
GCCCGCCATraGaCTTCATCra 
GQCATCATCAGTTACGAAGAAGCCCTGCGCCAGTC^ 
TCAACCTCATPTCC 
SEQ ID 7592 



YyS(X3SA<mWUNHVIPQ^ 

SRESMEYAl^QTCHLCIFTIflAOT 

GIISYEEALRQSVSAMNliRIffl^^ 

CACGG(KCTTGC6aKXK^ 

CGCAGCXIAACCAiBtfra 

GATCAAAAAATTCTC<^TOXX^ 

CAAGTA(X3GCAGTGCAGMTGTCCAAAAATGTAT^^ 

CGGCGTATGCXjTCAKX^CATTGGTGGCGGAACGACCG^ 

AAATCGTTGAGAACACCTCKXjATGCCGCC^ 

CXSCGTCAGGITICCAGTMGGTTTTGGCAG^ 

TATCGTTCTGACCGGCGGTGTGTCCATGATGACCGGGATT^ 

CGCACACCGCGCTTTTCTACCGCTATC^^ 

C^GATTGAAGCGGTGGATTGAAAACAACCTC I 

DOKCLDAVVQDniDTQLGVREPIG^ 
I^TPIflAAmHHY^^ 

RTPRPSTMGIitHAACKLEGNIiPQPENGTVQER 1 I 

AAG(^GAGGCGGAGGTCACMTGAAATGTCCGAATG$ 

SEQ ID 7596 

VSGNFSTEYSIRQYVUiKM^ 

SEQ ID 7597 

AGGAAAACGATTTGATG<^GGCGGGCXXIGGCGGGCGGT^^ 

GCGGCAGATTCTGCTCGGCGTACTCXXXX3CCGff 

(XX^GGCTCGTTOCGGCG^^ 

(XsGCACT^TCGGGCTGCTGCTCGCCC^^ 

0GCC6CC(XX2AAA1!(KK2TCGCC^ 

ATATCGCAGGGTGCAAACATCATCGTCCTCATCTGCAT^ 

TGTTTTGGATGCTGATTGCCTGCATTT^^ 

CCAAACCGGCAATaGCCCGCAA(^CCGC^ 

SEQ ID 7598 

LGMLPGASIG 
AAGLV) 



ILTOAA^IFARMLYRG^FAPRn 



GCAGGCGGGATGTCGGCGGTTTTCAT 

SEQ ID 7600 

YPPAGI*PATISCnYCVRINA(^PPK t GIIASDSVPPAARRRI^ 

S fTCGG?rS 

TTACTCGATATAAG 

SEQ ID 7602 

MGPQrVGIISIFYIiAMSTOFVKTHLOMSFRRDFRLDSmyK 

^-I^GTA^ 
GCTTGCAGTTCACTATGTTGGGCTATGCGCT^^ 
GTCCGCCACCATCGGTACGGGTAAC^TCGCCGGCGTG^ 
(^GGCGIOTGGCGGIH^ 

cgctgttcggcacattccktt^^ 

OTGACTGCCOTGTCGTTO 

gatgcactgatgcctgccgtcmgctgait^^ 
aggcgggtatgggtk?pgcgcctattgccgcogcagc^^ 
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CGGTATCGTGTTGGTCATGOTCTGCT^^ 
ATTGTGACCATCGGCCTGATTTTC1TTO 
ATGTTTCTTCCGTCATGCCGGGTACCG^ 
CATCGTCAATGAAACACGCGACTTCAAACAAAAM 

SEQ ID 7604 

l^VFIiDNI>KAFFEWSSW^ 
EGVLAVmVNNSKGEMSGGPmiEKGLGKIMWMAW 
DALHPAVKItfFSBAFSAQAVAGG^ 
ITOGLIFFAYSTIUMTTCEKCAW 

SEQ ID 7605 

ATGTGGGATMTGCCGAAGCGATGGAACGGCTGACGCGCTC 
AGGTGTCGCTGAAGGGCMCCTGGTTTATTCCGATAAG 
CCGCTATCCGTGGATTGCGTCGGT^ 
GGCAATGTTTTTGAAGCCCGCTTGGA^ 

AGCGTGGCAGCATCTGTTGCXSTAAGMTAAAAATCGOT 
SEQ ID 7606 

MWDNAEAHERLTRWLLVMMA^^ 
QJVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTV1AK 

SEQ ID 7607 

ATGCTGOyVCGCTTCGCTAAAAAG^^ 
TT<&TGCCCAAAWX^PGTTTTGC 

CCACGGATACCGGCGOTAGGCTTCCTCTPGC^ 

TTCAGCGACACCTGCTTGACGGGCAGATGATTC 

CATTATCCCACATGTGCGGCCTT^^ 

SEQ ID 7608 

MLPRFGKKPEAFHLWPSEPIXjDAVVQTO 
PRlPAVGFIX^IDVRPQmPMDVmQTAQCLLIGIN^ 
SEQ ID 7609

TTTCCTGATCCGCAAAAOTMTGCCGT^ 

ACCATGAAAACCCCACTCCTCMG(X!^^ j 

SEQ ID 7610 

FPDPQNPMPSENRIPVFRRHDFIRPI 

SEQ ID 7611 

ATGCCOTCOTACTCGGCAK^^ 
AAAGCACOCTCGGCACAGGGCTGCCETTCATOT 
CGCCACCTACACCACGOTACCTACACCCCC^ 
CACCXjCCCCGTCCMGAAAGCGTCGCCMTGCCT 

tcgcca7gagcaacaccg6cgtcatcaccgccttcm 

cgcccaagacatccacgmcgcatcagctc^ 

gcgtgcx^cm(^cc(xxx^cx:atccggt^^ 

GTCttto^TCCGCCACCTGAO 
(mXXECTCGAAACC^ 

TACTCCK3CGTAATCGTCGCXTO 

CTCCACCTTCTTCATCAOCATTC^^ 

GCGGTCAGCFACCKnCGC^ 

OJGGCEACGACATAC^ 

CGGCTTTACCCTGCTCAAAACCGGCTACGTC 

G(XX3AACACACCGC<X^^ 

AAAGCCACATCCTCCTCCAACAGCTCCAAOT 

SEQ ID 7612 

MPFVLGIIAOSbVDIiK^^ 

HRPVQESVANAYEAIXSGYLEMADPro^ 

ACOTAQAIKSGKDmSKRLGRMEGCRQS^ 

TIVEAUJUOiGYWILLTALFVCQPNYTATm 

AVSYLWPDWXYLTLERTAALAVC 

AEOTAHIFQHLPDMGPDDFQTALD^ 

SEQ ID 7613 

ATGAAAGTGCTTGTTTTAG6TGCGG(?KJT^ 

TTGCCAATGCAGGCCAfiCTTTOT 

CGGCCTGTATCAMTCGMTGGCTGTGGCGGATOT 

TTTGAAGCGCAAACCGACATGAATTTT^ 

TGCCGTACCX3C0GTCTCAAGCC0GW 

CTTCACCGAAAACCTCTACAAATTO^ 

CGGTTTGAAACAGATGC^^ 

CAAATOCGACGGCGCACCGGTGTCCACTG^ 
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ACTGCCCGAAAMCGCCGCGAAACCTTGGC^^ 

CCOTMTCGGGC©^C(£OT 

COGAAATOOGCAGCGftCGATTTGAGOCTC^^ 
SEQ ID 7614

MKVLVKAGVAGVSSVWYLAEAGH^ 

FEAQTEMNFEGRKKGTLQIFR^ 

RFETDAVVCALGCFSRTVLAQLDLNLPI^^ 

PIJGRTRFENIiFLfiTGBGTI/ft'fT^ 
SEQ ID 7615 

ATGQGAATTTTGGCAAAGTGGCCCTAT^ 

CCTTCGACCCTM(mACGCCGTOTCCG^^ 

GGGCATTCCCTATACCGGCAGCGGTGTOGCC^^ 

GATACCGATTTCGATGCCGTCGAAGAAAAATTGGGTTO 

ACGAAGAATTGAAACACCTTCAGGGCGAAATCATTG^^ 

<X3AGTTTTAC(^^ 

GCAATCGGT(£GGAAGGCTGCGTGC(XX^ 
CTGCCGTTACGGGCGTGGGTTTTGCCGATTO 
SEQ ID 7616 
^NFGmVUIGGFSSERnSLD^ 
DTDFDAVEHCLGIjPMFVKPAAEGSSVGVVKVKEKGI^ 

AIGAEGC^VDFLKDTIX^YIiIiEITO ' 
SEQ ID 7617 

ATOTAGACAAATACTOXXX^^ 

CGCCCAACGTAACCGGCACGCTG<^CATGGGTCATG(^^ 

CmX^TCGO^CGCAAATCGTGGTCGAG 

(mCGATTACCBmGATGOXX^^ 

AGGGCTTGATTTACCXXXXjCAAACGCTTGGT^ 

GCTGGCCGACAATCCCACCG^^ 

(^TTAATCCT(XXX3CT(^CCG^ 

TCGGCAMCGCCACGACACGC(X^ 

ATACX3CAGGCTTGGACCGCTTTGCQKXXXK2AA^ 

TCAGTGATTGAGCCGATGCTGACCAGCCAATGGTT^^ 

TOACAOTGCGC(X5TGCGCOTATCCCTGAAM 

GTGCTACGACGMGCAGGTAATGTTTACGTC^ 

TGATGACCACCCACTTCACCGGCAAAGTACC 
TTTGATCGAflXffATGGGCTTGGA^ 
AGCATGGGCGCGGACGCATTOTOCTTC^ 
SEQ ID 7618 
MLDKYSPAEESOTQNWE^^ 
GTITQQmVGCSADTOEyPTMIX^mA 
ELILPLTGRTIPVIADEYVEKDFGTGCVKITPAHDFNDmG 
SVIEPMLTSQWFVAMSATPNGGEPDNEFKGLSIADKAm 
LVPFSTPIX^SETDELKAPLPSNVL 
SMGADALRFTMASYASLGPFG 

SEQ ID 7619

ATGTTTTTOATTTCAATCCGCTATATA1TTG 

ATCGAGTMGCAACATCX&TTTTGTCGGTAra 

CGAGCATTTGAGCAGCCTGGGCATTCAAGTTTA^^ 

GCGTTG(^GCGGCAAATTCCCGTTATTO 

CCTCCATC^^ 

O&TGCCTCmcm^ 

CACCGTATGCCCTTCTACGGC^ 

ACGCCACXXIOTtt^AACGTCGGCGCGCAM^ 

GAACGCATTGGCAG0CAKXX3(X}TKK^ 

TTGCCAAACGGC(mCC(KrrTTG^ 

AGCCGCA(XGCTATACCCGCA(XXXXX^^ 

CGAcrcxxxmxrn^^ 

TT^TAO^TGOGGGAAGCATCAACCGa 
SEQ ID 7620 

HFFISimFVRKLWCAlK^^ 

AliERQimPRALNLAELMRi™ 

HRMPFYGKAFLCVDSEHVRAILPKVSKmnG 

IiPNGGTALLVDDYGHHPVEMAATL^^ 

LNMGAGSINRVPSALLELSKQI 
SEQ ID 7621

AMMTATCAAACCATW&TCC^ 

TGCAGTTCATACCMACCGGCATAGTAACGTT^ 

CATOCXnTCGOTGCT 

SEQ ID 7622 

NNIKPnPPRHIIFFNQI^PPSPVPFLQLFFPSNCAI£^ 

SEQ ID 7623 

TTGCTAAAATATAAACCG^^ 
TCCCGCAACKTK^^ 

SEQ ID 7624 

LLKYKFQMYKTLlflYI^ 

SEQ ID 7625 

TTGATGAACACCGAAAACCAAGACTG^^ 
MGCCTACGAMCCTACCGCTTTGATTTG^ 
CCGTCAGCGCGCCACACGCCATAQyi^ 
(XCAAAACCGCCGACAGCATCATGCTC^^ 

AGTCGCCGCCCTGCXXXsAAAGCGGAGACGC^^ 
AAGCTGCAAAAAGCCTTGGACAAACTCMTGCCAAACT^^ 
TOGCGAAAGTCCAAA*^^ 
SEQ ID 7626 

UffiTENQDCGYGATAAEPRGHSFPDMOTIGRI^^ 

MTADSIMLARFPETOGEIVQTAFG^ 

KLQKAU)KLHAKLSKPGYTEK^ 

SEQ ID 7627

ATCGGCGGTAAAAOTTTATGCTGAT^^ 

ATTCGATGGMGAGCGCATCGTGCCGCAA5ftCGG 

COKXXXXyU£CGCAGCG^^ 

<»C^<mAA(XXXX3TG^ 

«XXXX&TAmGCAACCTGCC^^ 

GGCC?TTG(K^CTGCI^ 

ACGCCC^^TGACX^TCmCCOrCAAC 
CCGCGAAAAATGCCTCAMTG<&CGGAAAA^ 

SEQ ID 7628 

MGGKTFMI^GGTGGHIFPAIAVADS^^ 
HEQlftVAGLSNKHLSRWAKRVLYAFPRAFSHEGG^ 
PITI^SAYREtADLVICRAGALTIAELTAAGUlMJjVPyPHAV 

SEQ ID 7629 

GGACCTGAGAAACOTTOACCCGATAAG^ 
TCATCATTCCTACAAAAACAGAAAACCM 

SEQ ID 7630 

GPEKPFYPISPRTDRKjFPPERE*RRSGFCPFR*IPKTQNPII 

SEQ ID 7631 

TTGCTGTTCTCCCACGTOCTAGATT^^ 
AAACCGGCCGCXXXXSGCGGGTCAGAAGAAGAC^^ 
SEQ ID 7632 

SEQ ID 7633 

ATCAAACGCTTTATTCTGCCGG^ 
ATCTTCAGGACGGCAGCTrcCGTG^ 
AAGCCAAATCAAGTACGGTGMGTTTCAC(^ 
ATGGACAAACCCGGAAAAGAm 
SEQ ID 7634
MKRPIimiiSATTAPASPIVGTOHaOT 
MBKPGKDDVYQIi)KSGKL^ 

SEQ ID 7635

reraxsra^
(mC(^TCaCACTC^TGCTCGAC^ 
iSGCTTCTGTGIATTTGGCA^^ 
AGGACATGGCX3<XKKjCTTGTGCCG^^ 
ATTT(XAGCCGACKXjAGCTGTTC^ 
GACGGCCAACCTGATTATCTO^ 
ATAAT1XSIACAGOT 

GGATGGTGOT^TGATTATCGCCGCTCCC^ 

GCGCG(^GAGTGGTTCGGTATGGGTTTGG(n^^ 

TGOGTGCTSflTATTCTGTTACGGCTGGCTQG^ 
CTATCCAAAGTTTCT^

MHHHLVWIFMtt^ 

IMVQPDFGSFWITVI^^ 

CVIJFCYGWLVVRAFSIGKQSRDLGLTFKAYIASGIGIW^ 

S GTG^S 

TMTCACGCATCC(K5GCA?rC^ 
CAGGCAGTACGAATGGTACGGAACCGGTTCXCTO 



oca if) 

TflmCCCTTff^ 

GGGCGGOTCGTACTAATCCAATCMCGGCATT^^ 
OTTTTGAGCAACAACOTATCGCT 

S ?Sgtcga^^ 

GTCTGCCTTGGGCGGGCGOGGGTCGCGGCXSjGGC^^ 

Ce<m«XXm&TT^ 
SEQ ID 7642

LVEIPNKRFKTCRCRGSHIQI^^ 
SEQ ID 7643

GMGAAGACTTCGCGCCAGOT 

(XJGGCXaGGTTTOTi^CTCTlTGM 

CATATCCGTTCGGGCAGACGGTTTTGC(XOT 

ACX3GTCX3GTTTGACGGTMCGCGTTGTO 
CGA^CCCGCCTCCCGTTC 
GCTCGTACCGAAGATGACXjACGCGTTTGI^^ 
AAATCAAAGCGGTACATATOCCXXXJCCGGTCGCCGGCATAG 

CCGTCGATTTTGCTTAAATCCAAGGCATACGCGC^ 

CGTCCACGCX^GTAGCGGTQGCCCACATAQCOT^ 

OTCAGAlTGTAGCTGCGTTCAra 

CCCAAATCGCGATTGCX^TTPTC^^ 

CTmTTCCACTCGTO 

CTGTACG(XXX^ATCCAATOGGATGATTGTT^^ 
TTOm^ATTTTG^ 

TCAGTGCCGGCGAAGGGGCTTTGGTATTATTATTGGGG^ 
ATTACCTAGAAAAAATCTATCCAGtf^ 

GGGGGTTCCCACTATTTTGCTGCGTCA^ 
ATOGAC^TTTATAAATGGGCGAATTGTCGC^ 

SEQ ID 7644 

ekfWafotah^ 

RAS^QCMVGGIDAQNGGLCAAFVITRI^ 
ARTEDDDAPVCQSGN(X»GSNRLVAm 

TmwnriTnfflirTmmM tmrTDn on » vTTnroTtvMvewT jAn.wrwfc&nnKAMTHAVAVAHIAFFGKGAOLFGE 



M IVAAFITMVFFEI»ffl^ 
I,YAAIQSDIXn ? PPHETRLFTFNIILI 
IT*KKSIQPVFPSFRQFDA^^ 
IRQFINGRIVAAFVFVyiQWTACLSyj 

AGCGCXTWGCGCAAATCGGTAAGATCTTTGACGG^^ 

(X£GGATATCGAGGCGTTCAAGCAAA^ 

ACCACOTAAC^GOTOTCGGCT 

CTACGCGCATACCAAAGCCGAGATTTTCOT 
CACGAAGCC^TTTTTGGTTGGAACGCGGGACGGGC^^ 

gcgtoaaagat; 
AGACGGCATACGCCCMGCCXiAAGCGGGCGAT^
TTTG 

TTVmVGYIOKGGUH^^ 
HEADFWIiBRGTGCLKQGKEDLISTQDIPLQGLHK^^ 
ALKDKAKGVFLIGVDAPQIRRDLI^^ 

SEQ ID 7647 

TOTOTTGTGCGOTATCCATCGCOT 
TCGGGGGGGGGGGTTCGA!PGCAGTT^^ 

SEQ ID 7648 

LFIiCGIHHVffl)SDKCSD^^ 

SEQ ID 7649 

GTGTTTCTTCT6ATTTTAAAATTATTTCT 
TAGCGTATTTATAC(mTCCGCCATa^^ 

SEQ ID 7650 

VFLLILKLFVWVl^SLSPFGM 

SEQ ID 7651 

TTGGAAAGTCATTTTGGTTTTGTCCT 

TCATTGTa^GCK^^ 
OTGGGCATCCCCTTGTTC 

SEQ ID 7652

LESHFGFVLKQIILGRR^ 

SEQ ID 7653 

AjC^TACCGCACAAAAA^ 
TCGCCCGMTCGCAGGA(&A^^ 
GATACGQXXX5TAMCXKICCACCTGCCCGAAG6CATA 
. tcqxxxXXXSTCCGTTC^^ 

AO&AACATCATGGGCGCGCTO 

CTTTAOTCACCGGCGTCGAC^ ' 

SAPVRSPIJWffiAGtfmELDIGPMRRAAALLVG^ 
LYLTGVDYPGAYGIVRPQIPBfJL 

SEQ ID 7655

AAAGCGGGATGTCGTCCGCACAACMGCA^ 

TCAGATCATTTTAAAAATAAAAAAAAGrTACCCA^ 

AAGCGTAAAGGCATCTTTTCAAAAAAT 

SEQ ID 7656 

HQKKRHSHRHIiFIPrS^^ 
KRKGIPSKH 

SEQ ID 7657 

TlOCATTTnftCAACACTTACTCGGGAGGOT 

SEQ ID 7658

IflFYlfFYSGGYWRALQTGGYSGLTKTGTALPRPDPKGTVP 

SEQ ID 7659 

ATCAAAGGCGATATTGGCGTAATCGGTTTGGCGGTTA^ 

OTAA ACGGT®mAAAAGM3ACG^ 

CGACTTTGTOGAACAACTGCTTCXJGCTG^ 

TTTGTCGGCXXX5GGCGTATCCGGCGGCGAA 

AAACGCCGCAGGGCGAACCGTGTTGCGACTCGGTC^^ 

GTTTATGAAAGACGGTTTGGGGCTGTCCTA^^ 

GACGAWSGCamACOSCTGGC^ 

CCGTATTCGCCXGCTGTGTTTCG^^ 

GOCGCAAQGTSGTraXMGQCGGTOGAA 

SEQ ID 7660 

MKGDIGVIGIAVMGQ^ 
FVGAGVSGGEGARRGPSIMPGGDm^TO 
DECGEPLAEKIIiOTAGQROTKSfrGIHAIJ)^ 
aiRSAFLGNIIEAYEA^ 
jjiCGCTCCGGfcAGIGCCCCGGCM
SEQ ID 7664 




SEQ ID 7665 „^,n,^mwiwMim3iim^ 

(SIGGACQMMMCMGCTORCOGCQCTCMMOOGGSCMQCMTGCGT^^ 

CCCGGCGAGCGGGTO^GCTOmTOGCm^^ 



GCGA 

SEQ ID 7669 .^.^^uraatscamamk 
SEQ ID 7670

LGKSLHVNIiQKPSEfLPDSKQ^ 

MGAGCTKXXXTO^^ 
G(mATCAAGAACAAAACGAGCTGGAGGG^^ 
TTGATGTTGGMGAGMTTACACGCATACTTTGAC^^ 
ACGMCCGTTTTTCGTGATTGTCCCGA^ 



LMLEENYTmTESOT^ 
SVLPATAI/TElimFSIIPFEGTPP 

TOTCGCGCGMGOGCGGCGOG^^ 

•lOTATATCGGGCGAACCXSra 

(mCWTCGGCTGGTCG^ 

S mpSlYA^IHLYCATAFV^ 
LTVGWSKYIHAVVFTBMLLIVPIiAKAMro 

GCGGCGGCAAAAATCCGGGGAGOTCCCGATTTO 

SEQ ID 7676 

VKMKSRRFFKALLLIAALVGAFYAG^ 

CGACTTTACOTAAAGAGTTCATGGAM 
GCCGGAACAGAAAAAGGTA 

SEQ ID 7678 

MSLIELLFGRKQKTAWARDRLQII ^ 

AGGCACGCAAAAAACTGGCAAAACGCXKXIATO 



AAAAAACCCAGimTCCQCAAAAAGGTAAAAAaGCOT 

AGTTTACGCCAATATGGGCGCGGAGCTT^^ 

TTCCTTAAATOAAGGCGCGTTCGACT 

CGGCAACGCTGTTTGCCGCCGGCGTGCCT^^ 

TCTCTCTATCACTTCGGGGATGCGTK 

TACGAAGACGAGGTGGACAATGCGGTCGGCAGGCTGTC^^ 

ACTTGGGCAACGTGGTCGCC 

LSRAPSNHPKOTHFYCNLVAAGETGGVLESLIiDKI^^ 

FLRLKARSIKIQRRMD^^ 

YEDEVmVGRLSAHMEPIIIVIIfi^^ 

S ^TCOrarc^ACGACAGAAT^ 
CCGAOTCTCCCAMC^^ 

AOCGOTCGG^^ 
TTGCCGAAGAGCTGAAAK^ 

MCMCTtt^AGAAAAMTC^ 

CGAATTTACX}6AT(XXX3AAATCGT(XX5CAQC^ 

ATCAATGACtX^TTTCACX^ 

TGSGKTTQLPKICI£I^^ 
LKVIITSATIDAERFSKHFNGAPVI^SGRTYPVEILYRPLTC

KCRIVIATNVAETSLTVKa 

DJDGFQVLIiELGAVEW 

SEQ ID 7683 

TTGATGGGAAAATGCCGTCTGAAAGGCTTCAGGCGGCAra 

TCT^CGICATTOTTTAAAGGTGGGM 
A 

SEQ ID 7684 

I^KCmGFRRHFOAJALGKMSI^^ 

SEQ ID 7685 

(^<&TTGAACCATCA^^ 

ATAAAtfTCGGCATmAGAAACCTTGG^ 

AAAAGAAAAAGTATTGGMTTCTTTACCGGCTACATOT 

CGCGTGCTGCTGTACGGCGTATTGGGCGCA^^ 

ATACTGGTATACACATGATGMGCTCGMGGCGA^ 

TTmCCETCGAAAM 

GATCCXjTTTATCGTGCTGACTTCCAA^ 

TGAGTTTOTCGGTGTGAAAATGCT^TAATGC^ 

RVIJjYGVIiGALVLRTVMlFVGAALVRQFEWILYLFGA?LLYTG 
DPFTOTSNIFMLGLRAMYFLIJtf^^ 

SEQ ID 7687 

(HOTTTIACCACAGCAAMCAGGC^ 
AAGCTTCCGATGCCGTC 

SEQ ID 7688 

VFYHSKTGDKIUSRYRCVRRPNINESKYETTGPKF^ 

SEQ ID 7689 

CTGGCAAMAmTTGTAGTAACTra 

TG&3TTTGCGGAACCTCGACOT 

AAACCTGTTTATTTTGCCGGCTTCCC^ 

GACTCTCCTGCTGGTATTGAGCAGGGTGCATTGAT^ 

TGCAAAGCAMTC(XX^MGGCGGAGCMGGCGGTTCGGTTAAAG 



GTTATTGCCCGTOTTTGGGCGAGAACCGT^ 
SEQ ID 7690 

DSPAGIEQGM^YFAI^^ 
VIARLLGENREHRFIiEAKKKSFFKRLFGG 

S mga^^ 

TCAAAGAAACX?TTGGCATGGAGCG6CTTGTGS 

ATTGGAATTCTTTACXX3GCTA(^TTTTGGAAA^ 

TACGGCGTATTGGGCGCATTGGTATTGCGCA^ 

ACATGATGAAGCCCGAAGGCGATGAGAAAGAGGATTTGGCAAACAG^ 

AAACG(XIAAAAAAAT(XXXACGCCGCTG^ 

GTGCTGACTTOAATATTTTCGCTATTT^^^ 

GTGTGAAMTGCTGGTAATGCATTGGGTGCATATCa 

TAAA 

S WreTOIGSP^CTPMVLVMIALW 
YGVLGAIiVLRTVMIFVGAALV^^ 
VLTSNIFAIIiGLRAMyFIiIjADVAW 

SEQ ID 7693 

ATGOXnraSAAAAC^ 

TATTGrcTGCGGGCGCGGCGGCGGTTGTGGTTKX^ 

TMTCTGAACOTTTTGTGTTCGGC^ 

AACTTCAACAACCACATCGGATTGCCGCTGACOT^ 

CCAMCCCGATGCCGCTTKXnOAC 

CATGGCACTGATTCCTCMGAAGATGCCAATATGG 

AAACOTTGTCGTGCGAATTTGATTTG^ 

OTGTTTGAGT1T(^C^ 
TOCGACAGTATGAAAGCCGCGGTTGACGTGTTO 

SEQ ID 7694 

MPSEflXPVSRIWDSRDIREGDVFFJUA^ 

NFNlKIGIiPI/rilJajN^ 
KPLSCEPDLVCGDEmVVLPVPGRHKVH^
(&YARDQGIEAAYFVOT^ 

SEQ ID 7695

TTGCGCGCTAAACTTACAira^ 

ATTTGGAAGATGTGCTGGICA&M^^ 

GTTGTOTCAAGGCATGGTATGCAAA!^^ 

CTGGGTCAGGTTGAGGTGCAGAMACGG^ 

CGGGGGCGGTCAGCCAGGGGGCGGMTTGATTGCA^ 

CK^TGCAGGCGGAGTTGGTTTCTG^ 

AGTGCAATTG6CTCAGA6 

SEQ ID 7696 

IJlGKLTSLKNSM^IHNAFDIKSTKM5VLSISLHTSDLro 

LGQVBVQKTED6QKAMCTVLITC 
SAIGSB 

SEQ ID 7697 

GFCGGraGAGCAGOGGftOT 
GTTCG6AAA6C6CGGCATTTCGTC&A 

SEQ ID 7698

VGRAADF*SVGRAFESLTTHQIKQRS*VFKTRIiFLSSDN^ 

SEQ ID 7699

TTGMTTGGTTGACAAACC6GCTGCTGAGTAAOT 
AAATATTAATTGAA 
SEQ ID 7700 

SEQ ID 7701

AAACTCGGCTTTTTTTGTCTTCCGACM 
TCATCGGGTGATGCTTTATGCGTTGTCCCGAGTATO 

SEQ ID 7702

KLGFFCLPTILVRKARHFVKFFGAASVFPPKimSAa 

SEQ ID 7703

AIKSAAACAATCCGCCCGMTAAAAAATATGGATCAGACIA^ 

AATAOGGCTROCGTTATTCTCCCGTGGG 

(OTATATTTGCCGGTCGGCTGGCTGTATGGT^^ 

TTTGTGCAGG^TTATTTTTCAT^^ 

TGACTTTGATTTTGTCCTOXXXK^ 

GGCACAATATGCCGCCAAGCGCGCCCACATT^^ 

CGTTCGGATTATATCAATGTTTACGGTTTCCCAT^^ 

TTTCACCTATGCCCTACGCAGCGATTATCCGTGGTTrA 

GCAGGCACGMGCCTCGGCTGATTGTGATGCACCTO 

TTTCCACCATCGCGCAAACCXaATAAATTOT 

TTTTTACGCG6ATTCGGCAGTT66ACGGGTATCGAM(XX^ 
ACAGACAGTCCGACGACCCCGCGCCGTGGTA^^ 

SEQ ID 7704 

MKQSARIKNMDQTIOT^GICALUFCFGAAIASG 

FVQALFFIFGLTVmCVSVGWADVKNYKRRSKI^ 

RSDYMNVYGFPLPETPFLSR^^ 

AGTKPIOiIVHHIIWSH^^ 

FlftGFGSWTGIETDHiPDDGYDFWGMVPD^^ 

SEQ ID 7705 

ATGCGTCATCGTAATGGCAATCGCAAATTAAACCG 

CCAAGGMTTGCGCCGTGTAGTAGAGCCGTTGATTA 

CGATTTGGGTCCTCGTTTTACTGCTO 

GAG 

SEQ ID 7706 

MRHRNGNRKLNRTSSHRAAM^^ 
B 

SEQ ID 7707 

GTGGATAAACTGAAAflTCTCCGCAAACGGCC^^ 

TGAAAAATGTGOTATGCTGGCAGATGTGAAAACCACGCAAMG 

TACGTGCGCXaCCGTATCAGCTGGTQGAACT 

ex$omxxxoc(£AAAA(£TGCT^^ 

(X^GAAAATCAGCGGCATCGGTAOGTCCACAATGATTG^^ 
GCGATAACCGGCGGCAGGGTGGTTTTCCGGAATGCTG^ 
ATATGCGGCAGCGTCCGAAGGCGGTGGACAfKXXX^ 
gQAAACGATTTTTGAAAACCGCTTTATGCACGTCC^ 
6TCAAGG(X^CGGATTTCXX3TGCGTCCGCAAGCC^
AACTCGGCAGCGTCGGCGCGAAMTCGAGCGCG^ 

SEQ ID 7708

VDKUaSANGPLIWEITVS^^ 
RFVDQHLKGIiEAMGAEIVIEKGY^ 
MTGGRVVLRNAAPKlMEVVLDiajVE^ 
VKATDLRASASLVBGLAARGETVVEQIYHI^ 

SEQ ID 7709

ATGCAGAATAGCACAACCGAATTTTTGAA 

ATGCITKjCGCOGTATCTTACTGT^ 

TTTQCTGAATATTAAAGGTATTGTGTO 

GAAMTCTGAATOTGGTCATGT^ 

ATGAGAA(XGTCAGATT6GT6CAATCCAGTT6G^ 

GGATATCGAAACCGACGGTTCTATTGATCCTG^ 

GAAAAATCAOTCCTATCGACCCTGTTCTTTra 

GCACTGAAACOSAGCTIOTAAAAaXXm 

A6GCTTGGAAAAGCCT 

SEQ ID 7710 

HQNSTTEFIiKPRQIBVmTSATRA^ 

EILNPGHVICHLADNGQIEMEIKTC^ 

EKSPPIDPVLLHPVDDLELTVRSAN 

SEQ ID 7711 

ATTCACTATACAACTACCCACTA(^AAAGAACAAA 
TGTGCGCGGGTTIX£TGACATC^ 

CAAGTCAGCCTGCtfXXiCGGCTGCG^^ 
AACTCAAAGGTACX^GCGTGGCGATGGATGTCOT 
AGTGGTCGAITTGGCGGAATGCCPGGIOAAATGGGCG^ 
GACCGGATreMGCGGGGACGTTCCTGTGTO^ 
CGGTGATTGAGGCGGGCGACGATTGGATCGCC^TC^ 
ATTGAATGCCGTGGCGGAGGGAAGCTGCCGCGTGGTGGAAA(^ 
TTTGTGCAGGGTGTGGAACGGCTTTCCGGCGC^ 
ACCAC 
SEQ ID 7712 
IHyTSTO*ERTNVDKLKISAK6PIjlJGEITVSGAKK^ 
Q\raiiP(XKMGS!U^HUCGIiEAHGAErVIffiGY 
DRIEAGTFLCAVAITGGRWLRNAAPKTMETO 
PVQGVERLSGAVVKATDLRASASLVIAGLAARGETVVEQIYH 

SEQ ID 7713 

AKTTCAGCAAGTTMGCCCTOTGGCTGAAAC^^ 

CAATCAAGGCATCAAAGATTTGAAACACCGTGC^^ 

ACGCAATGGCTGGCGCMGCCGCCGATTT^^ 

ATOTCCAAAecCTGCTCTACCGTTTCCff 

CACCAACCTCAOXX^GACC^CCTCGACTAra 

TACGGCXXXmCTCCXAGGTCGTCTGAAA^ 

TGGMGCAGTATTCCAAACCCCGTGGGGCXaAAGGAAAATO 

TGATAAGGTA3TGGATCTQCTGGCyU^ 

AMGCACTCTCCACTTTG^ 

GCGOK^TAAA(aKXmXSAACa^ 

CATCOjTTATGCGGTTGAGCAGGCCGCC^^ 

GGGCAGGCTTTGTTAACTOGTAAA 

SEQ ID 7714 

MFSKLSPIAETGIPTLSC^MGRLIiHSDSRQIKOGDIPV^ 

TQVMQAADLLGEKTAII(nVGNGFWGALEETAHTTPA 

YGAELAGIUiKKIXPDlAWSYGFSEHADIROT 

KALSTIGEIKPQGAAL^^ ' 
GQALLTRK 

SEQ ID 7715 

ATGGCACGTTATATTGGCCCTAAATGTAA(n t TGGCAC^^ 

ATGGTGCAAAAAAACCGCGTTTGTCAGA^ 

CCGTAMKTTCTACTGGCGAGTTGCTGTT^ 

GCTATAGmTAAATGGACMGTTGT^ 

CMCTCAMTCGGCTTGCCGGGTTGGGTTT^^ 

GTTCTACTCEAAA 

SEQ ID 7716 

MARHGPKCKIiARRE(m)IiFIiKSAia^ 
AIVVWC^VVmPSFQVKAGDVVSVREKAKK^ 
SEQ ID 7717

ATCCGCGCCGATATGAACGTGCttnTOtf^^ 

SEQ ID 7718

MPSEAFRKHQIiFLYRKHRLKRT^ 

SEQ ID 7719 

ATGGCTAAAGCAAACACAGCTTCACGTCn'AOT 

TAAAAATTOtoCOTCGTOT 
(K?TOCC(?FCCGCCTAAAAAACGTCOTAIT 

SEQ ID 7720 

MAWOTASRVRKKVRKTVSBGIVHVH^ 
GCRPPKKRRI 

SEQ ID 7721 

(niGGMMCGQCGCGTC 

ACGTGAAAG^fTCAACGACTGGCGCGAAAACAAAC^ 

AGCCTACOCTTCmGTGCGACGT^^ 

(X^XjGCGAACTCGACGCTTTGGGC^ 

TCGACCAACTCAITC6TCG6CGGCGGQ 

AATGGCGGCTAAAGGCGGOIOTTACCGCTGCCGAC^ 

TTGGACATCGGCCCGAMTCCGCCGOCGCGCTTGC^^ 

COTKXXmGCCAmCC^ 

CKGTTOTTGAOT^ 

SEQ ID 7722 

VDNGASVIVMmGmEGEFOTEDDVAPVAAH^ 

AGELDALGKALKQPARPMVAIVAGSKVSTO 

t£I6PKSAAALM)IiLKAADTVVHirc 

SEQ ID 7723 

ATGCTGCCCAAAGAAC^GCAGGTCAAAAAGCCGATGACCAGT 
CGGTAACGTATMOTTTTGAAAGAAWGGG 
GACG6AGTCCCTGTTTGCCGTGCCTAAAGATATGAAGGAM 
AAAGGCAAGTCGTTTATTTGGATCAAGCGGCAGCTC^ 
GCAACCTGTTTQCACACGTCATC6GATW 
CCGGCAGGGCAATATTGTGGACAGCTTGGAC^^ 
AAGG(XX3TCGAATACCATCAGGCAAAAGC0GGAACGG^ 
ACAGCGAACAGCGGCGCAACCGTGCOGTAACCGATO 
TACGCAGOCTTATAAAATCGGACCGTC^^ 
CGTTTCGGCXSCCGAAGAMTGTATGACTra 
TCGAACAGGCSACGATGTCTPTCGGTEACQGTCTGC^ 
GGTTGCGCCGCMGGCAAACGCATATTCAAAGAAT^ 
GTCGG(X^AAAACCGGCACGGCGCGCAAGTTCGTCAACG^ 
CCATCGACGAACCGACTGCXX^GGCTATTACG^ 
CGCACXXX3TCAAAACACCGTCT 
SEQ ID 7724 

MLPKEEQVKKOTSNGRIS^^ 
KGKSFIVrnCRQU)PKVAEEVKAIjGLENF^ 
KAVEYHQAKAGTVVVLnARTCEILALANTPA 
RFGAEEl^FYHELGIGVBMHSGFPGET!^ 
VGAKTGTARKFWGRYAIMHVATFIGFAPA^ 
SEQ ID 7725 
ATGGCTCGTATTGCAGGGGTAAATATCCCTMTAACGCA 
Cr^TACTAAAGC^AGAm^CmGACTC^ 
GGACATCGGCTOTATCGTGGCTO 

SEQ ID 7726 

HARIAGVNIPNNAHIVIGIfQAIYGI(ft^ 

SEQ ID 7727 

GTGGCTAATCAACAAAOTCAIICAGG^ 

TTGATGCrcTTOOTTAGCT 

ATATATTTCAGCTTCTATTATTGTACAGC^^ 

GTTTTOTAGCMTTCTTCAMGTO 

TGTTTCTllATGTGGCTT^ 

GACGAACCAAGGTTCTATGAGCAWjOT^ 

AMCGCCAGTTTMTGGTAGGGCGGGTAGTCAAAGTACG^ 

TAGGTTCCOTGGTT^^ 

TTATAOGGCTTTGGTTTTTAGTOT^ 

CGTTTGACATTGTTTGGAGCTTTO 

TAACGATGGATTTTAGTACACAAATAAATTO 
KRQFtKRMSQSTHMPFKIjNMGVIPPIF^IIbFPSTIiIiGy^SA
RI^GftmmOiIPEPIffTIIBVPFYLGGimiLVV^^ 

SEQ ID 7729 

■pTGGGCAGCATCCGGGGCTTAtATTCGCTOT 

SEQ ID 7730 

LGSIRGLYSUiIHIF?SPJYYPDAGIRFRHTGCFYL 

GCAT 

SEQ ID 7732 

I^SPF(3CSGYI^CVQPVAFQLGFKVF^ 

SEQ ID 7733 

(MTCTG(^mCATMGGTGGGTTTC(aG^ 

ACTGGMTCMTTGCT(7ITMTGAQATTGATATTTTAGTOT 

TmAaaSTAraAAAGTTACCAAAGGKXSAm 

S |^? Q KtHAGK^ 
LKGIKVTKGARAAIEDVGGKIB! 

S AmCGCT^ 
ATATAAAAGAATTTATGTOnAACTATACCAACCATCCCGlTCTCOT 
AMTTTATTTGGAGATGATAGGGMTATTTCAACATCAAATC^^ 
AATGACXXTGATGGTAGATTTAAAGAAATAAATGATAAGTTCTTTGAAAATATGAGGGCGGGAAA 

AAGATMTICTMTACAGAAmTCTGMTTAAAAAAAGCGAGAAAA^ 

TMTGATATTCTTTTAAGCAACCCm^ 

CnMBI«BCCCMC^^ 

jyCTCTCCTKXGCTAATAAAATTCGTAGGM 

TAATAAAATCAAAACAGATAACTTCTCACAAATTTATAAAGCTCTTGCAG 

OTAJUMTAMGTIMCTTTiCOGiAOMMQOACA^^ 

q^toATTTCAAAATMTTGAAGAGGAAAATTCACAACTTCTAAAATT^ 

AaTAAA(3AAGCTCTACTAAMCGCCAACAAAAAGAAAAAmGA^ 

AWCAGACACATATAAAAACGATGCGATTGCTTGGtWAATATGGM 

GACTACTArGTATGrTCGATrATAAAAAATATGCGGATACTGrC 

Li8A0U£ifiiMPiinx^»vGDmmLmmvEQ^ 

GNIKVSFreOffiraJlMSIlKVW 
ISOTraTOAIAVKIWHHQLSEDBVmi^^ 

^TOAGTCGGAGGAATGTGATGAGIft^ 
TCAGCACOGACATCGACGTGOTAA&CXyUAACATTOACAACAAAA 
GCGCATTTCCGlTACGCaiATTGCCCAAATCGCGGCGGCG^ 
WTTCCGCGOTGGTGCAAAAAGGTATGTCGCCTT^^ 
CCGGTATCAATATGGATGCGGTCMGCTCGCAGGCGAAACCATO^ 
TCCGTTTATGGCGGGTGCGTTCCACGGCTCGGGCGAAGCX^TG^ 
GAGGTCGKS^GTCGTGAAGAAAACCGCITOSAAATCACXX^^ 
CGACCCC<mSTCGGCGACTCG(3T(mC^T^ 
CATGATCGCTTCCAGCG(X3GTCGGCGGTTrGAGCGGC(XXnWA 

fta «racrccTccGmGTT^ 



GGGCGGCAGGAWOCCSQiCCGGTTCAATCGATGAAAAAC 

^VQKGMSPSDEVURS^^ 

EVAEVVmAraTRVt^IG8EASKMJaPFGIHH£L^ 

TAVCSVGimAVPGOTPAHWSGIIADEmtMDJSKmVRIIPVTC^ 

SEQ ID 7740 

miDGLHVFVSRKABAKAVKQQIPQCIKTDA 
AAGMGC(^MCG(^GGTTTGGAAGAA
SEQ ID 7742

ACGGAGGAAAOTTTKX^TTTPGCGCATAT^^ 

CATAACGACCAAATTCTTTA6CATTTC 

GCGGGttGTAGGGACAAGATAGCCCGTTTGAW^^ 

XGSRDKIARIiIQJAVPPMyTEKIGRAIVDNIBC 

TCOOTGGAGCAGGATOTG^ 
TATGGTGGAACATCAAAGA 

SEQ ID 7746

MNKSNFFHiIAVCV^ 

TAGATACCTCTGAAAACCGTGGTATGATTM^ 

SEQ ID 7748 

MAEQKKIKVTLVKSLIOTESHRACARGL 

GGAATGGGCGGCGGATAAACGACGCGGGCX£GOTAA 
CAGOGTCAGGGAAAGAAACCATAAGATTTT&CGGATC 
TAGATTCCCTGCATTTGGGCGGGCGTCGTTGCCGGGTO 

ccAMCCGTCrramcr^ 

CCK3C6ACTCATTGAGTITCCCCAACACGCTO 

cgcagmgctgttoagcgcggt(x:(xxx^ „ : 
S tofeae^ 

QRQGKKP*DFTDEFVDLVQ(^YPrRIHVGGFDHRQFGWPVrc 



gtcatcx^agcaaaaccactacaggctcmcttcam 
acgmgcgatcagggcggcggcagaaaaacaaaacct^ 

aggataataataacgagaagtaaaaatgttgat • 

RIIITRSKNVD 

S ^mGTC(^OTAACCGOTMC^ 
AAGTG(X^TTGCAGTCCAAAAGGCGATGGATCMGC^ 
TATGCAGCCTGCTAAAGAGGGTAGTGGCGTAAAAGCCGGTGGAOT 
ATCGTACGTGCMCATTAGATGGTTTGTCM 

o&5»Csl fiU 7754 ^ ■■■■■p_^», hmmhuhmi wiii > i iM »m*iQ jm^^^j^yy^QffmpYS 



IVRATLDGIiSKUriTADIAAKRGLTVEDILGVNHG 

S ^^i^ottcgagag(k: 

ATATC(&GCGCGTAGGCGC£GTCGAGGTTGGCGGAGG 

CGGGATCGTCX^TTCTCGTCCTO 

GTTGCGGTCGTAGAGGATTTTGTGGTCTTGGM 

CTCGGCTTGGTGATGCECGAAAT^ 

MCCGCCGGGATAGGTTTOK5AMCTTGT^^ 

VAWEDFWLEHDA^^ 

GTGCTGAAGGTGATAAAGTATTGGCTC^G^^ ■ 
GGOTAAAGCAGCAGGTGTAGAAAAGGTTGCTTT^ 
SEQ ID 7758

MDKffTTRLRRARKTRARIADOT 

SEQ ID 7759 

GTGAGTGGAGCAGAAAGTTACCGGCATATCMX^^ 

GGCTGATTTTGTCGCGTTTGGGCGATGC^^ 

CGGTTTTGCTTCGTTTCAGACGGCAT^^ 

CGTTTCGATGCCCC^^ 

GGTTTAGOCGCCAGATTGTGCGCGCCATTGTOiCG^ 

TCCTGCGACGCGCACCTTCCAGGCAATCTO 

TCGTGCHKAGGGCATTAAAGCCGGGTGAAGCGGAA^^ 
SEQ ID 7760 

VSGAESYEHimLNEAVDALAV^ 

RFDAPIJ3MRMDTTR(34SAAEWUVASE^ 

AFHSLEDRmQFVmSQHBPLPSWAAVRBADIiPDPP 

SEQ ID 7761 

ATGTCACGTGTC6CAAAAAACCCAGTGACGGTO 
TAGCCATTGAAmMTGATGGCiU^TTO 
AGGTTTTGAGMGAAATTACAATTGATGGGCGTGGOT 
TCCGTOWCTCCTAGC^ 

SEQ ID 7762 

MSRVAKNPVWPAGVEVKFGTEALVIRGKNGELSFPLHSDVM 
SVQTPSQTEIVLTGSDKQVVGQVASEII^^ 
SEQ ID 7763 

TTGGGGGATGTCATGTTCGGCGGCGCAC^ 
CGCTCGAGTCGAAACATAAGCTGTTGATOT^ 
GCTGCATAACGCGGAAATTTTGGMTGGGACAGOT 
GAGCTTTGGGGTCGCGA(X^GTGG(^ 

SEQ ID 7764

LGHWGGAHELSIDSKGRLAVPAKFRDIL^ 

ELWGREQWEAEMVQALDIH)PDELAFQLSQTDLQL

SEQ ID 7765 

ATGACTATGCAT6ATCCTATTTCCGATATGTT6ATO 
MGMGMGGATATATTGAGGACTTCGCAGTTra 
CGGTTTGCGTATTTATAAAGCGTCTAGTGA^^ 
GGTGGTC^GTTGTTATGCATTGTAGCC 
SEQ ID 7766 

MSMHDPI SIHLTRIENAQRANKAAVAMPSSKLKC&IA^ 
GGSLTjCIVA 

SEQ ID 7767 

ATGGCTAAGAAAGCACTTATTMTCGTGATCTGAM(^^ 
GTTTTGAGGCTOTTTGAGGTTTCMTC^ 

SEQ ID 7768 

MAKKAIJKW3LKRQALAmAAKRAM 

SEQ ID 7769 

TTGTTTATTGGTTCGGTCACTCTOT 
TGACCGAGTGTTTAAAATTTOT 

SEQ ID 7770 

LFIGSVTLYCTRRAVVVIFYLIVFKQNKNFRYVNEFSC^ 

SEQ ID 7771 

A!K^CTCGGTTGAGAGAGTm 

GTCAGGCTGTTGCTGACAAAAAAGTTATGGAAC^^ 

TMCTATCCGGTTGGTTGCAAAGTMCAT^^ 

TTTGATGGTCGTCGt^TTACAATATGGGTGTT^^ 

CCGATGAGGAAGCGAAAGCTTTATTGTCAT^^ 

SEQ ID 7772 

MARLREFYKETVVPELVKQFGYKSVl^^ 
FIX3RGNYNMGVREQIIFPEIEYDH 

SEQ ID 7773 

ATGACaGCACCCGTACO&CGT^^ 

TCATTGCCGAAGAAATCGGCAAACACGGCMTTGGATT^ 

(BGGGATTTTATTACCGCA^ 

CAGCTTGCCGCCGATTTGT^^ 

CATCTCAAAAAGTTGTCCACTTGACCXX^ 

CGAGCACATCGGCGTTTGCACGGATAACGG(X!(H^ 

AGCGAACTGCACCCGCAACMTATGOCTT^^ 
ATC^GGCACATTGATCGGACACTACCGACACCACG

TGCAGGACTGGATTTGACAGGOTATCTGCCC^ 

GCTCTTCAAAMC^TTGEACtt 

SEQ ID 7774 

MTAPVFTCNMSGNPINLHi^ 
QIAADLIXjSVSDSIHCYYIIEISPEIiAARQKNLIQARAPB 
SELHPQQYAFIRTLASRL^ 
AVQKLIDQHMELFKVIAFGKNIGIDWAGFRPGDI 

SEQ ID 7775

ATGCGTGGCATTMGGGCGGTAmTTACTAAAGAMTGCX^ 
AAGGCAAAOTAAACCXCTTCGT^^ 

SEQ ID 7776

ffiGIKGGnTKEHFLDISiaAILN^^ 

SEQ ID 7777

TTGATGGCGTTTTACMCATTMCGCCC^ 
ATTTTATTCATCGTGCTATTCCTTAT 
TTTATTATTCAGCMCACGGCGGCATTGTTATCGAAm 
CCACGCGGAACCGCATCTTTAACTCCC^^ 

SEQ ID 7778

IMAFYNINAliNNKFTrQHSHYI/PljLro 
PRGTASLTATLIISPTEA 

SEQ ID 7779

TTGCCTGTTTTGAACmGTGGm 

ACCTCmCAT(mTGCAAGCGCTX?PW , 

SEQ ID 7780 

LPVLKAVVYSLFSGNQAIKRUPFTVCA^ 

SEQ ID 7781 

ATCATTCAAAIPGCAGACCATCTTAGATG^ 
TGGCAGTTAMGATGCGGTTCCGCGTGGCCGTGT^ 
CAATGCCGCCGTGTTGCTGMTMTAAACTTG^ 
TTA 

SEQ ID 7782 



L 

SEQ ID 7783 

ATGTCTGATTTGTCTGTATTCTCGCCGTTO 

GGACGGTAIITPGCCAMGAACATTTAAACCTGC^ 

GAACATCCCGATTGTCAGTTACCTGCTCCTGCX^ 

TGGCAATACXHXJTGGTCTPGGATTACGCTGG^ 

TCTGGCTGGGGCTGATATTTAATTTGGA^^ 

AGGAAMA(XGGTATGGGCMCGGAGATTTCAMCTGATO 

ATCGTTATK&GCGTCGCCAAGGGGCGGCATTTT^ 

ATCCGGTGAGA 

SEQ ID 7784 

MSDLSVLSPFAVPLAAVLGI^VGS^ 

wqygwswiti^liltaflisltfidadtqylpdsktlpl™ 

IVMKVAKGWffAFOTAI/rVSGWIITO 

SEQ ID 7785 

ATGCAAAACACTAAAMTCAAGCACGCAGCTCAGGOT 

SEQ ID 7786 



SEQ ID 7787 

GTGMCAATTCAGTCATCACCGTCATCGGT^ 

ATTTOTTACTATGATTATCTTGGTGG^ 

AGAMTCTTCCAAGCGATGCACCGCAfT 

SEQ ID 7788 

VNNSVITVIOTRVGlVYDVBKIIAMRINimSQQ 

SEQ ID 7789 

ATGAGCGAAACTAAAAATGTTCGTACTTTGCAAGGCAAACT 

(^TOTC^AAMTCCATGCCCATGATGAAAAT 

TGAGAAAGCACGTTCTATT 

SEQ ID 7790 

MSETKNVRTLQGKVVSDKM^ 

SEQ ID 7791 

ATGGATTTTTTGATTGTCCTGAAA 

GCAATCACAAGGTTITO^T^ 

OTCGTCCTCAA5rTTGCCATTG^ 
GGTT?imATTTTGT6(K3TGGAG^
TTCCGGCTACGTCGXSTTCGGGCA^ 
TTATGATGTCCTGAAACATTACCGATTTTT^^ 
TCCAAGAAAAACTATATCCCGTTTGCCTATTACC^ 

SEQ ID 7792 

MDFIJVIiKAUmVBGFTB^ 

GPFItWVmQSRAEPKIADVDM^ 

SKmiPFAYYRIWGIVIIIimSGWISWE 

SEQ ID 7793 

ATGAAAGCAAATGAATTGAAAGACAAATCCXSTTGAGCAGT^^ 
GTGMTTGAMCGTGTACX^GCGATATTGCT^ 

SEQ ID 7794 

MKAl^DKSVBQU^LLDIi^ 

SEQ ID 7795 

ATGMGTTTAAACATCTGTTGCCGCTGCT^^ 

GAAAAATTGAGGTTTTGGMTTTTra 

GGTCTGGCGGCCTGAAATGCTCGG^^ 

GAAAACAGGGCT6TTG(XX3GGAMTCG^^ 

CGGAACAATACGGTATK^CAGCACGCCGACCGTO 

GGAAGMCGCAAGCGTCAGACCCCTGCTGTAC^ 

SEQ ID 7796 

MKFKHLLPIiLLSAVLSAQAYALTEGEDYLVLDKPIPQEQrc 
ENRAVAGKWALSQKGFDGKKIiM^ 

SEQ ID 7797 

ATGCTGCAGCCAACTAGACTGAAATACCGTAAGCMCA^ 
GTTTGACTXjCCCGTCAAATCE 

TCGTATGGGTGGCGGTAAAGGTMCGTGGMTATTACATTGCCG^ 
GCTGCCAAATTGCCTATTCCTAC^ 

SEQ ID 7798 

MLQPTRLKYRKQQKGW^ 
AAKLPIPTTFWRQVGQ 

SEQ ID 7799 

ATG(WACAAMGATTAACCCTACA(X}OT 

TGA6GATATCGAA6T1TTGAAAC6TCACTTCCAAGTC 
CAGCAGTTGGAAAAACraTTeAATTCCGTCGTG^ 
TTGCCOGTAGCGAATtCTATCGTCMGGT^^ 
TTATACGGAAGGCAATATTAMTCTTCCAAACCTGAA 

SEQ ID 7800 

MGQKINPTCFRMVTKDWASKHFAKSTDF 
QQimVQFRRAMKRAMQNAMRSGAKGlKIMTSGR^ 

SEQ ID 7801 

ATGmATGMCAAATTTTCCCMTCCGGAAAAGOT 

CGTTCAAAATCCCGGCTCCGTCGAAGCAGCCTGCAGA^ 

TGTTGCXaAAAGAGGCAGAGCAGTCGGATGOGGAAAAAGCTGTO 

CAGGCAGTGCGCAAGAAAGCACTGACGGAAGAGCGTGAACA^ 

AAAAAGCTTCAAAAGAAGAGAAAAAGGCGGCGAAGGAAAAAGTTGCACCC^ 

G^GAAAATGAAAACXTTTTGGCAAGGCGGAAGCAAC^^ 

GAAGTGGTCGGCTATCAGGCGGGACATAAAACKK7OT 

TCCGTGCGATTGAAGGCAAA 

SEQ ID 7802 

MFMNKFSQSGKGLSGPFPGIJLATVIIAGILLyLNQGGQNATO 

EVVGYQAGHKTLYRVQSGNMSADAVKKMQDELKKHGVASU 

SEQ ID 7803 

ATGAGAGTAAATGCACAACATAAAAATGCCCGTAOT 

AAAAAGGTGCCGAGCTGATCAAAAAAGTATTGGAGTC^^ 

CITGAAACGTTTTCAAGCTCGCGCCA^ 

SEQ ID 7804 

1©TOQHKNARISAQKARLVADLIRGK^ 

SEQ ID 7805 

ATGGCTCGTTCATTOAAAA^ 

TGCCTGATTTTATOGGTCTGACCATTG^^ 

AGGCCACCTGGCCGATAAAAAGGCTAAAAAGAAA 

SEQ ID 7806 

MARSLKKGPYVDIfflJiKKVDAVRASNDKRPOT 
ATGGCAATCGTTiUUAT^GCCGA

CCGGCCGTMCMTMTOm^^ 

CGAATACGATCCTAACCGTACTGC^ 

GCT(mTCAAAGTAGGCMCACCCTGCCGA 

CGGT&TTGTIXjGCTAAAGAAGGTGCATACGCT^ 

GAAAAAAATCGGTAAAGC^ 

CGCGMCCGGTTAGTCCATGGGGTACTCCTGCTAAAG 

SEQ ID 7808 

AAIKVGNTLPIRKIPVGTTIHCIEMKPGKGAQIARSA 
REPVSPWGTPAKGYRTRNNKRTraWIVBERYSNKG 

AAGTAAAGGAAAOXXm^ 

(^TTTOCGATTGCMTCGGCATCCTT^^ 

QCXHTOXX5ATGGCGTGGCAG<mT^ 

AAGTCGCCGCTCATTTGAACGGTATCGAACCTTTGG^ 

CCGCOTOTTTGGAMTC^^ 

GACGAATTGGTAGMGTITGGGCATTGCGTTC^ 

G(XMOTCa3AT(X«^^ 

CGAAA^CATTTCCCO^^ 

ACGCCCGAAAGTGTCGCGCGTTACCGAAGCAAAAT^^ 

AGAAGCGCAGGAAGCCTTGGGTAGCCTGCTGGAAAAAT^^ 
AGAAGCCACGTCATGAAAGCCATAGGCTTCCGTCGTGCOT 

j&H&tfQGMQAKRAFVLPE^GQM^ 
DavEWAIJtfLLP^ 

TPBSVARYRSKISGPIiLDRIDLTIEVPSljSAAEUiQQEAGESSAS 

RSHVMKAIGFRRAL ' 

S A^GTTO^^ 
TOSGTOffAT^^ 

SEQ ID 7812 

tflTKVLANATKPEIKMVELLFGVQVASTO 

SEQID7813 ^ ^.^wvvan^^lliqrPlLaiLlOT 

TTGGATACTCATCACATTCCTCC^ 
CX!TGCCGCGATTTAGGGCATTTCGAGGTATCCAC^ 
ATCGTACACGATGCCGACGCGGTCm 
SEQ ID 7814 

GCGTGCTTGA1TFP 

SEQ ID 7816 

LGI£NGILQQIjSVYRIjFFKKR^^ 

S k^ctataL^ttct 

AAACAGCATATTGTTTTGTAGTGTCTTGTMTGTAATCG^ 

ATCTTTTGACGATTATCCGCCQCX^ 

GTCfKK^AAAATCAGCGAAAra 

^TCCAGCAGCAGGTTTTGATTAAMCCC^^ 

■RXtt3AAGAAGCmC«^^ 

VGSKlSETIAHSPAKDVEKNnCAMLGGAFN^ 

SEQ ID 7819 

TTGMGATAATTTATCCGAATCCXXJTT^^ 

ACGAGGAA 

SEQ ID 7820 

LKIIYPNPIiSGIRIFRCTFIKK5TQAQVACNFKACIBBV^ 

SEQ ID 7821 

atggaatoaaagtaat^cgctaaagga™ 
OTACTK3(XrPTCAC^
GGAGCAATGGGTA 

SEQ ID 7822 

MELKVinAKGQVSGSLSVSDAL^ 
RLFVIEM/TAETPKTKVFABQVKNIiALE^ 

SEQ ID 7823 

A^CTPTAGGTCTG(nTO»CGCmGTTGCT 

AAGATACTGACGGCTATACCGCOTTCM^ 

TGAGTTIWTTTGACTGMGAAAAACTG^ 

TCXGGCACGATCAAACGTCATAACTTCGGT6CCCAACCT 

AGCGOITGGOTGCCAATACGGCAAC^ 

CGATGTTOTAGTTCGTCCCAGCGTGAAAGTAGGT^ 

SEQ ID 7824 

miGLVGWCVGMTRVFDEQOT 
SCTIKRHNFGAQRTSHGNSRSHR^ 

SEQ ID 7825 

ATGMTCCCATGTACATCACTTTTGCMOT^ 
CGTn^TITACGGCGATGTCGGCAGGCGC^ 

ATGAAAGTGGTTTCCGCACTGATTATCCTG 

TGTGGCTGGGCGCGGGCQ^CCATCGCCTAT^ 

GGTCTATCTG6GCTTQGGCGGC6CG6AA(^6ATC 

ACC(XIC6OTGG6GOTG6GCTATT^^ 

GcemXXmXKX^AGCGGTCGG 

6TTCAACCOTGGATGX2CGGCATCATOT 

TTGCGTAAAMCGCGCAACAGTCGGMTTGGTATGGGTCGGC^ 

OTACGCGTGGGCAGGTTTCG(?KKX^ 

TGTGTGGGCGGAATGGCTAAAAAMCCGGC^ 

TTCAACAAAGAACCTTCGCGCGAAAT^^ 

SEQ ID 7826 

MNPMHTFAIYLVAVIililGIiAAYFSTO 

MKVVSALIIbFPFTnCASGmGATl^Si™ 

TAAWGLGYFGQPHILARFMAAES^ 

LRKNAQQSELVWVGRIIIVIjAIAVISILIASDPNSKV^ 

FNKKPSREIQERFEKADADYRAAR 

SEQ ID 7827 

TOGCCTGTCTTTCAGACAAAGTTAAACTT^^ 
CGOTT 

SEQ ID 7828 

lACLSDRVKlOTVPSLT^ 

SEQ ID 7829 

TTGTGTTTTACCCGTTTCATTTTCAAAG^ 
TC 

SEQ ID 7830 

lOTRFIPKETEAPILPAFBALmaKTILPYSDRmV 

SEQ ID 7831 

TTCCCTCGCCTTGCCGTACTATTTGTACTGTC^ 
AG<XXXX£GTTM(&TCT^^ 

SEQ ID 7832 

liPRLAVLFVLSAASSIiCro^ 

SEQ ID 7833 

TTGAAAAAGCCGA0GCAGATTAC(X3CGCXJGCCCGATA^^ 

ACC(^TGCCGTCT(^CCCCCCGAAAAAAC^ 

SEQ ID 7834 

LKKPTQITAPH&PLSDGTASRFVPSE^ 

SEQ ID 7835 

TTGTCmOAGAATTTTCAGAC^ 
GCATCCAOTGATTCTAAGGATGG^ 
SEQ ID 7836 

LCLKNFQTAFFSEQNRIFF IHNF PKACPPKRGRIGLGLGRSIHLIIjRMDDVIiPSSK 

SEQ ID 7837 

ATGGATGCTTCTGCTCAMCai^^ 
AGGCACAAAATAAGCCGATTTCA^ 

SEQ ID 7838 

MDASAQTKPDSTPF^^ 
SEQ ID 7839

CGCAGATGAAATACAAGGCTTCAMTGCGGGCAGM^ 

SEQ ID 7840 

LAQGSLCRKCKMKHFI^ 

SEQ ID 7841

AOCTGCAACGTGCACAC^TGAGTGACGAAGAGCGCAACGra 

ACGAGTTTTOiCTCTC^^ 
CT^GAAAAGCCATTTGAAC^ 
CTGAT(XMCAAAGGCGG(XXIACCGCTCATCCGCC^^ 
GCGAAAAAATGGGCTAC^^ 
GGCAGGACAAGGCGTTTACGAAGGCAACGGCATT^ 
TTGTTCCTTTTGGGTAAAAAATACGATATCGGTAT^ 
ACAMGGCATCGGTTTCGTTGTOAAGOT 
GTATOGGACA^GAAATCAAATGGGCGC^GTGGAC^ 
CAAGACGCGGTATTCCCGCAASTCXXXIAC^ 
ACGACQAGTCGTCGGTCCGCAAMOTW 
GTCTTTCGTCAACCAAATCGTTGATGAAAACATCAGCATO 
TTGTACGGCAAATGCCGTCTGAACTCGC^^ 
TCAACGGCAAAGCCCGCGATGTCGGCGMGCGCAACCGAW 
A(£CGTTGCCGCOTCCO^ 
GMGCGGGCAAAACGCTGMCMCGCCATTGCCG 
TTGTCCXX^raGCCCGlX^CTTC^ 
mTGCCGTTTCCCTCATGCACGAA© 
ACAGGCTCGAOTMGTGGCGCGCCTGATCM 
CCGAACAAGTCTGCGCOSACGTATTGAACTC 
AGGCGC6ATGGACGAACTCGT(X3TCGGCAAACX36ATT^ 
GCCM(?TCTTACaCGMGTCAAAGaXX^ 
C(X3T(X^ACGTC(?iraxrrAa^ 
CAT(XXX^CCGCATCGAAGCCGGCMCGTTTACGTCM 
TCGTTCTACCTGCAAAAACTGAOT 
TCAACGCXXMGAGAAAAAAGCCGCAGCGGCCGCTTTGGGAC^ 
C<mxmACGCGmG(&TACA^ 

GCGCACTOTCCGCGTCCTCC^^ 
CGAC 
SEQ ID 7842 
MI£RAQMSDEERNMSEIJUtftfjVTQ^^ 
LI6KGGAPLIRQ6VNYAMRIjIi6KQFVTGQTIEEAI^NGKER 
LPUGKKYDIGINII^EANRIiELSI^ 
QDAWPQFATHNAYTLGAHQ^KGKDFEH^^ 
LYGKCKiKSQGVDLSNEim<lQLQEQPKA^ 
EAGKTIMAIABVREAVDFCRYYANEAEHTLPQDAKAVGAIVM 
TGSTEVAHIJNKAIMRGDNPVIiIAET^ 
AKSYHEVKAAADVBSEKSTFVRPIIiFELNNL^ 
SFYLQKLTRIPEWVAPTLSQIGQADEAALKRLEALIHKLPFNAEEK^^ 
TADLEGUjLVNGKPETAGISHVAAI^ 

SEQ ID 7843 

ATGGCGGGCGGTCMCTGGTCGCTGA(XXJATCC 

CGCGCATCGAT(ara&^^ 
CATATTGCGTAAAGAAATCTTTGCCTCCCCATC 
GGCATTGTCGAAATTCXGCTGCTCACGGAGAAG^ 
TGAOGCGCGGCGAGGTGGCXXiACATCATO 
GCT(X}TGCACX3CGTTT!FATTCAGGG^ 
SEQ ID 7844 

MAGGQLVADPSGEMTAWVGIjTGGIGSGKSAAAQ^ 
GIVEIPU/TEKRQFISLIRRVM 

SEQ ID 7845 

ATGGCGGAAACGTTTGCGCTGACATTTATGATO 

ATTACGCGGOTATCAMGCTGGATGACGGGTATTM 

GfXXXJTCGCGGAAGTCATGTTG^ 

AAACAAGAGCACGGTATTTOGCXXAAACCGACATACAGC^^ 

CTGTGTTCAAACAGCCTGCTXXAAGCAAM 

TTCGCCGTTTTTAACCCGGCTGI^^ 

MCGGCTTGGAACAAATCAGCGGCGGCGATACCAA^^ 

TCGGTMGAAATGGATAGACX^TCTGATTCAGCCGACG 

ACCGACCAAATGATTCAAACCGTATO 
CGGTGCAGCCCGACAGC^TATTGTGCCTCT^^

(tfTCCTGATTCACAC^^ 

TATGTTTATCCGCAA 

SEQ ID 7846 

MAETFALTFMimYLFARYKASRIiLIAW 

KQEHGISPKFIYSRIKANYFSFt3YFVGBVLPYQIjro 

NBliEQISGGETNMFRIAKESQGYETOT 

TDQMIQTVF^LQKgPDC^ 

YVYPQ 

SEQ ID 7847 

TTGGTTGGAAACCAGCGGGGGCGACGGTTTTCTO 
TTACGATA^TTCnTTCGGGTTTGACGOTG^ 

GCX36GTATCGCACACGCXXIAGGATTTTTATO 

TTTTGTGGCATTTTAATGGTCAGAAAAAAATGGCGGGG^ 

TGGCTTGAGTGAGGGTACXX3GATACCGTTA 

AAGACGGAAATGGTTATGGTCAACGATGAGCC^^ 

GCTATGAAGATC^TTCTAAAAAGAGCGAGGCTGTTTATC^^ 

SEQ ID 7848 

LVGNQRGRRFCFPLDARAVIGRRV^^ 

AGIAHAQDFYYPSMKKYSEPyiVASTQIKSFVPTL^ 

KTEKVHVNDEPAKI^ 

SEQ ID 7849 

ATGGCAAACCAAAAAATCCGTAT(XX£CTGAA^^ 
CTTTGCCGACCAAAATCGAGCGTTTCAACATTT^^ 

SEQ ID 7850 

M^KIRIRLKAYDYALIDRSAQETV^ 

SEQ ID 7851 

■PK&ACTTTMTTTCT^^ 
TCACGGGAAGTTTTGTTCAOSTGCGGAGAA^ 

SEQ ID 7852 

IBFNFYIWK&QIQIiHQRIGSFIGPIHDVB^ 

SEQ ID 7853 

gtgtcggtcggctacgatttcggcagctggaggatagcggcagattato 

ggaaaaaactgacgcmgacctgaagacggaaaatcaggaaaacggt 

tgcaggcgtgcgcgt(agctacggacacgtcagacacagcatc^ 

aatccacagacgcaamcccttatcaccaaagcga 

acca™otg(^cgcctggaaaacacc^^ 

SEQ ID 7854

VSVGYDFGSWRIAADYAKYRK^ 
NPQTQNPYHQSDSIRKVGIjGVIAGV^ 

SEQ ID 7855 

TTCCCAmAT(^TATCCTTCAOT 

AGTTCTACAGTAATGGTTA(X3TTCT 

AGGGGGTATGGOGGCEGOCCTCTIC^^ 

GGTACCGCGCAGCAATACGOTACGT^^ 

TCGTCACCAACGTGGATGATACXnCGCTCT^ 

SEQ ID 7856 

LPFINIIjQLSDNRRNHAGTHGTAAFANRKAQTFFHSDRRNQFy 

GTAQQYAYVVARLTKVftJFAE^^ 

SEQ ID 7857 

ATGTOKSATTATACTGTCAOTATGCCtm^ 
ATGCCCAA 

SEQ ID 7858 

MFGLYCQICBItRMPFARSCALKMPSEGSDGIGIGESEAVAHAQ 

SEQ ID 7859 

ATGTT(X2(£AAACTGCTGGACGAAGGTCAG 
CKyOCCAAGTTCAMGCAGAAGTGTAOT 
GGTTACTTTGGAAAAAGGTGTGGAAATO 
CGTACCGTGGGTGCCGGCGTGGTTTCTTC^^ 

SEQ ID 7860 

MFRKLIJEGQAGIMGVLIjRGTKREIWERGQVIjAKK 

RTVGAGWSSVIA 
GCTCC(?KX^CCCCCGACTO
CCGATTCCGTCAAA!ICCAT(£ATT^^ 

SEQ ID 7862 

VQADIAYAYEHITREWEAT^ 

ATAGGCAGCAGGAATGGTTTGTCCAOGGC^^ 

CGGAACCTTGTACGATCXXSGCAGTCGTC^^ 

GAACACGATGATGTAAGGTACGCCTACT^ 

GTMTCATGTTTTTAACgTAGTCGGCGTGA^ 

GTGCGTTGTOGATTTGGKOTAAGCTT^ 

EHDDVRYAYLTGQQDWAGliRHRAVGSRTYQOTVH^ 
SEQ ID 7866 

LPPASTNPFNITDPAIAADSVKSIHSAESRLFPKNIjDAAQ 

S ATCGCT^^ 
GCGGCGCTGCAAAAGOTACGACCAAATCGACJ^^ 
CTGTCCGGGTCAGjCCGACTACGTTAAAAACATGAOT 
CTGGCCCGTCMGTAGGCGTACCTTACATCATC^^ 
ACTTCCCGGGC^CGACTGCCX^ 
TCCCGAGCGTGCCGTGGACAM(^TTCTO 

SEQ ID 7868 

MAKEKFERSKPHVNVGTIGHVDHGKTTLTAAIjTO 

SEQ ID 7869 

CCAAGCATCAGGGCGAGCAGGAGGGGTTOTATTGCATGGTO 

SEQ ID 7870 

I^GGI^PAASRFLGNRRDSAEI^ 

psibasbhomfowrlsbkiq 
S atgg^ 

GCGMGTGCATGACGGTGCGGCTACTACCX^CTACATGGMCAA 

GCACCGOTOUO!^^ 

CMTCTGAAACCGTATGGCGGCMGCCAACAM 

TGCGCGCAAACCCCGTACCTATCGraTTO 

TAOTATGGCGATATTCCTGCCGMTTGGTCG^ 

CTGGCCGAAGAAGAMTCGTAGGCGCGTTGCGTCAACCT 

GTTGAACC^CAAATACGKXXJraCX^ 

CAAATGACT(XXmGACCGTACTCAAAT^ 

TCTTGGAACGTATGGAATTCCCCGAGCCGGTAAT^ 

CCGCGTTCGTACAGAOGAAG^TCCGGTWAACtt 

CCTCAAGTGGOTACCGTGAAACTATC 

CGGGTGGTGAAGGTTAC^GTTTATCGATGAAATIAAAGGTO 

TOTCTA(nTCACGTACGTAraX3TCTGCT 

CCTGCCCra^GCCGAm 

TCGGCGGTAAAAAAGTCCGTGCCGAAGTACCOT^^ 

A(KH!CCrcCCCA^ 

SEQ ID 7872 

HARKTPISLYRNIGISAHIEAGKT^ 
QSETVWRQANKYQVPRLAFVNK^ 
IAEEEIVGALRQR^ 
QMTAADRTBIEEVW^IAAAIGiravm 
PQVAYRETIRKAVKAEtfKHAKQS(&KGQY^^ 
PALLEPIMAVEVETPEEYMGDVHGDLNRRRGVV^^ 
SEQ ID 7873 

CGCTTCTGCOXKXXSGTGCAAT^ 
SEQ ID 7874

LMRRAEARPATLSMHPLA^ 

SEQ ID 7875 

ATGCCAA(^CGTA(^GAAGTCCCCMGCGCGA^ 
GTATCGTmCGGT^^ 

GQC6AQTTGATT6ATGCGTCC6AA6GCC6TO 
SEQ ID 7876 

MPRRREVPKRDVLPDPKFGSVEL1TOINW 
GELIDASEGRGGALKKREEVHRMAEANKAFSHFRF 

SEQ ID 7877 

MX3GTGTCCGAOT 
SEQ ID 7878 

MVSDFMMPRAK)AAEQpyFRFVDVYVRQARMVQLGI 

SEQ ID 7879 

TTGCC(^TGGT<mXXm 

AAGCCTGCCCGCAAAAACGCGGCGTGTGCACCCGra 

ATATATCG(K}GGTGAAGGCCACAACCTCCAAGA6CACA 

OGTOTTAAAGACCGCAAACAAGCCXGTTC^ 
SEQ ID 7880 

I^TWCROIFNSTGRBNMPTII^LVRKGRQ 

GVKDRKQARSKYGAKRPK 
SEQ ID 7881 

ATGOTGAATCGCGGCAMCACK3OTT^ 

TGGGCGGATGGGCAGACGGGAAATATACGGTTTCCGGTC^^ 

SEQ ID 7882 

MAESRQTRLQVKCPTCQTAVVWKPENAFR^ 

SEQ ID 7883 

ATGCCO&TTTCCACCAC^

AAATATTCCTCAG0CAAAGCCCCAACC6ACTCTG(^TCC^ 

CTTCCAGACCATMTCTTCMTATTGAT^^ 

TCCGGTGCAMGATCC^MGCATOATGCACCAGCTCATCCT 

AAATCAMCTCCCCCTGCCAGGCCTGGTGTGTCAA^ 

GATATCGGGCAAGCATATC^AMGGCAAATTCAAAGTA^ 

TGCAAAACG 

SEQ ID 7884 

MPDFHQQYTFMQKLCRIQKNI^ 

SGAnQSIHHQIiIVFYPRSKSVTQQFINKRARVNH^ 

CRT 

SEQ ID 7885 

TTGCCGATACGSGGTGCAGGCTCGGTCGGCE^^ 
TTTCAGGT 
SEQ ID 7886 

LPIPGAGSVGYDVKVRFFAWMVSAQITINPRCFGmVISG 

SEQ ID 7887 

CMTCAGTCGGACGGCTGGAGAAMTAOTnC^ 

SEQ ID 7888 

LPTPCRQERNHFIASRQSSITTRKOT 

SEQ ID 7889 

TTGCTTATAGTTGTACGGTTT^ 
TAACCGGACAOCAGTCCCCAAACAAATCCOCCATGTAC^ 
GGGCGCAATCGGCGCAGCXXaTGGGCXK^ 
CCCGAMGCOTACCITCGCCACC^ 
ACACCGACAACACAGGTTCCGATGCAGTCAACAATa 
CGGTTOXaTATGCC^^ 
SEQ ID 7890 
LLIVVRFDICTEGNAHTPFKPSTVVLTASAL^ 
PKVTFATGSAALGGSAQYAI^MQTLVQYPimTINGH^ 

SEQ ID 7891 

ATGGTTTGTCACTATCGAAAAAACGTAT^ 
TGCTTGACTGTCTGCTXX^TAAAAA < I!A 
SEQ ID 7892 

MVCW&HVLTVFSPAVRLIVRTO 

SEQ ID 7893 

TTGGCGTTGCAGGGCXXX^ 

G(£ATAT&X&ACCCTAGCCGT^ 

CTGTGTTC3TCGGTGTGC<XrGT^ 
GGTMCGCTTTCG(XX^TCACCAGCC^

ACTGCTG1XXX3GTTACGGG6TCGGCAAC^ 
TAC^ACTATAAGCAATCATGCCGTCTGM 

SEQ ID 7894 

IJOiQGAIiHVPCG^ 

GNAFGHHQPDLVALALDFDIiRAG^ 

YHYKQSCRLKKCH 

SEQ ID 7895 

ATGGATAAOTCTX5CCGTTTGCCGGG 

AGATATTCAATCAAAATGCCGTC 

SEQ ID 7896 

MOJVSPPAGKPQIMGGVQVGPTLQTI^^ 

SEQ ID 7897 

ATGAGOGCGGCTTEATGGCATAAAAAACTQTCGT^ 
GGTTTGTCAGTATGGAAAAAACGTAT 

SEQ ID 7898 

MSAALWHKKLSWKGFTPQINFRYl^^ 

SEQ ID 7899 

ATGTTCGAGGCGGCCGATCTATACCGTGAGTTGTO 
AGTTGCACCGGGCCGAGGAGGTGTTGCCXjCCCGATAT^^ 
AGATAATCTAAACAATGCGTCCCTTTCACG66ACATC 
CGGATGTTTMCATGGCAGGCAATCACTI^^ 
ATCGGAATAGCOTCTAGAGTGGGGAT^ 
AT(Hm»(^CCCAATCTm 
GCTTG<KIAGTTGTACGGGGGTATTAG(X3GTAT6TTTGA 
TGAAGTTGGGTGCGGGTTACACGAAACGGATAOT 
SEQ ID 7900



RHFNMAGNHFAMiGISGSGVH^^ 
AWQIiYGGISQIPDNTVHCE^ 

SEQ ID 7901

ATGAAAATCACCAOTGGAACGTCM 

^TTTCKX56CCGCCGOTTGCAGA 

TCCCTOEGGATGACCCGCA^ 

CAATGGTTTGCCGCACTGACGGAGTTTGTC^ 

GGTATGAAAAAATCCACTGTTCGTCCGTCGMCGG^ 

ctatcgcggc<^t(?itcc^^ 

CGTCCGAt^CQCGCGCC^^ 

SEQ ID 7902

MKITTWNWSLNmPQVQNLLVI^PDILV^ 

QWFAALTEFVRDE^RHGKL^ 

RPSDHAPVAAEPDL 

SEQ ID 7903

AT6GCGTTGCTGGGCATTTTATTTGAATTTOT 

CGCGGC^GGCGGCGATCjTTCGAGGCGGCCGATC^ 

SEQ ID 7904 

MAI^GILFEFVKMKSmFFHGYLIASATFAEDIGVPVEPIWGSRA^ 

SEQ ID 7905 

TOCT crara3A(x^^ 

ATCGCGCCGCGATAG^^ 

GGATTTTTTCATAIXATTTTTCAGGGTO 
* TGCGGCAAACCATTGTTCX^rTATATTTGAA^ 

TCATCCGGCAGGGMGGCAAACCEAAATG^ 

(XXCGGCCGGAMTTTGTCCTGATCGAGTT^ 

AGTGGTGATTTTCATAACATTTCC^^ 
SEQ ID 7906 



CGKPLFLIFEFGAVBGFAVDAIDIDDAHAADGRRHHAAI^^ 
SGDFHNISEKCRLKRSDGKRKGF 

SEQ ID 7907 

ATGCCCAGCAACGCCATTTTTOT 
ATOGGC«5ACWmTI^ 

SEQ ID 7908 

MPSMPIiPVLPGHICSVRINCPKRNYCCFBV^ 
ATGAACACAATACCGCTCCACACCATACTCAAAOT

TGACGOXaCCGCaniTTCCTG^ 

GCGGACAAT(XXK^TTTGGAAMCAAACGCGCG^ 
SEQ ID 7910 

MNTIPIBTILKLMAHPERMAIIJ^ 

SEQ ID 7911 

ATCTOBTTCAA<m^ 

MGCCAMGTATTAMGGTTAMGGTGATGMTATACAGGAG^ 

TAGTMGGAACMCTTCAGMGATGGATGTATATGTTAATGGTAAAAAATA 

GGTGCGGATAAAGMCAGAATTATGCGCTATTAAAMCTTGGG 

GACAAMTCCT(1AAGAGATTCCTTTTGACTTC 

TAATGGGGTTTTATCCAMGAATCX3TTAGATMCCATMTGGOT 

AAATTAGMGAAGCTGCTATTGAGAGA^ 

ATGAGATTAAAMGTATCATTTGGGTATTTTC^^ 

SEQ ID 7912 

MSVQPSVSBQIiKDNMTOIM^ 

GADIOE^NYAIiIiKTWVYEQ 

KLEEAAIERIPYRESGSSLGLKD^^ 

SEQ ID 7913 

ATGCATMTTTGACGGCACGAAGATATGGGACACTTATCGGGOT 
TTGCGGTTTTTTAT 

SEQ ID 7914 

MHNLTARBYGTLIGLI STRLK^ 

SEQ ID 7915 

ATGCAAAATATGAAAAACCMGTACG^ 
TGACCAAGCTCGGCGMCAMTGGCGCGCCTGOT^^ 

CGCGAGCGCGACAAAGGTTTGTCCMCMGCAGTTGGTGCAA 

TTGAAATGGGCTTGA(XACCAAAGAAGCTGCGTO 

GGATAAGAAACAACACCGCACCCAAATCCGCGCTACCAMGMGC^ 

GACTACACCGGCGCGCGCGGCAGCCGOTTO 

ACGTCOCOGTTATCCAGCCCGAAIXK^TCGAGCAGGAAG 

GTTTTACGGCCTGACCGTATTGCCGCGCC^ 

GATTTTTTTGTCCACAACAAAAAGCTGATO 

CCGATTmATATGGCGGATTTO^ 

OTTTCCGCMCCGCAGCAAAa 

TTCTGCGMGCTAAAACTAAAACCGAAAGCAGCCTGCACT^^ 

CCGTGGGGGAGGGCTGGGGAGAGGGCAAAACAGTTGCCGCACAMCC^ 

TGACCCCAAACCCAAAAASCAGCCT(^ 

TTGTTCCTCAGCCGCGACC&CC^ 

ACCACCCGCTCGAC&GCGTTAC^^ 

AGCACTCCCCAAACAAATCG^GCATCTGOT 

AAAACCGCAGGCGACATCCGCATACTCGAGCAAAra 

O^TGGGTCCKGATTTMTCCAAATCCMC^ 

GCCCGMTCCAT^W'PICGCCC^ 

GCCCACAGATTAGGCGTGATTCiAACTCATGAM 

ACACCCIXXX^GACXJACCTaCCC^ 

CGCCGICAAAGAAGCCCTCAGCCGCTACCT^ 

CTTGCACCCGGCTTC^ 

GCGMGCCXjATArcCAAGAGCTGGAACAAATGTGGCAGGAAAAAACTGAC^ 
AG?GTCGCTGTTCGCGCAGGAGTTGMGACGCOT 
SEQ ID 7916 

MQNMKNQVRGSGMDARSNPANVSK 

RERI^GLSNKQLVQWCRQyPLSHLRMREWRELHH^ 

DYTGARGSRFHLFPASAIJ^^ 

DFFVHKRKLIKEITELEHKSRKQDVLVDDEALFAFYHERLPDFYMM 

PCEAKTKTESSUISQRLSENOTPPS^ 

LFLSRDDLMQHAAAHITE^FPKHWQTAIXSKFK^ 

KTAGDIRIIJ3QINQDEWAAPRIiP]&OT 

AHWGOTEimQLKEQVKD^ 

LAPGFATRTTOAQWPRLPIYIiKAMTLRI^^ 

SEQ ID 7917 

ATGGTTCATATTATGTTTAATCAGGTCGCACMGAATO 

(XXIATGATGAAGCGGTTTATTTGATTCT^ 

GCGTCGTGCTGTCGAACACATTCCTGCC^^ 

GGT(^CGGACTTCGACCTI^ 

ATGCGGTTCATGTGAGTTTGGATGOT 
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TGATTTGATTGTTTOCAAT^^ 

AGGCAGMTCTTCTGM^ 

AAACCAGCGGGGGCGACGGTTTTGTTT^^ 

SEQ ID 7918

MVHIMFNQAAQELTTIRD^ 

GK?IiRHfIEyDELVHNALDLCTGS^3iAIQ^^ 

RQILLNAAKFIilTCKGV^^ 

SEQ ID 7919

ATGATTCCGTTCG(mCAGGGTCAa^GGCTAGGGa 

SEQ ID 7920 

fflPFGNRWQGRGGSiMPSESIjSLRIiPAKIVGKGTYHPVQ 

SEQ ID 7921
ATOJAAGAAtaGlTCAT^ 
Q^OXX^TTGTTCTGCGCCAAA^ 
OTGTCCAAAGTGCGCCGCGMCGCATGGGC^ 
GATATC6AACGCCTATTGTACTPK5W 
ACGAOTCGATGCCAAAATGGGTGCGGAAGGT^ 
CAAAAAAATCGCTAAACECTTGAAACT^ 
T<PGGACGGCGGCCGTTTTGCCACTO 
AAAAACGTATCTTGCMGAAGCXK3TT^ 
CGGCCGCTTCCGTCAAAACCTGTTGGGC^^ 
CT(?rTCAAACCATTrATTTTC^ 
TCCGCGAACAITCCGATTATGTTGAACC^^ 
ATTTAATQCCGACOTGACGGTGA^^ 
GMCCGATTATCGTAOTTCCCMGACATCGT 
ACCATACCAMCAGGTCGAGCT(^TACmMTCACC^ 
ATTGTTGAGCGAAATCCTGCCGAAAGGCCTGCC^ 
GTTATCTTCGCCGACC&CETGATO 
ATtKXX3AAGTTAMGAAATCGAAGACCAATA(XX?TCAAGG^ 

CAGTTGTCCGGTATGCGCGGTTTGATGGCGAM^ 

CGCCTAAGGG1TOXX!G(&?AC<X3^^ 

CTTTGTCATGAAGGCAGTGGTACAAGGCGGTGATC 

GCCGGTACGTTGTTGACTGAAAMCTGGTGGATATGATTGAC^^ 

GCCGCGACTTGGCGCGCGGCAAACram^ 

TCCGAAGTCGTGATTCACGATGAaTCGGCra 

GQGATCC«MACCCGTCtt&^ 

AGTGATTGACGGTAAACGCCGTTOTCTAGTGC^^ 

GCATTCCCCGTTGGTGCX^^ 

GCGTTGCXX3AGCTGTTTGAAGCACGCGT(XXX^^ 

GGACGGTO!AGCATACGAGACTITGATTT^^ 

CGTCTGCAAGGTATCGAAGCACTGGCACXOT 

GCCGTGTGAACAIPrGCGGATOTGGCGAAACCGGGra 

CGAAAACATATTGCTGGGTATTACCAAAG^ 

GMCTGCGCGGTCTGAAAGAGMTGTCATCGT^^ 

CCCAAGTAACGGATGAA 

SEQ ID 7922 

MEEEFDAIKIGIASPETIRSWSYGEVKKPETINYRTFKPERIX3IJ 

DIEmYFEAPVVTDPOTPIiQRRQIiLTEDDYYNKU)B^ 

UX^RFATSIMDIjYRRVINRNNRIiKRLLEU^ 

LFRPFIPHKLEKQGLASTVKAAKKLVEQEVPH^ 

EPIIVPSQDIVIX5LYYMTRDRINAKGEGSLFADVKEVHRA 

VIFADHamFGFAAKGGISIAVD^ 

QI^MGLMAKPDGSIIETPITSNFREGLT^ 



CEVVIHDDI(mERERHKVPY(^LL^ 
AFPVGAVITVREGQEICaCOJVLAW 
MjQGIEALARYIVQEVQBVyRLQGVnSDKHIEVIH 
EIJIGIJfENVIVGRIiIPAGTGLTYHRSRHQQW 

SEQ ID 7923

TTGTGTTOTCGGTATKCTAAAACCTTO 

GTACAGGATTGTATGTTCCTTTTOCACT 
TATTTTT 

SEQ ID 7924 

IXSSVFIflPFPPAfOIHHlQIWMILII^^ 
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SEQ ID 7925 CAGOTGTO^^ 

TAACGATGCACCGCATCCKXACM 
TTCCGGC 

SEQ ID 7926 

mdifresrpfcattcptptsgsa™ 

SEQ ID 7927 

ATCCCT^TCGCGCX^mt^ 
SEQ ID 7928 

MPKSRLPmiFF^KHHDIfiEKCRMMSDGIPSIiRQ 

SEQ ID 7929 

GTCTGTATGMCTATTCGmACCGAGAAAAAACCT 

TGCAGCTGGAAAATGCTTTTGACAAACGTAC 

(^CGAGCCTTTGTTCGATATCCCTGM 

GAAGTTCGTGAAAACGAAGTGTATATGGGCGAAATT^ 

TCTTCGAGCAT(»CAAA^ 

CCGTATCGACCCKCGCCCTA^ 

GTTCAAACCGATTTGGTCGCA(^ 

ATATTACCAATGCAGGCCTGACXICGTTTGGATC 

MTTACAGAAGAGTTGTTGGCCAMTTTGATATTMCXXS^ 

GCCGGCCGGCAGGCGGCGCGTGTTGOjATTT^^ 

TGTCC05CGTAGGCCGTATGAAATTTAATAC(^ 

AGGCGGTTATGTCCTGAGCGTCGAAGATATTGT^^ 

GTAGGCGAGCmCTGAAMCCMTTCCGCAGC(X^^ 

CGGOmACCC<XfcAA^^ 

GTraCGC(X!GCAC(^^ 

CAC^GGCGAATGCCGATTTGGATTCAGAC^^ 

CGGTCAGGTGGTATCCGTTGCGGCATCCCTC 

CCGATGGTCGGTACCGGTATCGAGCGTTCCGTTGOX^^ 

ACGACGMGCGA(yTCXtt£TGA^ 

ACaXKXWTTGGTOX^ 

TCCGAAAAAGTOOTXXSGAC^ 

AGCGTATGCAAAACCGraXaGACGMTCCGGTATCGOT 

AAAACTGCTGCGCGCCATCITCGGCGAAAAAGCGTCTGAre 

CAACGCGACAAACGCGCCCAATCCATTATCGATTCCGM 

TOGTCAGAAA(£CAA(m^ 

TTTOmAGC^TTaSAACTGATTAAATTGA 

CAAAAMTGOTAMtmTTTATCGCCATCm 

ACAWK5CGGA05Gaxm^ 

ttTCGACOTATGCTGAAAGAGCGACGCAAAGCC^ 

GAATTGGCTTCCMTCTGCGTAAAGGTGCAOT 

A(^CTGGGCTTCAACGACAGTAAAACTCAAA 

TGACGAAAAAATCCACGCGCGTTCTACCCKXICOT 

GCATACGGCGCGGCATACACGCTGCMGAGATGCTGACTGTGAA 

SEQ ID 7930 

VCMNYSFTEHOtlRKSFAKRENVLEVF 
EVRENEVYHGEIPLMTPSG SAVING™ 
VQTDLVAGRIiKGETAKVDILDKEGNVL^ 
AGRQAARVAIYEMMRK 

VGELT^FRSGLARVERAVKERMQAESEN^ 

VtARTNDYGFLETPYBRWDGKOTEEID^ 

HJVGTGIERSVAVDSATAIVARRGGVVEyVDmVm 

SEKVAADDRYTSIHIEEUIW^ 

QRDKRAQSIIDSEIiKRYM^LNDQLRIFDN^ 

QKMVKVFIAIKRRLQAGDmGIUiGNKGWSRILPVEIBIPY 

ELASNUIKGASFASPVFIX5AKESEIRBMLNL^Y 

SEQ ID 7931 

ATGTCAGGTAAGGAACQSOTAACOTMTC 
TTCAGGCAGGCGCGATT 

SEQ ID 7932 

HSGKEPLNRHFVIASGRECRI^ 

SEQ ID 7933 

TTCTTTACATTTATTTG(^TAGTO 

SEQ ID 7934 

LFTFICIVFIKSIiQINVNIRIiM^^ 






(X&1CK5AAAGAAGCTAAAGACATOT 
ATCAAA 
SEQ ID 7936 
HHTKEDH^VG^ 
IK 

ATC^CGCCGACOTTG^^ 
CGAATTITimCAAMGCOT 
SEQ ID 7938 

SEQ ID 7939

^(X^AAGAOTGGGC^ 

(£(XXTCGCCTTTCAGACG^ 

GCCGCCGftACGCACCGCQCTCAACTACCTClAC^ 

■T^CCCTCCTGCGCGT^^ 

CGCCmGCCGTTCXmGGCAAAA^GGOT 

TGGCGCi^CCGGCGTGGACGGCATCGCCCTCGGCT 
SEQ ID 7940
^PIGRTSIFFMKG^ 

DHMDDETLKEA^C^ 

TOGEGCGAATGCftOT 

GTTCCTGAGTTGGCTTCTATT^ 
GCGAAGAAGCTGCT 
SEQ ID 7942
^EP^VmSAAIAN^ 
VAEIASIPSRBELI^KLLFVMQA^ 

G(^GCCGATAGCAT(X^CATCCCGACGCGAAATGGCAGG^ 
TGGGCAAACTCGGCGGACACGGAGGAAGTCCGC^ 

(^CCAAAATCAAATACCGCCCC^ 
OQCOGCC^ 

CGGACATCG6ACTGCCGCTGGGCAGC0GT 

SEQ ID 7944

I^VTOG^VIjTVAQOTjLPirePLSPtiP^ 

RRQIQQQNIJEPSOTGVSGSKGRPAQLraFRffilVLSDRIiISDIGIiPI^R 

GwvasGTTTGCCTO^nX^ 
jm'KXTKjAGCCGCTTTTGGCG^ 

GTGCcrow(mc«rrmcAccc«a^ 

TrCTGCGGGIGGGiAAGGCTTTATTGTAMCCAMGCGGT^ 
SEQ ID 7946 



VimBLGGIATOTAFIQHFIMflHVFFRPVF^ 

IOTAM!fflCMA»GCTOCTAAAQ^ 
CCTASOGTTGGTCAATTGGGTACCATCTlfiGGTC^^ 
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AGTACCGTACCGACMGGCAGGTATTGTTCATGCA^^ 
TGCTGCCAAAGGTCAGTACCTCAAA^^ 

SEQ ID 7948 

MAKVSKRimRSSVKAmYAIDEAIALVKmTAK^ 

SEQ ID 7949 

ATGGGCAACCTGATAGGCTGGGAA^ 
T«ym(mTTCTCCCACGGT(K 
GGGCGATGGACGGCAGGTGAGGTTGTOCOT^ 
GCTATGTCAATGAAAGCAGCGGGCATGAAC 

SEQ ID 7950 

MGNHGWBGTGIjVAGKQWITAKDD^ 

AMSMKAAGMN 

SEQ ID 7951 

ATGOTCCTTOAGCTGGTTTTCA^ 

ACCtKAAAACCGACCX^TTC 

GTCGAGCTGGTTCCCGTAT 

SEQ ID 7952 

HFPCSWFSXMKVCRIOT^^ 

SEQ ID 7953

ATGCAMCCGO^CGCC(X^GTTCGATTACGAT^ 
CAGGTCGTCTGAAAGACGAAATCAAGGTCTTG^ 
CGATTCGCTGGAAATGGCGC6CTTCX3GCG(^^ 
COTGATTTCGAA(XX>GAATGTTCTW 
AAGCGCGTGCCGACTGGGTGGTAACGTCTTCGGT^ 
CGAAAC6Q60GCG6MMGCTGTTGTGGCAG^ 
GAATMGGGCAAAlKXJTCA^ 
ACGAMTGCAAMGCAG(X^CC(&CAMGAA 
CAAACACGCCCTGACAGGCGGACGCAACGAAATCCM 
TTCA&CGGCATGGGGCCCGCC 
SEQ ID 7954 

MQTAAMSFDYIMPLIQ^ 

PDLEAECSII)I^PEEAFSAPCDQHPDRTVAVYANTSAAVKARAW 

ESPQSVIEIXaWVGSTSKL^ 

FNGMSPA 

SEQ ID 7955 

GTCGCAAAGAAAATTATCGGCTATATTAAACTGCAAATTCCT^ 
CATTTAATGCTGCAACCCAAGGTATGGAATCTGGTT^^ 
AAAAGOCOXXOTTG^^ 
TTG(^T«^GCTGTOG^ 

SEQ ID 7956 

\^IIGYIKLQIPAGKANPSPPVGPALGQ^^ 
LDAAWTIAGSARSMGLDVBGW 

SEQ ID 7957 

ATGTCGftAAAAATGGTATGTTGTAaGGCCTAT^ 

CTGTAGA6AAAGTTGTTGATATCCGCAATGGTCGTAAGACT 

CACTCCCCGTOTTCTCGTTT^ 

GTTGAATTTGAGGTC^TCAACAGGTTCGTGTAAAT^ 

GTA(&GAAACACC(X3TOA^ 

SEQ ID 7958 

MSKKWYVVQAYSGFEKNVQRILKERUREM^ 

VEFBVGQQVRVNEGPFADPNGVVEEVNyERHKIiOT 
SEQ ID 7959 

ftfGACAGlWGIff^^ 

CCGAATTCAAAMGGTGGTTTG^ 

(OTAmmi^TGTAmaGAGAAGGGAAGGT 
SEQ ID 7960 

SEQ ID 7961 

AIXmACOaTTGCGAC^ 

CCAGCOTCATGCGCAAGGCGGGATT6CGGCGGC6T6GT^^ 

CATOTGTCGCAOHa^ • 

OmCGCC(MACAC(^GAAGCC(m 

CAGCGTGOBAC^^ 

CGAATGCACGGGCGACCKTATCGCCATGGCGATACGCG^ 

CTGftpnXXXSAAGCOT 

TCGCCTCCGAAAC^^ 
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GCXZTGTACCGGACTGCACGGAGCCAACCGCC^^ 

CGATACCGACC^CCGCGC^ 
CAAGCCOATACAGC^^ 
SEQ ID 7962 

hvadytceavmqsliiqirrrpmirvyerqma™ 

LISEAVRGEGGILTNQSGERF^HYDRKAEIiAPRDIVARAIAACT 

ACTGUK^NRLASNSLLBCW 

QAAYRRRQNIGAHPNSDC 

SEQ ID 7963 

TTGAGaATTCGCCraAAA^ 

G(^TATGGCGGCCGCCCTCTTCTTTG^ 

ACCGCGCAGCMTACGCOTACGTTGTCGCCC^ 

TCACCMCGTGGftTGj^ACCTCGCTCT 

GGATGTAGCTGTCCAATGCQGTAGCCAGTTCG^^ 

GCTGGACAGCA(XjTC<£(^^ 

ATGTOTTCGC(y3GTTT(K^^ 

GGT(?raOTAC?reGC^ 

TTTTTTAGC 

SEQ ID 7964 

LSYWPLNLSI^IRRKHAGTHGTAAFAMKAQT^^ 

TAQQYAYVVARLTFVQQFAEHFNAGTGGFLGFFQTD^ 

AGQQVADFHFNQFQQ^^ 

FFS 



SEQ ID 7965 

ATGGCTMGGAAAMTTC£AAOTA(£AAACCGC 
GCGGCGCTGCAAAAGCTTACGACCAAATCGAQU^ 

CTGGCCCGTCAAGTAGGCGTACCTTACATCATCGTG 




AOTXXICCGGCGACGACTGCCCGATOT 
TCCCGAGCGTGCCGTGGACAAACCATTCCTGCT^^ 
ATTGAAATCGTOGTCTGAAAGAAACCCAAAAAA^ 
MCGTGAAGACGTAGMCGCGGTCAGGTATTGGCCAAAra 
flTTCGCCMCTACGCTOGCCA^ 
ATTGCGCCTA!PCGCTA!TGGAAGAAGGTC^ 
SEQ ID 7966 

MAKEKFERSKPHVKVOTIGHVmGKmTAALTO 
LARQVGWniVFHNKCDMVDDAELLBLVEMEIRDLI^ 
IEIVGIjKETQKTTCTGVBdFRKLI^ 
IAPIAMBEGLitfAIREGGRTVGAGVVSSVIA 

SEQ ID 7967 

GroTTTGGGTTTCGGATGTCGAAGGAAGG 
T?m?CTAACCAGCCAACCT 
SEQ ID 7968 

SEQ ID 7969 

ATGTCGCTTWATTACCGACGAGTGCATCAA 

GTTTAAA 

SEQ ID 7970 

MSI^ITDECINCEVCEPE^ 

SEQ ID 7971

ATGGGTTTCGGGATGTTCTTCX5T^ 

TCCTCGCOTGGGAAATGGCATCATTGGGGCA^^^ 

CGGGCGCGGATTA 

SEQ ID 7972 

MGFGMFFVNQDAVHRANHiAL^ 

SEQ ID 7973 

<I«rG(5GGCGAAAACAGCTGAAAACCTGT^^ 

SEQ ID 7974 

Ua«QLKTCWVSDVEGRAFI« 

SEQ ID 7975 

MGGCWCRHGQGKMICflQIM 
GCACAAGGTTTTTT 

SEQ ID 7976 

MASI/ffiSGSfflSQUIHSSVIKSDISFPFPI^QN^^ 
SEQ ID 7977

ATGA(X&CCTTCTTCCG(£^ 
AAAAAA(XC(n!TACCTCC^ 
CATCACCCGCATCGG?TTCAACCT^^ 
AAAGTGGGTGCGCAAAGCCACCTGAAGGCGATGTC 

SEQ ID 7978 

MSTmQTAQAM'AKHIGRFPLSEIJJQVIDWQPIEQy^ 
KVGAQSHU(A10iNLLKAA^^ 

SEQ ID 7979 

TTGCCCATCX3GCGTGGGCAA 

SEQ ID 7980 

LPIGVGKTEDLGSDIALYSCTAELTyAYLQNVWEQSGR 

SEQ ID 7981 

ATGCGCTTTGATAATCATTTCCGGGTOT 

SEQ ID 7982 

MRFIMFRVVKIlUiQPSGSXKFCAXIRARFVLII 

SEQ ID 7983 

TTGCCCATAC(OTCCGOCATCGCCG 

SEQ ID 7984 

LPIPSAIAVSVRAMVFARHI£SC)K3^SGyiiAI 

SEQ ID 7985 

ATGOTAACACCGTAGGATTGC^CATATCCAAGCm 

ACCGGTTGAAAAGCTTGGQATATCAGMTCTC^ 

GAAAAm<mOTAreCA(^GCAGGTTCA^ 

CCTAC^CGAGCAATACAGGCTTTa 

ATGCGGAAATCATCAAAGCCATGMTGAACAGCTT^ 

AGGCAGAATGAO^CAGCCGTATTGTTTCATCATC 

AG66GAAAA6GCAAACTGA(XIAAGTTTG(3CAACA6 

AGAAGAAGCCTAAAAMGTCATCATCGCAGCATTGATGOT 

SEQ ID 7986

Mram/GLDISKLTmTAMVGKTEHSAKP™ 



RGKGKLTKFGNRKLRAVLFMPAMVAYRIRAFPDn 

SEQ ID 7987 

TTGGGCATCGGCATCGACCGCAAGTTO 
TGACCTGCCGCAAAAACGACMCTACTTGGAGATGAT^^ 

GTGTCCGACACCXXX^jAGCTGAOCTACGOT 

SEQ ID 7988 

IXSIGIDRKFffiELKGVSWJDSI^ 

aaaegvgadirrcrrrrhqslwrfahrrgqnbr^ 

SEQ ID 7989 

atcggcaaagcgccagaggcttoatggcggcgac^ 
gggtcggcgacggtgtcgaagaagtgggtcag(xccatgcgto 

GCAAAATGCCGGGGTACACGTCTTCX^GTTT^ 
GAG(^TGCGTTTGAGCGAATCGTCGCGCGACA^ 
GTATCGGTMTCACGCCGTCGAGATCGAAGAGCAC^^ 
GCAATTCGAGCGGATCGCCTTTGAGCAGG 

SEQ ID 7990 

MGKAPEALMAATPAAAYVRTHALGGGKEDW^^ 
BDAmiVARHAFELLVKIiAVDADAQFFGKLFPCEVW 

SEQ ID 7991 

TTGAGTTTTGCATCCGCCGACGGACTGCGC^^ 

GCGCACTCGGCATGGAGGCGGCATCGCGTMTGCAAM^ 

MCCGTCTCTTCGGAO^ 

CGOTGMTGACGGCGCATATGTTTATATCGAGGCAGG^ 
AAGTGGCTGAA 

SEQ ID 7992 

I^ASADGIiRPTPOSVREKIjFI^ 
RLNK^YVnEAGRQPDI^DWIiTGYREGK^^ 

SEQ ID 7993 

TTGGGCTACACaXXmAGCCTrGGCGAAAT 

fK^GCGTATTCGTGCAGCACGACGGCTTCC^^ 

CTTCATCAAACAGGCGGACGTATTGCAAGGCAT^ 

«fl?TCCATGTATOACTCOT 

TGCATATTACTTCTATGACCGGCW^TG^^ 

CTTCEACATCAACTACCGCXKXXOT 
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MGCTTGCCGAAGAATT^^ 
SEQ ID 7994 



SPCIHSIIAAEIXHEmVB^ 
RQPHRQGHWIAQNIFYRSAI&SRER^^ 

SEQ ID 7995 

TTGAGCAGGGTGAAGACGACGTTTT^^ 

CCGCCCCAAGTTTTCATTTGGGCGAAAC^^ : 

G(X?ITITrcGTCCATGTCGAAAC 

TCGTCGGGCGAAAGCGCGGATACGGGGCGGATGTC^^ 

TTTCCAACTCGGCGGC^^ 

TTOJTATTCtnra^^ 

AGATAGCACTCGTCACCGGTGTAGTTGGTGT 

SEQ ID 7996 

LSRVKTTPPAD(a)FNQTAAVVDVEGIACPm 
AFFVHVETVAEEVDALQYVRLPDB^ 
FVFVRTGYAIiNHVFAVAFGEVDAVR^^ 
SEQ ID 7997

TA&ACGGTTTCGGTCAGCTTGO^ 
SEQ ID 7998 

UiCLVCLPAMIXSHVQ^ t 

SEQ ID 7999 

TO£CGCAGGCGCAGCATAACGCAC^^ 

GTAACGGCGCGATTCCTTACGCCATCTACMCTACACCM 

(XACTTCTCCAAACGCAACGGCAAATAC^ 

SEQ ID 8000

LPQAQHNAREQGLA(^YPMVTFTGIECHNEWEITFEE1HRN^ 

SEQ ID 8001 

ATGCOTCTGAACCTOAGACGG^ 

CCGCGCAGGCXK&MTCCGGATTTGTC 

TGCATCGGTAAAGCTGGATTTGTTAAAMTGGAATTG 

SEQ ID 8002 

MPSEPSDGIPDALAVYGRGAGAGKYPYBHCRYTVTL^ 

SEQ ID 8003 

TTGTTAATCCACTATAAGATTTCACCATTC^^ 

AACACAAACGGCTGCAAGAGAGCCT^CCAGCITGGGC^ 

CTTTCCCGACAAAACCC^^ 

TTGGCGAAAMCGACGTTGO^GC^^ 

AAAMGAGCTGGCTCTCACCCGCTGGGAAGCCGTGTCC^ 

ATTCTGGCAGGTATTGGftCAAAGGCGTOCftG^ 

AGOTCAAAGCCTTCGGCGGCAGCCAAACX^ 

ATCAGGGCTTCGMGCAGTGAAAGCCGCXE^ 

GGCGAAAAATAC6GCX3GCGCGACCTATTGGGACACCGAAGCOT 
CCGCAGGCGCAGCA 
SEQ ID 8004 



LAKNDVAGFSVELDM^ 

SFKAFGGSQTWVSNSPEAEVGGTPETPmVI^ 
GEKYGGATWJDTEAYAVEH^^ 

SEQ ID 8005

TTGATATGCATACCXXX^CAAAAAACTra 

ATGGGCGCGGCCCGTATGTGCAGGCGGATCTGGOT 

CAAAAACATCCGCACCCGCTCCGTC(^ 

MCATAAMGAGTTGGGMGAMCGATMTAGCGCTTCTGACAGCAA^ 
TGTCAACCGTOCGATT^ 
CACTTACCCMGCGACGGTAGTGCGAAAACTTCTGTC 
ATAGACGTCG(KCCaxmGACCTTG(^ 

SEQ ID 8006 

LICIRPTKNFSSLLFSSLLFSSLLFSSIjIiFSSAARAAGEDHGRGpyVQAD 
IIWAPGLTIiDAGYRYHYWGRIjENTRPK^ 
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TGOTACGGCATCGAATCCCTiTO 

SEQ ID 8008 

VPSDIGRPAAVLQAGRRRR^ 

SEQ ID 8009 

GTGC(KKX3TTCGACATCMC6GTAGGACTTACCG^ 
(XXaACGGTTATCCC^CGATTTC^ 

S V^ffI^LTAWRPI^ 
FSRPKPPDAAVAALRRET6RF 

SEQ ID 8011

MCXJQGCTATGTGCTCTCCAACCGGC^^
AGGAJ^^ 

CATCCG0GGTGCGGCGTTC6ACATCAACGGTA6GA 
GCAGACGGCCGACGGTTATCCCAACGATTTGGATATTO^ 
CACATTTACTGGCCKKTK^CGCGGGCCTGCACGA 
TATT 

S M^ATOFl?iDGRAVTGyVI^ 



mTGGAATCGGGCAAACGTCA 
CATTTTAAAACAATTGGTAAAQCAiXM 

SEQ ID 8014 

LFIRIVRSYWSDCaCiaWPFPYQSRttPFCRTAKPy 

S otaa&ca^^ 

TTATTGCGGTACACAAATACTCAACCGGTGTCAGAT^^ 
CAAA 

SEQ ID 8016 

VKHAASKSNSIELSLMV^^ 

ATACCTTTCGTAGraTGATGTGCTTG^ 

SEQ ID 8018 

MPFLRDAIYKVYTWVPNAQSATLLPIM 

SEQ ID 8019 

ATOTTGCTATTTGCCGTTACCTCro 

OTACCOfTSTTCATAATCA^ 

TtXMTCTTTTGCAAGATCCTCATCGGTGAM 

CAGGCTGACCGAACTGa^TATTAAAO^C^ 

TTAATCXXIAATATCGGCAGTTTCCAAATCAC^ 

TMCCTGATTCAT(mX3GCAGGTTAA^ 

CGQaCOXOTACC^ 

amracGACcmrom^ 

TCTCCAAACGCGACATTACCTTAATCCGGGAAGCMGC^ 
AAACTCGAAATGGATGTCGGATGCCCCGCTACGCAAGGC^^ 

CAGAAACG(mTrCT(^TTTO 
CCCCATCAGCACOT(7K^ 
AACACTTCCTTACCC^CTGCGACTCATTGT^ 



QADRTGNIKRRDTHIQHTRQRRRHIRVQ^ 



IBQHPVARIKPQIKNRITEHPRNQCIUCRI^ 
SEQ ID 8021 

TA 

SEQ ID 8022 

IAHHRENFLYTHLDBlOTMHAYDISFTITn^SAFOT 
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SEQ ID 8023 

GTGflTTTGTTCGT^ 

(^GCTCTGTCTGTATQOWCAAATOT 
TOTCAGTCTGTCTGATTAMGAGATOT 

FSVOiIKEIHGGV 

ATA(X3ACATATCGTT!EAC6iOTACTTA 
CamAGCOXOTMTCG 
TGTTTATACGGATTGTTOGTAGTTACGAOT 
TTTGGAATC^GGC^AACGTa 
SEQ ID 8026

aYGLFWmQIVSGFGR^^ 
SEQ ID 8027 

rcnmx^cG6GCGG(^ 

ATTGGACGCCGGTATCTTOGGTTm^ 
TTTGGGGCATTCGCGACCGTCGAGTTC^ 

acxmcmACGGCccAmrGOT 

mOTGTTTGATGAACGGTTTCa 

GGCGAMTOTTOO^^ 

OMGGGCJGQCCGATOOTA^^ 

CGCTCATTTCGG6CAGTTTTTTGC 

GACGGTMTATCGACGGTCGTTTGGTCGMTTTGGC^ 

T(XGGCGTGCG6TACACG6AAGTC6GGC6GATC(X^^ 

TGTAATGGCGGATTTCGQT H 
SEQ ID 8028

FG&FATVBym^FRFGDAF6SFP(H3FV0VAGWTMfiIT^^ 



S SXtoc(^t?aag^ 

SEQ ID 8030 

LDGDSTIKRFFAICRSLDGGFDAGVSDGIGRAGAV 

iGMGAGCCGCATTITrCAAACGCTAGGCGCAGACCOXACSATT^ 
TOGMGCCGCGCmQGCGGCCGCCGCCTGC^ 
SEQ ID 8032 



reYTOXXXXMXyUlAAAGCACGATTTGGATGTro 
ACAMTTGGGCTGGTTT^^ 

jayKHCGTCGMTATGGCGATGCA«XXTO 
GGCAGCGATTCTGCCfflTreTGTTCGCGTATATCG^^ 
MWraSTTGOATCTCCAAAGTCAAAGAATA^ 
CTAAAGTGTOTIGGAGSGTTACTCCGGTACAGTT^^ 
CGTAGGCCATCAGGAGGCGGGCAACCGGIACGGCGTTnGGCGGCGGTG 
SEQ ID 8034

IiYGTLIAVIVMILMPNSGSFGPGllASIAAIiSFGALMIAliLD^ 
SAFTISKVXBYDPBTYASYHGIIJVAANQEKANWFE^ 

S TTO«EG^TmmATCCTCGCGTOTA 
GCAMCATGTCGATGCGATCCTTGCCGCTGCCAM 
A(XXX3GTCTGAAAMCATCMGCCGCAGCGCGTCGTGTT^^ 

GOaARACGJiAKK^^ 



OTATAIPXOTACCGACAGCGTGMTTTGGCTGACGAAGIXW 
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ATCCAAAMCGCCCAAGAAGCCaT(^GCGA^ 
CGfACOOTGCCT^ 

TTTAC(&AGAAAGCAGC($(X&TGM(&GAGC^ 

G00GCGCTACAAC6MGCCACACTGGTTAA 

AAACGOTCATGCCCACXIGACACAGGCGACATO 

CCCGAAATGCXXjCGAACACAAACTGCAAATC^^ 

GCCGAACGCATCG(KAAACCGGAAGOCGAACAG(^ 

CCAACTACCXX^TGCAftA^ 

6AAAAAGTCT60CC6CAAAAAGAATGCG6CTG6AAA 

SEQ ID 8036 

LGGDFEIIASYGHVRDIiWKSGAVDPDNGPAI^QIJSMKSKHVD 

ARRALDYLVGFNLSPLLWKnRRGLSAGRVQSPALRLICER^ 



RTVACQMTPJ&FDQ^ 
KRFMPTDTGDIVNKFLTEHFAQYVDYHFTAKIiEDQ 
AERIAKAEABQVEUXJRBCPK^ 
EKVCPQKECGWKBQIEPPAPQE 

SEQ ID 8037 

CCCTGATGATGATGGMGTATTGCTCAACAGCTCC^ 
GCTGATGC^GTATCTGATTGAAGAAAAAGCCG^ 

crcxrocrmATGG^ 

SEQ ID 8038 

MTEVIAYIJEHFQDFDTCPPPIMiGMLI^ 
LLLLWANKSEIjPVLVGDELMSAiiLLDNmMN 

SEQ ID 8039 

MGAAOCGGGK^^ 

CCCGTGTCGGGCGTATOTTATTTATAGAATA I 
SEQ ID 8040 

M^VETHnQYAAIXSmCRPFRRHFCGKR^ t 

SEQ ID 8041 

ATCCAAMXXSTCAAAGCTO 

SEQ ID 8042 

MQTOPRQQESNLYFTLRKifFFYPIiNYGEPK 

SEQ ID 8043 

ATGAAGATMCACACTGCAMTTAAAGAMGMGTACAGAAAGAACCGCTO 

A^CGGTO^ 

AAAATCATGCCGGACGGTATTGTTT^^ 
ACATTMCGGCATTGGGMCTTTTGGAATCAGGCAA 
CACACCGTCCCGGCAGCTAAAAATCCTGCGGGJ^^ 
SEQ ID 8044 

MKiraCKLKKEVQKEPLRSFVPETO 
KIMPDGITODSPGSRGKSDAGGFTRCRIimSK^ 

SEQ ID 8045 

TTGTTTGACCGMTCGAGAAAACTATGTCCCAACM 

GCGCGAAMTCGGCCTCTTGGGCTTGAACGGC^^ 

CTATCTGCCXX2AAGAGCCG6AGCTTGAT0CXX5AAAA 

MTCCCGAT(XXK3ATTTTGACGCGCTGGCGGA^ 

TG(X3CCTGCCTGATTGGGATG^ 

CCACTTGGATGCGGAATCGGTGGAATGGCTGGAGCW 

CTCGACXXXX3GAaC<mT^ 

AATTGGAATGGGTGCGCCAAAATGCCAAAGGCCGCCAAGC^ 

TATCCCTGTTGCCGAGCGTTTGGGTAACGAAG^ 

G(K^TCATCGGCCCBAA(X^GCGGGTAA^ 

AAAGCCGCGAAGGTTTGCAAAACGACAAA^ 

TAAAGGCAGCGACCAMGCAAAATCG(^GGCAGCTTTC 

GATCTCGACGTGGAAACCCTGCGCGCGTT^^ 

GTGAAGGCGACTCCAMTGQGTGTTOT 

GCGT 

SEQ ID 8046 

I^iaEmSQQmSMtRVSKVVPPQmiKDIS^ 

NPDADrPALABEQGRLEMIA^^ 

UHGBGIPWKGNySSWLEQKEKRUSN^^ 

GIIGPKGAGKSTLFKMIAGKBQPDSGSVKIGQTVK^ 

DUJVETIiRAIfllAIJ^ 
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SEQ ID 8047 

ATa&TTOATGAttGAGGAC^ 



ACACCaXXmATCAGOT^^ 

AGGGCTGATTGTCAGCX^GTTCCCCATCGGCACGCG^ 

TC(XXjTTCXX!TGATTACCG(XX3GCCTC 

AACTGACGGMTGTTTGGACGACATCCIGM 

(^TAC(KXXXXKX^CCTGCC6^ 

CAGTTGGCTATGCCTGCCGCAGATTT^ 

SEQ ID 8048 

ITOFMTEDEmWLQIiAFT^ 
SAAIVGSRHATPQAMMAKDFGRAL 
SGSHTAGIMEMGREVMAVPGSIDNPHSKGCH^ 
QLAMPAADLYAALLELEHDGSVAAMPGGRYQRIRT 

SEQ ID 8049 

TTGGCTGATTTTAMCCTGACTTTGCG^ 
TGCCTFTGGCKSTTTGTGCT^ 

CTCAAACaXXrCCOSTCGG^^ 

SEQ ID 8050 

IADFKPDFAWIimPLCFLAPGFGSGLVPFAPGTO 
LKPPPVGVCTKNLHGGLGIMAKmAAVm 

SEQ ID 8051 

ATGCGTAGCAGCGATATTTTAATOTAGAC 

AGCTGCGCCATCAGGCGCG(X£GGCMTGGTGC^^ 

GATGAGCGGGCATGCCAGCATCGATACCGCCGTAGAAGC^^ 

CACGGTGCGGCGCAAATCGAAACQGGACCGGTGTTCGACAM^ 

CGGGCGAGGCGGGTTCGCCGTTTGAAACCGTO 

GGCGGAGGGCGGCGTTTTGTATGTCGGCGACATCGCC(MTACAG 

AGCTAGGCGGCAGGTTC^ 

AGGGGATTGCCTGCAATGTGGCGGAAAGCCAAAAMTC^ 

AATCTGCCGCTGCGCGAATTAAGGGAG(^GGTGGAGC^ 
TTTACCGCAAACTCAAACAGCTCGGCATCGGCG^ 

SEQ ID 8052 

MRSSDIUVDDEVGIRDLLSEILQDB^ 
HGAAQIETGPVFDKLGNSAAIQEMNREVEAAAKRTSPVFI/TC 
SYAAGSDGISCEEKMGLFSES^^ 
NLPLRELREEVERRYFBYHIAQEGQNMSKVAQKV^ 
SEQ ID 8053 

TTGCAMCAGGTACGGATAAGGATGTCGTATTGGGCATAGGCGA 

TTTTTG(^GATGTCAAACCTGAAGGCn^^ 

GGATGMGTATGGCTGGMCGGTTTTGCGGCAGOT 

GGCGAATTGCCGAAGGGTAGGGCGTTGCGGCGTGATG^ 

TCTTGCCGGATGATGTGTTttsCCGAATGCGAA 

TGCGCAAGATTTGGGGCATATCCTGACCGCTTC^ 

ACTTTGGCGGGCGGCGACGATTAOGAGCTGGTTTT^ 

GAGGATGCCGTCTGAAGGTTTTAGATG<X^ 

SEQ ID 8054 

LQTGTDKDVVLGIGDDAAIYRPRBGFDLCFSAM 

GELPKGRALRRDAAVAGDDIWVS^^^ 

TLAGGDDYELVFTAPESCRSRVLDAAERQ5VPVTCIGH 

SEQ ID 8055 

ATG(X3(X£TCGT0CTGCrcTAC 




CACGAAGCCCTCGMCGCAGCOTAATTTGAGC^ 
GCAATATCXX^GTGTGCTGGAACACT^ 
TCCCGACAAAGMC^TTGGGAACAGATTCA^ 
TACK5CGCTGTTCTTCCGCCAGCCG 

GGC(^GGGTCATTTCA(XX^ 

AACCGCCGGCGCGAGGAAGCCGCCXIGTCAOTACCTO 

AGAOTTGGGGA3XKXX3CTCGCCCCCTO 

GGACAAA(XX£TCCAGGTGGAATA!P^^ 

GTGCTGATACGCGCGCAAAAAGAAGCCGCGTGGGGTGAA(^ 
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(^XjGGAAGCTGGAOGATCAGGACGCGCJ^ 

GCTCAMCTG6AAMTCAG<^TTTGMCGOT 

JKXSAGCGOCAIGOG^ 

TGACGGTTTGCGACAACGGCAAG^ 

TGGAGAACACGGCGGtt^ 

SEQ ID 8056 

IOTSKfflREQIMRRF^ 
HEALERSliNIiSKSMiDU^]^ 



NRRREEAARHYLECVI^TTGVVVFDK 

VLIRAQKEAAWGEVAKRLAHEIKNPLTPI^^ 

TTAMRQVUJHIFKNAAEAAEEAE^EVRVKSETG^ 

SEQ ID 8057 

ATGGCTlTrATTACGCGCTTATTCAAAAGCATO 

TAACOaCGGCGGGCGGCTTTCCATCAGCAG^ 
CCCGACAATOX?nOTATC^^ 
TCCACCGATTACX&CACTTTGGATGC^ 
GCCTGACGCTGKTAOTCAA^ 

SEQ ID 8058 

MAFmiiFKSIKQ^VLLPII^ 
STDYDTLDAAIiRATGAVANWKVL^ 

SEQ ID 8059 

GTGTTTCAGCGTATITCACTTTTAGCOT 
CGGCAACTTCCGCCACCGAT 

SEQ ID 8060 

WQRISLLALAIPYPYAPmFQHTroiSKHETARRPEC^TSOT 

SEQ ID 8061 

TT<XXGCTTTG^^ 

AAACACTGAAT6AGOTCTTTATT6TTIX1TA 
SEQ ID 8062 

SEQ ID 8063 

ATGA(^TGGCACTKXXXAAAMOT 

AAAACGGCGCOTGCAGGATATCGCCTACGGCTO 

GGCGGCGTTGTACCAACTGCATTACACGCGCAACGCGCC(^ 

ACMCATCGCCGCCGCG(n<^TCCC^ 

GGA(XiAATATGCGGTTACGTTGGMGAAGCCGTGCCGGTA^ 

AAAGACGGCGAACGGATTTWK^CGCGTGCGCOGC^^ 

W^GGAC^TATCGCCCGTTTGGGCTTTC^ 

CA(X^CTTCCGGCGTGGCGCG<K 

(^gggcggcagmtgctgcttgccacx:tgtto 

CGAACAAACACCAAGATGGCTTTTATTAC« 
SEQ ID 8064 



RRFLREREKLAASCKKDDVAKHNLPIiWW\rAY 

nXSERILIftCAAPGGKTGHI^ 

QGGBMIMTCSVFVEI^ 

SEQ ID 8065 

ATGATGATAAAAraTATCCGGCATACAGTGCO^ 

AAAGGAraCCTPGCTATGGGCAAACXCACATCCT 

SEQ ID 8066 

MMIHPYPAYSAAHIGHDSIHFBfiKaK^ 

SEQ ID 8067 

TTGTTMTCCACTATAAAGCGCGGCAATGGGACACAA^ 

CAAMGTOGGGTrTGTTTCACAT^ 
SEQ ID 8068 

SEQ ID 8069 

TTCTCCGACMCOTACGCACACGCAGGAACAC 

CGAAtXXX^mmrrCAAAG^ 
•jXXK^reCTGCCCCGTTGGCGCGGCGCGG^ 
GCGMCACCGOTAOGCCATCmCCTACCGATACCG^^ 
GAACGOGGTCAAACAGCOCGAAGMQGCGTAAW^^ 

TTCOTtfmGAA^^ 
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SLLPHMGAAPIQRAI^^ 
PVPAAWVEYQGKPMKI^^ 

- 

SEQ ID 8072 

MIFERS INGWKYRAVMBIRTTQKBLALNSLOTDRIE 

GTGGTTCGAGAATGCCCGCGCTGCCGTTTTTtt 

GGGTC^TGCCGCCATGATTATGGMCTTO^^ * 

SEQ ID 8074 

I^EEGAHGKSPKCLGGNI^ 

GCTflXSACTAWGOO^ 

CG(XX^TTTCCGATCGAA^AAACCTGAACGT^^ 

AAAMTCCAAGAGTTTO^ 
TTTOTAAAGTTATTGGAAGAAAAAGGAT^ 
jUtf&OTGGAACCAGTCAACT 
TTGTGCGGGATTATCCGATATCAATT 

KSWNQSSIRANGIflYAMF^ s 
SEQ ID 8079

TTGmCACATCATCGATACCGGCC^ i 

SEQ ID 8080

IfHIIOTGRRFHJIAWPGTKSANIifflRQDIT 

(m(^CTATGGAGAATGCTGGAAAAACGOT 

GCAGGCTCCTCAATATGTT^ 

AGTCGAACCAATGTCCAATCTGTCCGOT 

ACGGCCTACCTGCT^ 

TGAGGCTCTGATTGCATGTATCGAACCTCy^^ 

SEQ ID 8082 

HreOLKLSALFVALLASGTAVAGEASVQGYTVSGQSNEIVRKMYGECVnTO 

SEQ ID 8083 

ATGAAAAACAGGCGGCCTATra 

^fi^nwryiaavvrapqv^^ 



KVEAIiNEKGEKFTLEADGLIMC^ 

GTCGTATTATAACCCTGCTCTATATTCAG 
SEQ ID 8086 



AAAAAATCCCCCTAAAG 

SEQ ID 8088 

VWVCTOWIJJIVYNIjSYFL^ 

ATtn^GCCGTGTGATAACAAATGTTCCGCATTT 
SEQ ID 8090 
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SEQ ID 8091 

AIGAAATTACAACMTGAMTACGCOTAGAA 
AAGMGAAATCGGCATTCAGATOTTATCCGC^ 
AAATATCGGCAGC(&(?ITrACCGGACAGGAC^^ 
GTCMTCTGACX^TCAAACAGGGAAGCCCltXX^ 
CCTOTACGACTC^^CGCGC^ 
(^TGCGGGCAGCAGCATCGCGCGGGCATTCGCCA^ 
GGACTGATGGCGAAAATGGCATACAACCCGGATACGGACG^^ 
GCGGATATGCCTACGACTTTATCCAAGCXSTIT^ 
SEQ ID 8092 

MKLQQLKYALEVYRHNLNVS 

mTIKQGSPAAIAQHVTSGESDLAIVTE^ 

GOiAmYNPOTK^LQLVDAAHL^ 

SEQ ID 8093 

ATGTTCCGCATTTTAACATCGAATTATCCCTACCAT^ 

SEQ ID 8094 

MFRILTSNYPYHH6NYEKQAAYP PKERENYGFTEYRAGL 

SEQ ID 8095 

ATCTTAAAATGCGGAACATITG^ 
GTCGTAmTAACCOTCTCTGCGCG^ 
CTGGGGTATTTCX3<X&AAAT^^ 
TACGTTGGCGGCXaAArcGCGCCTCGGCTTGGAACA^ 

CCTGAAA(MOOGGG(XETTATCTGACCT^^ 
ACCGACTCCGCCCWmCAAC^ 

TGGTCGAMCCGCCATTTCCGAAATACAGCAGGGCGACTAC^^ ■ 

TTCCGTCTTTGMGGCGTGGGGGTCGGCGGACAGTTC^ 

CAGGTCAACCTCTCCAACMTTTGACCGAA 

CTTACGCCATTATTTTGGAAAACATCTCCGATAOT 

CCAA 

SEQ ID 8096 

MLKCGITVITRHIPRGCim^ 

YVGGEPRLGLEQTiXtfPVVKISPDKEVSGYGIPMDVN^ 

TDSALEQRSKQAEEFUiK^^ 

QVNLSNNLTEE PKSRDTVEL I ST P AEE VG LAMVYHTAPKLAYAI I LENI SD IS EG DTAANPGRDLDNMPDQGRAR VDSD PFQ 

SEQ ID 8097 

ATGCAAMCCCTGAOTAGTCGGACAAAATGOT 

SEQ ID 8098 

MQNPDLVGQNAFYPMESVSARTEITAAVSLADRYSELTKTGTAIjPR^ 

SEQ ID 8099 

atgtgt^caagcacatmtcgggat^ 

gcx^w^tctcgsgctcctgcocaaogagt^ 

tgccatccettc^^ 

agttgcgacgcggtcgccttggaaaa^ 

aaattgtgcgcgatcaaggtgttatgctgtot 

CTTCATCTTflAGAATCTCCCGO^^ 
ATCAAGATTCTCAATCTGCAGGTGCTGAAAAAAA^ 
SEQ ID 8100 

MCQQAHNRMAKHTOYGMASHNTT^ 

SCBAVALENSEVCI&PFTHIEEU^^ 

IKILNLQVLKKMVSGCSEAI 

SEQ ID 8101 

ATGATGATGCA(X£TTCTGTCCAAAGT^^ 
CTCCGCTTCTGTCCGCATTC 

CTGCCCGACGACGAA6ACGCGCGCAC<^ 

TCGAGTTGTCGGCGGC!OTGGAAGCXjCG^^ 

TCAATCA6G(XjTTGAGG6AAATC1^^ 

SEQ ID 8102 

MMMHASVQSRFAPILYVIJFFAGFLTAQI^ 
LPDDEE^TIAAHAKMI^ 

SEQ ID 8103 

TTGCAAATTCTGTTCAGTTATACCATTTATAC^ 
CCTCAAGCGfiGGCGGTCGATTCGG^^ 
GGCGATTGAAATCMCGTGGAAGACGACG(KZAGAACC^^ 
GTTACCGCTGTCTGCTTAGGT 
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SEQ ID 8104 

LQILPSYTIYTDRSKPMTKLYA^ 

VTAVCLG 

SEQ ID 8105 

ATGAGCACCTTCTTCCGGCAAACCGCCCAAGC^ 

AAAAAftCCCGTTACCTCX3GA6ACCGCCGCGGC^^ 

CATttCCCGCATCGGTTTC^ 

^GTGGGTGCXSCAAAGCCACCTGAAGGCGAT^^ 

SEQ ID 8106 

MSTmQTAQAMPAKHIGR^ 
KVGAQSHLKAMCLNLLKAANRLSAPAAA 

SEQ ID 8107 

ATGAAAATCATTCAGATAC 
ftCCGTTT(XATGACGGTTTCCG^^ 
CTACTACTraSGCTGaAC^ 
GCXX^CTGCACTTCGGCGGCGGCACGCCGACXn^ 
XCGftOCCQCGCWAGTC^ 
AAGCTAC<&AGAAACCAAAGAAGTCATC^ 
QATA(XGTTTTGTCGCTCGATCCCGACCGCX^^ 
TCGATATGCTGCAATACFGCGTCCAAACX^TGAC^ 
• ACGGUyCTTCCMGGCTATTCGACCT 
GCX!GCCATCGACGAAGGCAGACTGCCCATCATG(^^ 
AC^AAAGTATGTTCGGCATCCCGTTCX^CCGCT 
AGGACGCTTCX^TCCGCAACATC(^ 

SEQ ID 8108 

MKIIQIQNNHNTODRPEFDRALI 

AQIiHFGGGTPTFLSDEQIERV^^ 

IMVLSLDPDRIiALYHYAHLPHVnCPQm 

AMDEGRLPlMRGYQIiNQDDILRI^^ , " 

SEQ ID 8109

TTGC(KX)TAACGTACAAAAGCCGTCTGAAA 

ACCMGATACCAAACAG<aTATCCC^ • I 

SEQ ID 8110 

LRLTYKSRLKGFFRRHF^ 

SEQ ID 8111

TTGCACTACGGCAACGCGGTAACCGTGGGCT^ 

TGTTCGCCCCCGAAGCG(^TCMCACCATra 

CCACGOX^GCCGGTCAMAGCCGGT^ 

SEQ ID 8112 

Iiff<^VWGFffliPSK^^ 

SEQ ID 8113 

TTGTTGATCGAATGTGCGGGCTOH'GQ^ 
TGAAAACAOTATCTOGGC^GCAAM 

SEQ ID 8114 

IiLMBCAGIiCUfflNIiFCRIMI^ 

SEQ ID 8115 

ATGCCTAATCCGCTTTACAGA^ 

TGCCAAAAAAGGCGAMCGCTTGCCGATACCGCCCGTATO 

CC£<^TCAACGCCGGC^ 

TGAMTACGGGCGCACCGTGCATTCGCTGTGra 

iK3CC£ACTGCCGATA(XGCGCCCT^ 

GGCAAATOAACCTCGMGCGTCTATGCTCGCC^^ 

ATTTCGAGCAGGCGACCAAOCKXX?ITrATGC^^ 

SEQ ID 8116 

MPNPLYRQiniSISDLSREQIiEYLIjQTAIjKLKAHPROT 

PVINAGIX?rNQHPSOTLLDLVTIYETQGR^ 
GKFNIiEASMIARAKP^^ 

SEQ ID 8117 

AJ(m6ATAACCGTQCCTTT^ 

TTGGTCGCCTGCTCGA^ 

CTTCGAGGTTGAATTTGCCTTGGATTTTGG 

q(X3CTATCGGCAGTCGGCTTCTTC^^ 

A06GTQCGCCOTATTTCAAGTCGOT 

CGCCGGC(nTCATGACGGGGACGCGCGAAAACTCC^^ 

GAACCGATGAGTTTCCCTTC^^ 
TGTAAAGCGGATrAGGCATTTTTGCXXXXXIT^^ 
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GCGOTATTATCGCACATTTGCCTC^^ 
CRGGAMCOQOM 

SEQ ID 8118 

MVDNRAFFDGFOTBFIiRFHIGSSHFFVQHQ^^ 

AVSAVGFFQLFGNVVGH^^ 

RFAFFGIGIRAVRKAIffllffAQlIiHRR^ 

ARIIAHLPPQSFTARILYFSTOISKHARISIRYNEARHFFSGSRHVNI^ 

SEQ ID 8119

ATGTTMrcTGCAACCCCTACGAAGTCGTCAIXX^ 

ACCGCCTCTCCTATTCGAAATGGGTGCGCXXX^ 

CXXICGACAACGACCTGCGCGTCATC6AOT 

GCXIGCCGAGAAAACACAGGAACAGACCGCCTCGGGCGC^^ 

ACATOXKXXX^AaGAACAGATCAACAA^ 

CACCCTCGCC(KX:G(KXXmCATra 

OTACCAAAATCCACOTAACGTCCACG^ 

SEQ ID 8120 

MIJCNPYEVVIHGTTSSCft^ 

MEKTQBQTASC&SYVLREIC^^ 

VTKIHIiMVHVNHDRTDAHRLy 

SEQ ID 8121 

ATGGCTCAAATCCAAATGAGCGCAAATGTTAAAAC^ 

CCGCCTTATTCATTATTGCCACGATGTTCG^ 

TTTGCTGATTGCGGCGGTATTTGAG(^ 

GTCTTTATTATGATGTCCGCGCTTTTGGCGGC^^ 

CEMCQCCT^^ 

TTCGCGCGTCMGAAAAACGTCTTAGATTACAACGCTO 

GMGCCGAAAAAGTCGAGTACGCCACCAAMTGGCGCACGATO 

ACCGTAAAAACAATGTCGATTOT 

$XX^TGAGCAACGTCAACAGCTTCCTGA 

ATGTTTTCATGGXTGCAGGTCTTTAC£GC^^ 

OTOXGC(XAAAGCGCCGTCXX^ 

GCATCCTGCTOmmTACCX^ 

CTGGTCAACCGCAATGCCAACTGGAAACTGA^ 

SEQ ID 8122 

MAQIQMSAITOTIKAWAAMLVGTVGYFIYV^^ 

VFIMMSALIiAAALWLIiFASI^GLPVS^^ 

EAEKVEYATKMAHDAQIYDEPEFDP^ 

MFSVMQVFTACGFAPSHGANDIANMGPm 

LVffilKAHWKI^IGIiAWVW 

SEQ ID 8123 

ATGATTATGGGTGTATTMTTTTTTTOT 
CATCACTATCTATTTTATTCA 

SEQ ID 8124 

HIMGVUmiVPII^ICATIlOTIIHKFKLPKm 

SEQ ID 8125 

GTGGAAAAATATTATMTGCGCCCGCAACATC^ 
(»!H3CGTCTG0CATATC^ 

SEQ ID 8126 

VEKYYNAPATSSSRSVIQTVSGSSFAFGQVV^SDAVQKEGCVCHIPffi 

SEQ ID 8127 

TTGCGGGCGCATTATMTATTTTT^^ 

TTTGGGAG 

SEQ ID 8128 

LRAHYNIFPPVU^FTLYLHCGGKSVYERQYTLKWRF 

SEQ ID 8129 

TTGCTTGGCCTATGTTTGTTGACCC^^ 

(EATTGCCACGTTACTGAATMTTOT 

ACTGGGTTTfl^GCA<^^ 

AAiCXjGAGAAACTGTTTTGCTGGCT^ 

TCCCAAAAAMACTGATTTTOT 

SEQ ID 8130 

tLtn£LLTPASYQNIHVRSWKARRDSGIVKQDIiDFS 
NGETVUiADPSLGHVSTSKSQFLSAWraK 

SEQ ID 8131 

ATC GCGTTTTGCflCCAGTT^^ 
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ATACAGCTTGCCGATTTGCCGCT(£T^^ 
AAGCCCTGTWX^ 

GCTGATGGACGCGTGGACGCAGGCACACTGGCAGC^^ 

TCACMKXXACCCAAAAAGCACGGGGCGCGMCTGTTTO 

AAftCCGmGCGAOX»GTOTM 

GCAIGCAAACCGTCTATTCTGGGOGCGG^ 

SEQ ID 8132

MAPCTSLGVMMETQIiHGIMSGT9IDGADAVLVR^^ 
IQLADI^LLAELTRimGDFRSIfflLAAG 
SQPHPKSTGRBLFALNWIjE^^ 
ASKPCILGAGYYY 

SEQ ID 8133

OTOTTGGGCGTOTACMGAACCG^ 
CGGOTGCCAMCACATTCGCTCOTC^^ 

SEQ ID 8134 

VLGRLREPFPWSK^PYTOLIHyKSm 

SEQ ID 8135 

MCCTTTCCGGCAACAAAAGAm^ 
CGTCCCATCAAAGGCACGCCGCATCCCGCCAGCC^^ 
CCAACGGACTCAAAACXXXTATGGTCAAAAGAATGATACC6AG 
GACan'MTCGAATOTOTiraX^CA^ 

SEQ ID 8136 

HLSGNKRQDVGRKQOTRDTAHRCHPD^ 

DGNRIVVADHPQQNQRNQRQGNKTEIH 

SEQ ID 8137

ATGAAGGTTTTGAACGGTTGGTCAGATAGGAAGATGTG^ 
ATTTGCAGOaWXXSTGC^ 
GGTTCTGTTTTTGCTCAA^ 
AACGCGCTGCCGCTGCTGCTGAA(X3GCGCGGACGT^ 

GGTTCAGGCTTCAGACX»GCATCGGGCAGGGTGGC^ j 
SEQ ID 8138

NALPLHjKGADVVNTGNARYVLTAI/aiPFPAVSCI IGLVGRFRLQTASGKVAKAGGAVKAGG 

SEQ ID 8139 

ATGAAAACCCTGCTCCTCCTa 
OXGCGCCGCCCnmAGAMTGGACTTCmX^^ 
CXXACmTACGCX»CGGCTAOCACAACAA^^ 
TCGCTTTTGMCGOCCCCGCCGCCQCCCTG^ 

CGTCCGCAGCCGTACCXSAACTGCAOT^ 

GCCTAC(MTCC(^TACmGAACAATACGa^ 

ACACAACCGCCWAAACCGTCXXXaACT^ 

SEQ ID 8140 

MKmLLIPLVLTACGTIiTGIPAHGGGKRFAVEQELVAASSm 

SUiNAPAAALTKN^KRKGERSAGLSVNGTGDYRNE^ 
AYESQYQEQyALMPYSVGKTVKASDRLMVDFSDITPYGIOT 

SEQ ID 8141 

TOXX?ICGCCTT(K}CCTACT 

ATAAC 
SEQ ID 8142 

LPRIAVIiPVLSAASSPCPDLNLIH^ 

SEQ ID 8143 

GTCKjTCGGGTATAGTGGATTAAATTTAAATCAGGACAAGGC^ 

SEQ ID 8144 

VVGYSGIMjKQI^TKPQTVQIVRQGEATPYWFKFNPL 

SEQ ID 8145

TAG CGGCGGTCGO^^

AAC(XXA(X^ACGMaXX«^^ 
GCTCATOAACAGCXXK3CAATCCMGGCMTOT 
GGmCGTTGACGACGGCTTTACCGTGTAC^^ 
ACGAA^CACCTATCACGTCAACGGOO 
CCGCGCCGATGAAGCX^AACAGAAAMTGTTO^GC^^ 
GGCGAAGCCTTGGGCATAGG(XIACATACTGTACGG 
GCAGCGCGGCGGCXnT^^ 
AAACCTGACAAAGOXSGCAAAACCGG^ 
GCxx^TATTCAAGCCMGi^ACC^^ 
CftgftCQOCATCGGCAACAGCCAAACC^ 
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CAAAACCCAAACCGT(XX3CAACATCG1^^ 
TCmTCATMTGATTOTCCGOT 
GAAAGCAATATTTAGATTCCGACACAGG^ 
AAAAAGGGTAAAAAATGGGAAATGGACACCAAM 

SEQ ID 8146

VKPLRRLTNLLAACAVAAVALIQP 

GEMiGIGDILYGTRYMDKAAMRNIAPLPAEGKFAHGGLGSA^ 

AHIQAGDRVLSKDEASGETGYKPVTARYG 

VWHNIXPPKPKPmiAQQRKEEAMB 

SEQ ID 8147

ATGTTTGATGGCGAAGTAGAAGCAGATGAMGTTATTTTGGC^ 

TTTATACGGTTACAGTACX)GAATACTCAAACCGCTACTra 

GAGCGAATTTAGCCA!ITOPAGCTT^^ 

SEQ ID 8148

MFDGETEADBSYFGGQRKGKRGRG^KVAVFGLIiKR^ 

SEQ ID 8149 

ATGGGAAATCX^(^CCAAMTMTCAATCTTOOT 
TCGTTATGGCGMTTTCCTCTATOCAAATGA^^ 
TGAAAAGGCAATTAGCCATCATTTAGAAGACTITTCG 

SEQ ID 8150 

MGNGHQNNQSSfcEKRIFYLEHSGQTCffiM 

SEQ ID 8151 

TTGCAGGTAAAATATCAAACCATT^ 
AACACCCGMTGAGACXXja^ 

SEQ ID 8152 

LQVKYQTIKIMIFTEYSAAFTKWrSNRFQISVI^ 

SEQ ID 8153 

'ITGAGTTATCTCTCTGATAGAGAAAATO 

CGAAGCTGGCAMGGCGGCGAMCCGGGGAAAGCTGCGGT^^ 

GACTTTTAGAGATGTTMTCAAAGTGC^AAAATT^ 

OTMTTCACATGCTGAAATTGG 

TIKX!AGCTGCTGCT(^GCTTCTO 

A 

SEQ ID 8154 

LSYI^DRENSIEEAVDRWIQENPNAAETVEAVFNVAAAKV^^ 
ANSH^OTQQAYKAGETKGAStfBffVSGKDVCGYra 

SEQ ID 8155 

ATGAAAACATCAACAATTGTTT^^ 
ATTTTGAAAAAAAAGCCGGAGTOTTGTTTT^ 
AGAAGAT^GATATN^CT^^ 
GATATMTTTCTGMTTTATTATGAAAMTAAAC^ 

SEQ ID 8156 

MKTSTIVFC^rcTDNGERIQIPILEiP^ 
DIISEPIMKNKPITIMI 

SEQ ID 8157

ACGTCGTCCGCTTCAACTTCAGCC^^ 
CCCGAAAATCCGCGTCGGCAAAATCGC^^ 
CGCGAOCTGCCCGACGACGTTGCCGCAGGCGA^ 
TCCTGAAAAGCAACAAAGGCATCMCAAACGCGG^ 

GAAMCCTTGATGAAATCATCCTTGTC 

GGTGATGTGTKX)GCCGAAM 

ACG^GTATCCCGM(XXX?TCAGCACCMOTA(^ 

AAATCAGOCG(XACAATATT!A0OT 

CGACAOKXXH'TGAACGAAGTCG^ 

GAAGTGCTGCGCGTCAAA 

SEQ ID 8158 

MNQTSRDI/TRISHNTK™^ 

RDLPDDVAAGDVLWLDDGLLTL^ 

ENMEIIIAGDGIMVARGDIiAVEVGHAAVPAI^ 

TEYPEAVSTIMIAGGAVSVAiUVHAKAIVALTESGSTAFEIS 

EVLHVK 

SEQ ID 8159 

TTGAAAAGTTCAACAATCAAAAGGTCX^^ 
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SEQ ID 8160 

LKSSTIKRSPEIVFHPQTTYFIffiiK(^OT 

SEQ ID 8161 

GTCCAAATCCGGC^CATCGGATMM 

GCGTCGAOCATCTTCTACAA^^ 

SEQ ID 8162 

VQIRQHRIKSSAYTKAKQGISDCniFYLSTVRI^ 

SEQ ID 8163

GTOTCCGCCGACCCG<^^ 
AAATGAAGGAAATAC&TCGTCTGAAAAT^^ 

SEQ ID 8164 

VVADPDTG^mVCEFDQGAAVmGVG^ 

SEQ ID 8165 

ATGTC(KXX^3CAATCA6CGGCCCXjACA^ 
CCGCCAGCCMGCCACGAAAATACCGATAAAAACGGCGGATATO 
CAAMGMTGATACCGAGGTTCMCXXXnOT 
ACCACCCCCAACAGAATCAGCGTAAO^ 

SEQ ID 8166 

M^SGPTGIPPINATPMOTISKPVTim^ 
TTPNRISVrRNRGTKLKPIGYDTLTOSFAS 

SEQ ID 8167 

ITGCMCG»TreMTC<X^ 

TAACCGCAGGCAGCACCGGTTTGGCGGGGCCTO 

AGACTOAACGCGCCGTm^^ 

GCGCAGGAGAACAGCATACAGAACMTACAAACATTTTF^^ 

TT(mC(mKXOT€AAAG(^CGATOOT^ 

SEQ ID 8168 

IfflFIERTOMLSPAlStPNKBQKSAGGSRCPPLYM 



SEQ ID 8169 

GTGCTGCCCAGCATTCATTATGAA(XAGATTCCG^ - 

ACAAAGTCAACGACMTCTGA(XGTGGGCTTGGGCGTCT 

CG(XMTCGAACCTGTCGCCGCGTGGAAACTC^ 

SEQ ID 8170 

VLPSIHYEADSATDFTGLPVQGSKNGKITKTT^ 

SEQ ID 8171

TO(^TATMAAAMTAAAAGA^
CACCCAAAAATCTGAACAGAAAAGAAACAAGAAACGGTACATT^ 

SEQ ID 8172 

U3YRMK£RTRNDAAIFKKG^^ 

SEQ ID 8173

AAGGMTGGCGTTCGTTGAGTTTCCACGC^^ 

AGGGGACGTACACGCCCMGCCCACGGTCAGATTGTCG^ 

GGTAAAGTCGGTGGCGGAATCTGCTTCAT^ 

GCGTCGGCGGCATTTGCCGTGCTTTGCGCGTTGA 

GAAAGGGGTCATTTCGGTTTCCGTAAAAA®^ 

SEQ ID 8174

L^GSMI^
GKVGGGICFIMNA($HDVGV^ 

SEQ ID 8175 

ATGTCCGAGTPCAAGGAOTTAGAAGAAACATCCCTO 
AGA&TGCTORATCX&GGTTAGAAAA 
AATTAAAGCTraamGTT^ 
T1THXAATACAACATAGTAAAGGTAGATTGAGA 

SEQ ID 8176 

MCEFKDFRmPCFBEYlMSFIGKWYDIX^ 

FPIQHSKGRLR 

SEQ ID 8177

ACGACCGCOXKmrcGGCGTGAACT^^ 

AOCGCTCGGTTACACGGOGMTGAAAA^ 

ACGC6CCM ^^ 

GCTTCGGCGGTTCTTATC^ 
SEQ ID 8178

MLQATPSNPTAMQIKAKjHADVKG 

TRHSRFNKAELFFEKEKNIMGKXSDRTTO 

DSKGASCARFKNHADIIGIjQYTYKFK 
SEQ ID 8179 

ATGTGTGAGTTCAAGGATATTATMGAAACGTK^ 

AGAATGATTTAATCGAGGTTA^ 

MTOAAAGCTTCCCOTGGTTACCTGAW 

TACAACAAAAAA 
SEQ ID 8180 

MCEFKDIIRNVPYFEGYDENSFIGKWYTOGVWDDEEYWK^^ 

YHKK 
SEQ ID 8181 
. TTGMTATCKKXEATGGCTTTOT^ 

GGTCGCCTQAAAAT 
SEQ ID 8182 

LNMGIX3FVAVCRSDQGAAVERAGTC 

SEQ ID 8183 

ATGGCCGGCTGCCTGACCGACTTGGAA^ 
AaGGCTGATGAAAGCCAACGACMGCTCMCAC^ 

CGTCOCTACCATTTTGAIEAT^ 
SEQ ID 8184 

MA(KM)LENCIiEKY^ 

RPYHFDMKEVLYIPSRIPR 
SEQ ID 8185 

TTGMTAGGATTCAAAATAATGGTCAAGTTAAGTGCGCT 

TCATCCCTAMTCTAA(^T^^ 
SEQ ID 8186 

U&IQNNGQVKCANCGIEnPAKQSIK^ | 
SEQ ID 8187 

ATCAAATCGAAACTPACTGTCGTCTATTACG^^ 

C(^TTTGGCTTTAAACX5ACATT^^ 

AAAGGATTTATTGGCAGOC^TAGAAGAACAlC^GTGGTAAAATCAGA 

AGATATGAGGAAGCAAATATCCTQU3CTAG 
SEQ ID 8188 

^IKSKLTVVYYDLESNIAEEI LSGNIMP DGNFLI QKI PLFAPNLAIjNDIVAI EREDKMLF FDHL IKASGNTT INIVVLDHFPKDLLAM EEH SGKI RKNG ENYL SVNF PPKKYNSDLKGILN 
RYEEAKILSYREiCLGFS 

SEQ ID 8189 

TTGCCCTACTATCTGTACTGTCTGC 
CC 

SEQ ID 8190 

LPYYLYCLRLRRLVLIFVNPLYC PFGRESLKKNFPARWPAA 

SEQ ID 8191 

GTGGAAGGAGCAGGTGTAGGATTTTTC&AAAAOT 

SEQ ID 8192 

VEGAGVGFLKNLI PNILFQHKEIGRLKIKHGFRRPFNR 

SEQ ID 8193 

(XsACGGCACATAAAGCGTCXXXriATGTG^ 

SEQ ID 8194 

LTEMPSETLLSVSIX3ILTm 

SEQ ID 8195 

ATGATGAGTGTAAMGAAmTTCAMGTCATCTT 
AATATGAAAATGACCCGGGATOT 
CAGCAAAGAGGAATGGATGCAAAGTCCT 
SEQ ID 8196 

SEQ ID 8197 

GTGCTTCATCAOTTAGCX^TC 
AAACCATGAAAAAMTGTTTGTATTGTTCTC 
GGCGAAGGCCATTGCGGMTACCGC(XX3CAAM0^ 
GGCGCGCCCGCACCAAAAGGCCCCGCCAA^^ 

SEQ ID 8198 

VIMLRESFPLSR(K3ATFYRFLIiIRHIVARCGLIf 
GAPAPKGPAKPVLPAVKK 




SEQ ID 8199

TTGAACGGAGGACAAAAflATGATTACACCGCA^ 

CGCOXSMCAAATO^GATTT^ 

GGAOTGGCTGGTCGATATCGTCGGTACGGGCGGGGATGGCGC^^ 

TCGGTCTCTTCCTCCAGCGGTGCGGOT 

CmTCimCAGTGCCAJrcC^ 

CGTGTTCCACACCGATTrGTGCGGCATT^ 

CGCGTTGCCGAACTCAAAGACGGAAAAAra 

TGAAAATGAACGAGCnAC^ 

CGCA(XX5GAAGGCATCGATTCAGCTAGGGCGAAGGCGAAAA 

SEQ ID 8200

UKHftKMITPQQAIERIJSN^^ 

SVSSSSGAAEMffiQMCmNLTO 

RVAELKIX2CISEYDIRPEDPGIETRRNLDK[KVAN^ 

SEQ ID 8201 

ATGAATACCGTCCCCAAMGtt^ 
CAGTTGCCGOmTTGGG^ 

SEQ ID 8202 

MNTVPKSMFVKPLPEmA 

SEQ ID 8203 

ATGGATGCAAAGTO^AGACGGCAAAATCAAGATGCCX^ 
TGAAATCTTTAAGAAAAMTGATTATAGT 

SEQ ID 8204 

MDAKSLDGKIKKPLYFSDG^ 

SEQ ID 8205 

ATGATTATAGTTGAACATTCAGAACATT^^ 
CTAOTTTTCTACTTTAGGCTTGAA 
GQCTCMTATTT(^TTGACAATA6ACAAOT 
GAIW^TGATTTTCAGGTCTOT 

TCTTGCTGAATAATGAAGTT^^ j 

SEQ ID 8206

MIIVEHIiEHYLGEIESGIKCLDRRYHLSVSVPPSQPYKGVW 
DDDFQVCYGEHYNIWPLLVPLYK^ 

SEQ ID 8207 TJGCQC ^^^ 

TTCAGGAAACGCraCGCAGGAAGCGCGTTCTTO 

GGAAACGCAGGAAGGCGGGCGCACGTTCTGTAICGCCGATTT^^ 

GACATCGTGCAGCAGAAGACGGGCXSGCAATGTC^^ 

T(5XX3ACGCAAACGCTGTCTGCCGCGT^ 

AGAAGMCCGTCCAAACCCACCCCCG 

GTC6AGCGTGCCGATACCGTTACCGTATCACG6GGC6M 

AAGAGTTGGTCGGCGMCAGCGCMGTGGGCGCAGGAAAAAAT^ 

GATGACGCGCGAACGGATACAGTATCTTCG^ 

SEQ ID 8208 

MYRKLIALPFAIiLLAACGREEPPKALECANPAVLQDIRGSIOETL 

DIVQQITCKXjNVEFKI^ 

VtyJklTIVTVSRGEVEEARVQNQRA^ 

SEQ ID 8209 

TTGGTMGGGCTTTCTTAAATTTGATTA^ 

SEQ ID 8210 

LVRAFLNLIKGRIiKLSIJFRWSPLHLKQDIH? 

SEQ ID 8211 

TTGCCCTGCCGGATAMGCAGGTCTGCCIKX^ 

(^TGTTGAAGGCGGCTTCGACGff 
GCCGTGGAAGGAGCAGGTG 

SEQ ID 8212 

LPCRIKQVCLimOTH^ 

SEQ ID 8213 

GTGMGATTAAACCATTACMTTTTCTAACMTAATCACA^ 
AAATATTTATTGACMTATAATTCATTTTC^ 
TTATTTTTTCGAGAirMTAGATGACAGTTATAT^ 
GTTATCaGCTOGACAGAACCTGTATO 

SEQ ID 8214 

VKIKPLQFSNNtfflRFYVOT 
VISSTEPVFYSK 

SEQ ID 8215 

TmCAAGCCTTAACTATTCXOT 
TC^TTGCCGCCACTTTGGTTTTGTTO 
TGCGCTGCA(£A{XXX7^

GOTTGCCGCTGGTGOT 

CGGTGTCGC^ 

CGCACCCGCCTGATTACGGCGGAATCTGCC^ 

AATGGCGTTGCTGAATTOAAACTGTG^^ 

ATGGQGCGCGACTATGATGCCGCAGTATTGGCTGC^ 

AGG<^TTTTTCATraGOT 

SEQ ID 8216 

LTSLNYSPNIFAIASPSGGYKEKHFRRSNDMEWAFNSYTO 
G^PLVVFTAIVGGFILVQNFVGVGLATAI/aliDPU 
RTRIJTAESAVETIiAMFAACIiAFAEIMDGFDKByOT 
KGRDYDAAVLAAGHCOTLGATPTAVANMQSVTHTFGA 
SEQ ID 8217 

SEQ ID 8218 

UI^VLSVLSVASSrcPDLNIJHYTTOT 

SEQ ID 8219 

TTGCGTGCCMC^CITCXXX^ 

SEQ ID 8220 

LRANNFGKIKPASPCILCQSEKMPSESRIPLQTAP 

SEQ ID 8221 

TTOTGCCGTACTTTTT^ 
GCAAOCCCCTGCATCCTGTGCC^ 
SEQ ID 8222 

limFOJKLGKTKNILND^^ 

SEQ ID 8223 

CGCAACGAGCGTCTCCCAACCAATl^ 
CTGTTATTTCAMTATGTGTATTCCCATGATGCCGTC 

SEQ ID 8224 
VKQIKASDFTDWGLraLDILSELEKPQPRTDSRPQTTTQ 

SEQ ID 8225 

ATGGGAATACACATATTTCIAAATMCAG^ 

CC(^TT(^TT(mCAa5CTCG^ 

TGAAGTCGGACGCTP^^ 

SEQ ID 8226 

MGIHIFEITEKAGAKPAAASVAVHPCCRTYRFNFAN^^ 

SEQ ID 8227 

ATGTTTAAAAAATTCAAACCGGTACTGTTGTGAT^ 

MGTAOTGAAAATCCGAATGCP^ 

GGATGACGGTCCGTTGCAGTCGGAGMGGATTATCTC(^^ 

CTGGATTACTATATCXSGCGAGGGCGOTTTCG^ 

ACGG(XX^CGGCTCACGGGGAAAATTATGAMCGACAGGAGAATA^ 

raTTGCCTCGGGTOTATO^ 

CGCGATTATGTGTTTTATCAGAATATGCGGGMTTGA^ 

ATGGAAAAAMCGCCAGAGTTTCGAATATTATTTGAA^ 

GCAACAGACGTGGTATTTGGATGGCGGGCGGATTATCC^ 

SEQ ID 8228 

MFKKFKPVLLSFFALVFAFWLGTGIAYEI^ 
LITOGE^LVAVSLSQRSPEAFVHAEYLYRND 
RDYVFYQNMREUIPRGMKANSLVVGYDADGLPQmW 
AAARRSGGRRGLSH 
SEQ ID 8229 

ATGGGATTTATAGAACCGTTTTTATCAAGCTATA^ 

TTCAGCGTTCAGTAACTGAAAACTCTAATATTCGACAAA 

OTAAAAACCTTO^CAAGATTATCT^ 

TTAGACAAATTAGACGACTTAAAACGAGTAGGTGAT6AOT 

GCCTATACAAAMCCAACTTCMGAAGAAAGTGAA(^ 

GGATGATACCGMGCATACGA(XCAMTAACGGCTTGCTATTAAG^ 

TOTCAC&TGATG'HTCGCGACGTO 

TTCATATT 

SEQ ID 8230 

MGFIEPFLSSYTPLSRDYVQARTNmQTLLS^ 

LDKLDDLKRVGDDICT^ 

I^DVWRRHaJVKLDMHL™ 
ATCCTTFCAGO-^

TTCCAACGCCOCT^ 

GAGCCGACATC6AGG10CCAAACTCCGCXX3T^ 

GCTOGGGAGGAGACCGCCCCAG^^^ 
ACYGGCGTCTCTOTTC^^ 

SEQ ID 8232 

^ISSELSYPAMQbM^PVHQRFV^ 
iffVLLSHLPDLSVSQIiSYLLPmQSDPRro 



SEQ ID 8233 

MtSAAITGATJ^TATTACm * 

CATTCCCCMGATTCCGTrATGGCTTC<^^ 
SEQ ID 8234 

SEQ ID 8235

TTATCAAACAAACAAATATAGTGGAT 

SEQ ID 8236 

MPSETGFQTALESQTIiFAMPYH^ 

SEQ ID 8237

ATCAATMGAAATATOGAAAATOAACTATATCGGTAAT&AA^ 

OCTTTGOTACGAAGC^ 
TGTCGC^TGATTTTTK^GGCAGCCCGCAl^ 
CTGAATMCCAGTATMACGTGCTTTGGCXT^ 
AJffACM-^^ 

CXKOTI7MTCMGATATATTTGATTTATTACCT 

TACATTACATC^TCCATACCXIAAACCATTTGCCAA 

AC^CMTGCTTCATATOCAAACCGAGAT^ 

AAGCCATAAGGAAATTCTGTTTTTAGTAGAAAATACA t 
SEQ ID 8238

nTSSIPKPPWKMraQAVELPKEVINHIiKPFO«M 
SEQ ID 8239

ATOAAAG5TGAAGGAATATATC^G% 

TTMTCAGTMAWTOTSA^ 

C£C(?IKX&ACAATATGT^ 

T'K-TCTTTGAGrcGGGGCXSGGGClAA 

TTCGGCATATACGGCATTGCAAAAAG(X^^ 

CCTOTCATAAC^^ 

CGCATCATAGGTCKCA!EA3CCGC0G^^ 

TOKKSAAAC^ 

GCGAGTCCGATGCAATCOGGT!K>CTCTC^^ 

CTCATAMCOTCm^ 

TACACATTTACAGAAAACX^CGATCGTTCCGA 

GC^CCTCTTCCra 

CACAGCTCCCmAGGOT^^ 

TCAtOTXXX^^^ 

«, - m >«™-,r-r T^t^«f_*TnTmtJTnun_^T im/V^T.OTV_TO 




LnnFPDPPAPRRIKYRTVRLPHNPIFSCRFIIFPVSRSAVHimRT^ 



SEQ ID 8241

M^CMACCAOTACCTATATTGACOTITCTCAGGAa 

CITACCGTACCMOTOCCCOGTCATCAATTACTCCAAA^ 

TreTCAAGGTTTTACTATGGCSGGAM(aTTGGACGGA 

GAAAATGTAfXGCGACTCTATACACACAATTCAOTAAAACACOT^ 

TKX?K_iGATACGTAGTO_V3TGATATraATCGGGAGGAGSGA 

ftCTCGCTKTOKX^GCAATCCAaCG^ 

CCATTAC<nOXAAMCAGGTGATATCC6TAAATACATC0GTTACMC^ 

CCGTKX?I^TTAGCTGCCTTACMTCTTTOXT^ 

TGAAAGTATTITAAAAATGAGTGAAMTGAA 
SEQ ID 8242

MHTPLTHDLFSGAGG^^ 

EJVARbYTHNSGKTRIEIIQAFQNICT^^ 

PLRPKTGDIRKYIRYNSI^AVCITGDMRKVFHYEQ 

SEQ ID 8243 

ATGAACGCGTCAAACTGGAKGTTTACCTGATA 

ATACCCXK)GTATTCAMCCGGTGGCGATGCOT 

GGAACAGGCAGAAAAAATGCCGTCTGAAACC 

SEQ ID 8244 

MNASNWSVn.Il£ENSAFyC6ISPNPQQRIAIOT 

SEQ ID 8245 

GTGCCGCCGGTAATCAGGACGGTCATAAACAAAtt^ 

TATCGCGCGTTACCGTAAAGAAGCCACGGGCGG^ 

ATTGAAGAGCAAGGCAAGCOT^GACGW 

TCGCGCACGMCACOTTTGCAGCCGCTGG 

GGATGGCGCGCGCGCGATTCTGATGGAGCAGTTTGC^ 

ACCGAAGGCGAAAAATTCAGCGATTATFKX&CCGCCGC^^ 

MTACCAGCCC6ACGACACGC(^TTA(XXIA6 

CGCGAAAATCTTTTTGTCGCTGGMC^^ 

GGGCX&CTGACCACTTTGGGACTCXJAC^^ 

TGTTGGCAACGTTGTCGCGCCTGATTAAACAAC^ 

MGaGCCTGaCAAMTCGTCGTTTC^ 

CGCAGGCTGCAAGAC0CGCTT0C(X>AGT1^ 

MGACTGCGTGMCGCXXnXXjGC^TGGACGC^^ 

C6CGTTXK3ACAGCCGAAAAAAATTGCTAAAACT 

CACCCCGAAGCCTATCX^CGTCGTCGCCAAMTGCT^ 

GOTCGGTCTGCCGACTATTTTGGAOT 

GA(XCGCG05AAGTGGTGAAAGCCGGCG^^ 
ATAAAATGCCGTCTGAAMCCGCAGCCGCGAACGGACAGCCG^ 

SEQ ID 8246 

VPPVIRTVINKHSFQIVSGKVYQYYUiLFQTITHIAPSAVQ^ 

IEEQGKLSDDIiRAQIEAADNKTAIiro 

TEGEKFSDYFDRREPVRAMPSHRA^ 

6H)TTLGU)PGyRKGVKCAVVDDTGM 

RM^DPLAELVKIDFKSIG^ 

HPBAYIVVAKMIjAQQGITAAELIGNRERVKQIKASDFTDE 

DPREVVKAGDVVKVKVLEVDAARKR^ 

SEQ ID 8247 

TTGCG«3AAAAGATTTGCCCGATAAAACCT^ 

G(XXXjTTTGCCGTTO^ 

SEQ ID 8248 

LREKICPIKFDFE&VIVRTAKKQKPIV^^ 

SEQ ID 8249 

TT(HTPATCACXJGTCC^TTW 

(XGTCCT(rCGCGCOTCGG^ 

TTTTGCCGGTTTGMGGCAGTGGGGGAAAGC^^ 

GTATTCAGCTCGTCGGCAACCGTTT^^ 

TGACCGACATCCAAAAAGCCGATCCGCG^ 

TTTGCCCTATATCTGTCAAGTGGCTO 

GAAGGGCATATCGGGGCAATGAAGGCGAAAGGCGGCG^^ 

GTTTGCACATTCCTTACCGAATCCAArc 

GATGGAAGATTCGTGGCGTTCGGTCAGCCGCM 

SEQ ID 8250 

LFIflVLITGGTGFIGSHTAVSLVQSGYDAVIL^^ 

WSSSATVYGDAEKVPYTEDMRPGD^ 

EGHIAAMKAKGGVAGVHLFNLGSGRAYSV^ 

SEQ ID 8251 

ATGCTOnCTGTATTTm^ 

TTTGGCGGCATATGCCTTTTTAAACCCCCGG^ 
ATTTTffFCGGACGAAGAATTGGCOT 




TAMGMTTTTTTATGCGCMGGGCATCGC^ 
GGTTTCAGCAAGACCGACATCGGTTTGA^ 
TATTCGGCGCGGT(KAG6CAATAACGGTTT^ 
GGTftK£GTGGGGTTGGGGACGGCGGC^^ 
AimiTCCTTTGCAGGCTATC^
AAAAAACCCAGGATGCAGGCAGA 

SEQ ID 8252 

MLVCIFTGFASGLPLYFLINLIPAiMSE^ 

ILSDEELGLGNSVH^ 

GPSKTDIGIJAKNAGL^AmGILGGVWMm 

rNSFAGYHBWGYVPFFlUOTLAItf^^ 

SEQ ID 8253

TTGCTACTGAAACCCGACGGTTTAWGGCTCT 

TTtKTMAACACGCCGCCGC^ 

GGTTTTTATCGGCAGGGICGCCE^ 

GCGGCGCGCCTTGCTGCTGTTCACAGA(^^ 

GACGGATTGGTMTCGC(OTCAGGGCT7K^ 

AAATG<^TTGGTG^CGTGCATACCGCTGCCG^ 

TTG GT(X3GCGCGTTTGACCAAAGTG(K^y^ 

GCGGAGCCK^AATCTTGACCGa 

CGGACGmTTTCGTMOSGGTTTOTG^ 

GGCTTCG^XX^CGTGCGATGGAGCGCGGGTCGC^ 

GTGOACGCATCGGGGCGCAACTGCATATCGGMGC^^ 

Ga^GGCACGGTAAAGTGC^TGCT'^^ 

AAAMGGGAAATGGCAAAATATGCGGAACGCGGCGGCTTGACGCAM 

SEQ ID 8254 

LLLKPDGLQAVIHFKFQRVRRHTDAAD^ 
AARIAAVHRRDTTCSGTVFGGIGQPGRCF^^ 
LVGAFDQSGETGMF^^ 

QIDKAGFAFFNQFYGVFNGHLQLL 



KKGNGKICGTORLDAKSARFYHADAV 

SEQ ID 8255 

AI£CAMCCX£AGGCAAAAAAAACATCX!T^ 
CCTACGCGGGCAATCTCGAATCGCT6ACCGACATTGCCGATM 
TGCCGTGATGCACTTGGCGGCGGAAAGCC^^ 
CAGCAAATGCXXaTCTCAAAAACGCGAAGCC^^ 
CCAKXXXTACTCCGCCTCCAAAG^ 
CGAAAAACTCATTOST^ 

CAGGTCGTTACCGAAGGTGTTGTCGGCGAAACCTACAATATCGGC^ 
CGGCCGGCGTGGCACGGTATGMGATTTAATTACTTTCX^ 
AACTTTKMTCCGGCX^TCCGCAAAAfXX^^ 

SEQ ID 8256 

MQTAGKKNILVTGGAGPIGSAVVRHIIQIHWBVV!^ 

QQMPSEKItf&FRFHHISTDEVYGDLHG 

QVVTEGVVGETyNIGGHNEKmEVI 

SEQ ID 8257 

TTGTTGACAATCTreCCGTTTGAAAC^^ 

CATATTTTGCCAIITTCCCTT^^ 

GCACCACTOACCGTGCCTCCGCGCA^ 

CAGTTGC(£(XX&&TGCGTCCAj^^ 




TGCGGTCMGATTTGCGCTCCGCCATGGTGAAC^^ 

CPTTGGTCAAACGCGCCGACCAAACCCAAGATAT^ 

TATGCACGTCCACCMTCCATTTGGAAA^^ 

CTOAACGC^ATTAiXAATCCGTCCACC^ 

TGAACAGCAGCMGGCGCGCCGCATCGAAG(X£GT^^ 

SEQ ID 8258 

IiLTILPFmFSGIiEFDAKPVSIXSIGVVRSCRimPPRSAYFM^ 
QLRPDASTAFVDPF7DDVTWITCDTO 
CGQDLRSAMVNILEGIX5IEVEVHHSEVGTC 
LNAITNPSTNSYKRLVF^ 

IJlGGVFSHKCDSyiAFKEEDVRRIRMAPHPIjEFMn^ 

SEQ ID 8259 

AITCCTTGCACATOTC^ 

CTGCCCAOTOTSTCG^ 
AGAAAACCCGGATTTGCC 
SEQ ID 8260

MLAHLQLTFRHRAGVTPYTSTFVLABCO/FNKQSQ 

RKPGFA 

SEQ ID 8261 

CGACACCCTGAtt^AAGATTCCTTCGCX^^ 

TATGCCAAAGCATCCGAACGGACGACATAATC^^ 

AAATCAAAGGCAGQGTCGGTTTTCCTT^^ 

CATACCGTAGTCmC^^ 

TCTTCGGTAATGTCCGTATTGC^ 

GAAA!KX^CGGGTATATAAAAA6TCGCCAAC^^ 

TGTGGAACn^GTGGATAAACTCGACCATCGCTGC^ 

ATGATATATGTACCGATTTGCGAMTCi^^ 

GCAT 

SEQ ID 8262 

KIPDQRFIjRQIiNHSIIiHPAVGQGIQRGIK^DCFARIiAYRIOQS 

HTVVFQC&FVKHjSFAQIOD^ 

• CGSVDKUfflRCQRIQPLVIITDRmQNRH^ 

SEQ ID 8263 

TTGTCAACAATATGCTTOKnT^^ 
CCGCCGTTAtAAAA 

SEQ ID 8264 

LSTI CFGLSVQKRNARPFVRMPTKQMP SEGGFRRHRSHLRAAVIK 

SEQ ID 8265 

AT(&CCGCCCCGCCOTATGGA^ 

GTTTaTCTGGCGGACGMCATT^ 
GACATCACGCAGGTCAAAAACATTGCCATCGAGG^^ 
TOCCMCXCTACCCCXXmAA^^ 
GAGCOmGCTTCCTCXXK^ 

AAAACT6CCGACGGACTGGGTATGCTAGTCGGTCA6GC6GCGG i 

SEQ ID 8266 

MTAPPLWNfflMHALPRYAVFGNPAAHSKSPQIH^ 
DITQVKNIAIEGKTIIJiLGAGGAVRGVIPVLKEHRPARIVI^ 
KTAIX3LGMLVGQAAASYALWRGFKPDIRPVI SMKAL 

SEQ ID 8267 

ATGTOC^GCATCGTCAAATQGCTGATTGCCCT^ 

TGTCGATGCGGATGAAGCAGTTTGAACAAGAAGGTCGCG^ 

CXX3TTTT(K^mCACG6CG6CTTC^ 

GGCACTACGGCGTTTTCGGCGCGGAAGCTGC^^ 
CGCTGACCATCCGAAMGCAAACGGCTGCGC^ 

SEQ ID 8268 

MFRIVKWIiIALPVGIFIPmmGNIITYRAVA™ 
LFLNESRNYIiRKGEEAAITAMMEATO^ 

SEQ ID 8269 

ATGGTATTTGGACAACAAAACCCXKjCGGCAAAACGC^ 

CAWX&AAAGGAAGAAAGAAAAGATGAAAGGCAT^ 

GATTTATTACCCCTTGTCXjGTTTT^ 

CGCCTGCAATA1OTGTTCAACCCAGCCC06AC6GCTO 

AATCATmCGCAAACATTGAAACA^ 

CTTATCCATCGMGAGAMCCGCMCAGCCCAAATO^ 

TTGGAAATTTCC(UCCTCAACCGGATGT^ 

OTTCGTCCAAACCGTGCAAAATATCCAAAACCT^ 

• AACCGOTACGGGCAATACCTGCTGCGOT 

SEQ ID 8270 

MVFGQQNPAAKRIKRQLPFGTLRYGQIVFBQHPDAMPSBNPSQW^ 

RliQYAVQPSPDGLAQAFIIGBEFIGNGNTO 

LEISDLNRMYLEDGSLSVQILGRGFAWIJ^ 

SEQ ID 8271 

ATGGACAraTCGACACCGCCCTCCCCGACX^ 

ATCGAAOTTCGTGCAGGAAAAC^ 

CGTGGrcCTKEATATGCGCGAAGGTTO 

TTGGGCGACGCGGCGGMGTCGTGTACAAATG^^ 

TACTTTCGCCCAAAGACOTGCCGGCAAAACGTGGG 

SEQ ID 8272 

MDIIDTALPDVKLLKPQVFTIXM 
LGDAAEVVYKCTmrYNPETEQ^ 
SEQ ID 8273

TTTTTTGCCTGOETTT^^ 



AAATACCTGCCATGCACGACATGGTCTPTGC^^ 
GCX^ACCGGTTTAAATTTMTCCACTA 

SEQ ID 8274 

LVEQRIENPC^SIPPIATKKPIISGmPAVFGSSPAPQIPAMHD^ 

SEQ ID 8275 

GTGCGTATTAAACCAAATTTCGGGACGACACGCAAGCQGA^ 

TTm<XXOTC(mtX3GCGA 

SEQ ID 8276 

VRIKPNFGTTRKRICK^PNWiSRCF^ 

SEQ ID 8277 

ATGAGCGCAAM^UOGCCtt 
CCAACGG03CAGGCAAAACCA(X^^TO 

TCCTTTTGGACGAACCTTT^ 
GACCCTC^GCATCTGCGACCGCGCCT^ 
AAATA? 
SEQ ID 8278 

MSAlfflSRLVVQNLQKSFm^ 



KT 

SEQ ID 8279 

ATGAATAAATCTTTATCCGGTTCGGTAGAAAAATAOOGC^ 

TCGGGCTGACCTTTGCCTOTCGAT^^ 
TACGCTTTCMCCATCATCTTCGTCCTO 
ACCATTCCXKITGC(7rTA^ 

GO^ATCAMGAGCTCCXmX^^ ' 

CCAACTGCCGATAJ&CTTTTCACT^^ 

TCCTCACACAT'immCCT^ 

TGATGCTGCTCAAGCCMTGGTGGMGGCATGAAGAT^^ 

GTTGTCTATGATGTTTATTCTAGGCGTGTCGTTO 

MTCCGGAGGTTTGTTGGCGGATGAGGCCAACCG^ 
MCCGGCTACCTGCTCAAAGCTACGCCTTGGCGGC^ 

TTACCGGTATCGCGATTGGAGCGGTATTAATCGTCGTCGA 
CGTCMTA3^CCATCGTGGCAGGC6CGGT6TTGG^ 

SEQ ID 8280 

MNKSLSGSVEKYRKLTLRGMILGALI^^ 

TIPLRYAMVVKSDLPYPBGVAAAEILKVGG^ 

SSHIPQPSDMEMAAFAMKLWCEKVRFIGJ^ 

TGAMPRE®mAAQAI^ATimi^ 
MGLIVGESIJ(mMAFIIAFSVT^SD^ 

SEQ ID 8281 

TTGAAAATAAAAACTATTTC£AAAC 

SEQ ID 8282 

UaKTISKRTRTAVSGRPPDTAACFRTIiKSIKQIAQLV 

SEQ ID 8283 

ATGAAATACCATCCCGACCGCAATCCTGACAATAAAGAGGCGGM^^ 

GTCATGCGGCGTTTGMGGCGGCGGACAGCXXSG^^ 

AACGGCAGCGGCGCGAAACCGGGGGCATCOXXX)^^ 

GTCCGGGC^ACATATTAMGAACCTTGCGTCAAAT^^ 

6MK!GGTOAAGQ0GGG00G6CT 

ATTAGTTTTCCCATGGCTOITTTGGG^^ 

GTCTCAAATOTTACGCAGCAGCGCGA 

G(£CTTGGAAAACCAAAC(X^^ 

SEQ ID 8284

MKYHPDRNPDNKEAEEKFKEVQKAY 

NGSGAKPGASPETCPTCKGSGTVHIQQAIPRMQQTCPTCRGM 
ISFAMAAIXmBVPTIiDGKVKLTVP^ 
SEQ ID 8285

TTGATMAGCTTACTGCTTTGTTGTGTCTT^ 
CTTTGCAAAGAATAAAAMTCAAAACMACTCA 

SEQ ID 8286 

IJKLTALLCI^AFCVSGIiSRYN^^ 

SEQ ID 8287 

OTO3MTCCGCMTATCTCGTT 
GCATCAGACXaAlftGGCGAAGCGTTCGG!^^ 
GTTACCTGTACGCGTTATCJOGCCTCXOT 
TCGAGGGCGTGCAGTTCCA(XXXX3AAGC^^ 

SEQ ID 8288 

M^IDNYDSFTYNIVQYFAEIX^EV^^ 

VTCTRYmVIERGTLP^^ 

SEQ ID 8289 

ATGAGCACCTTCTTC0G(K^AAACC6CCCAAGCCAT^^ 
AAAAAACCCGTTACCTCA<^GA^ 
CATCACOX^TCG<?ITO^^ 
AAAGTGGGTGCGCAAAG(X1ACCTGAAGGC6ATGTGTTO 

SEQ ID 8290 

RSTFFRQTAQAOTAKHIGRFPLLELDQVID^ 
KVGAQSHLKAMCLNIJiKAANRIiSAPAAA 

SEQ ID 8291 

ATGATACAAAAGATATGTA^ATTTGTm 
ATCMGCCMCCAMCTACCACATTTAGCGG^ 

AAAGTGCAGCGCGGCGGCGACGTTGCCGAAGGTGCGGTCA 
GGGTCAGCGTCETCATCCAGOTT^^ 

SEQ ID 8292 

MIQKICKLFVLIVIFATSPAPALQSDSRRPIQIEADQGSLDQANQSOT 

KVQRGGDVAEGAVITYNTKTEV^ 

SEQ ID 8293 

AT6AAAOTAAGATGGCGOTACG6MTTGCGTOCCATTGAT 
CCGACGMCCTCAATACACMTGGA(^GCTTGGACGGM 
TTTTGATTCGCX^CATCTCGTGTTCTTCC^ 
AO^^CCECCGACGGaG^ 

ACGGTCAGGCGGGCGOTATGACCTACAACXACAAAACAGGCA^ 

SEQ ID 8294 

MKTOWRYGIAFPLIMVALGSLSAV^ 
TKTAK^QAGKVBTEKIiH^^ 

SEQ ID 8295 

TTGTTCTTTGTTTGGMGGGAAATGGCTGGAGAM^ 
TTAATG(^X3GTTGGTTTTATACCTATCTA 

SEQ ID 8296

LFFWKGNGWRKTCMSGNESMQRKC^^ 

SEQ ID 8297 

GTGGTTCTAGGTCATACCGCCC^ 

cttg^gaacacgagatg^ 
(^gccgtccattgtgtattgaggttcgtcgggattc 

AACGCMTTOCTACOmTCTTAOTra 
GC6TTTGGCAATA 

SEQ ID 8298 

VWGHTARLTVRRAILETDRRIGLGILRFGI^^ 
QAVHCVLRI^IBPDFIiDFDFADTAQPCRQAAQ™ 

SEQ ID 8299 

ATGGCCTTCGCCGGGCCTATGCCCGGCAGCGC^^ 

OTTCATGGTTTTTCCTTT 
G 

SEQ ID 8300 

MAFAGPMPGSASSSCCEAAIMFTAAIMAQ^ 

SEQ ID 8301 

AIOGGAACGCAATTCCGTACCGCCATCTTACT^^ 
CCX3GGGCGTTTGGCAATATAGGCGGCGTGTO 
CGCACTCXSGCTTCTTCCAAGCCTGC^^ 
CGCGTCCCGACCTOT^TMTC^ 
SEQ ID 8302

HGTQFRIMLLSFIW^ 
RVPTCDNRSLHAACLKHFQAVSVQ^ 

SEQ ID 8303

GTGOTCMO,CCTTAGGGAATC6W 
AMCCATGAAAAAAATGTTTGTATTGTTCTGTAT^^ 
GGCGAAGGCCATTGCGGMTACCGCGOGCAAAACGGC^^ 
(X3CGC(£CC<XIACCAAAAG^^ 

SEQ ID 8304 

VimRESFPIiSRGGATPYRF^ 

GAPAPKGPAKPVLPAVKK 

SEQ ID 8305 

M6CROGCAftTTOTOC3CGAAOT 

AATCGTTTCACACACTGQACGGACACGGOCTGAAA^ 

AAATTACTACTTCAAAGGCATTTCGGACAAACGCGCC^^ 

ATGCTGCXSCTGCGGATT^ 

TOTCCXXXJCGAAAGGTACGITGGGCGOG^ 

SEQ ID 8306 

MQAISPELQARAmKIMLDVIWVLTO 
MVRCGLPVAVPIMWFTI^HAAYIAKRPGGAGAVREVC^ 

SEQ ID 8307

ATGATAGTGTGGTTCTTCCATGCGAAAGTAGGTCAC^^ 

CTCTAOXJKE^ 

SEQ ID 8308 

MIVWFFHAKVGHCQTPFQ^^ 

SEQ ID 8309 

ATGCTTTCAGCGCTTATCTCTTCCGMOT 

GAGCCGA(^TCGAGGTG(XyyVACTCCGCCGTC<^ 

CACTATGiraHKOTT^^ 

6CTTTGG6AGGA6ACCGCC(X^GTCAAACTGCCTA 

ACTG(XX?rcOTOTTCCAA^ j 

SEQ ID 8310 

MLSALISSELSYPAl^LAI^PVHQRFVHSGPLVI^AAPTO 
HYTOSHLPDLSVSQLSYIiLPOTQSDFRPDLGffiiRTPP 

SEQ ID 8311

ATCC^
ACCCTCCGGGGCTTACGGAGCMGTCOT 
CTCCXX^CCTGTCTCGGTTTC^ 
AGAAAACCCGGAOTGCC 

SEQ ID 8312 

MIAHLQLTFRHRAGVTPYTSTFVIiAECCVFNKQSQP 

RKPGFA 

SEQ ID 8313 

TTGCCTAAGTCTTCCACCTACCGGCTTAAACAAGCT 

CTACGCATOCTGCCTCGCOT 

TCGCACTTCTGATACCTCCAGCACACOT 

TCTTCCGCGCAGGACGACTCGACCAGTGAGCTATTAC^^ 

SEQ ID 8314 

LPKSSTYRLKQ^^ 
SSAQDDSTSELUaPL 

SEQ ID 8315 

ATGGATG<^CGGGOTTCAAG(OTAT^ 
•I^n^CACACCGAAGCGGAAGGCTTGCGCGAAAT^^ 
GTCCGGACATATCGGGCGCAAMTGGCGGCAAC^ 
GTCGCCXmTTTCCAATTCCGGCG 
AT(XXX&TATCCA(^TCACCGCATO 
C®XXS3CGCOTTCACGC^^ 
CTCGGCACGCCCTTGAAAGGAGCCATCGTCAGC^TO 
TGTTTCAAGMTGGaACAATTTTACCG^^ 
TOTGAA(XX3QCTTCTGGTTAC0^ 
SEQ ID 8316 

MDGRGItKAYATPIRKFWVVIiLQQRRMM^^ 

VAAISNSGESDEITAIIPALK^ 

I/jppLKGAIVSMSIffiGliGM^ 

SEQ ID 8317 

TOTATC(^CmATCCTGAAACACAAM 
SEQ ID 8318

LYRLNPETQKAGLRHNKAVSFIKVGISSIiLr 
SEQ ID 8319 

ATGACTATTTTATCGGACGTTAAAGCAT^^ 
OTCCAATCCCGCCATT^^ 
CGACGTACGGGCAGCCTGCGACGTTTGCOT 

OXrFGCTGTTCT^^ 

ctcgcgcgtcgacagtgcgctggatgcaacgctgcrc 

(kx«^tggaagcccaag(x;gc^ 

acaccgtccccgatgccacgctcaaagcctto 

SEQ ID 8320 

ffllLSDVKAl/^ItfimSRSL^^ 

EARRLHAAIARKKAMIKVPATDAGIDALE^ 

AALEAQGANRVQLLWASTGVOTAYPDTOT 

SEQ ID 8321 

TTGTGTOTMTCCTGCCTTTTGTC 
AAAACAAACTCATTGTCTTTGTTTGT 

SEQ ID 8322 

LCLNPAFCVSGIiSRYNHHPWrMFWFSI^ 

SEQ ID 8323 

TTGAAAGCTGATTCAMTAGCGGGGAATTATATC^ 
SEQ ID 8324

LRADSNSGQLYQQSHQGTSFQTAYDCPNRANFDKTGK 

SEQ ID 8325 

GTGATTAACCTATTGATTTATMTTTGTTTATTACT 
SEQ ID 8326 

vimimFiiNcisDTPmpLYiiSvpnAPS 

SEQ ID 8327

TTGCTAGAGCAOTGCTTTGCAAGCAGGGGGTC^ 
GMTAACGACGCATCGATCTTTMCAMTTGGAM^ 

SEQ ID 8328

LVEHLLCKQGVIGSIPFASTKTI£M^ 

SEQ ID 8329 

TTGGTAGAGCACCTGCTTIU^GCAGGGGGTCAITO 
GMTMCGACGCArcGATOTTAACAMTTGGAMGCCGAM 

SEQ ID 8330 

LVEHLI£KQGVIGSIPFASTKTIK^^ 
SEQ ID 8331 

ATGCCGTCTCAAAGGCTTOAGACGGC^^ 

ACAAAAGGAAA 
SEQ ID 8332 

MPSERI£DGIAIi)TGRSMYQGSALKII^GKKLMHDITRQR^ 

SEQ ID 8333 

ATGGGCGCAAGCCTGATCCAGCCATGCCGCGTGTC^ 
AA 

SEQ ID 8334 

MGASLIQPCRVSEEGLRWKDFTQGRKGRCQYRRPlfrVPEE 

SEQ ID 8335 

TO^AGAAATGCAGGCAATGAGAGAATItXIAG^ 
SEQ ID 8336 

LVEMQAMREIyQIEPSIFSRLIiLWMIQIiPKSNIMPYHI 

SEQ ID 8337 

GTGCGAGGCCCAAGGCACTCACACTTATCGGTAA 

A!EftCTTTTCAGTKX3TCCG^ 

A 

SEQ ID 8338 

VRGPRHSHLSVICFVmCEIimFRLSUSLDIS^ 

SEQ ID 8339 

ATGGCAAAAGCACTO^TCATT^ 

(XjAACTGt^TAMGATGAGATMCGGCTTTGAT^ 
SEQ ID 8340 

MAKALEIISPDEnSDLIFKDPVPPHTIYTBUdKLVGRHAGNERlADRATO 
SEQ ID 8341 

ATGGTAAAGCTGACAGAACMGATATOTGAATTGGATTGGGOT 
AAAACGCCTCCGTTACAACAGGGCATCTCCAA^ 
SEQ ID 8318

LYRLNPETQKAGLFHNKAVSFIKVGISSLLT 

SEQ ID 8319 

ATGACTATTTTATCGGAOGTTAAAGCATTA^^
OTCCAATC0(X3CCATTTTCCAAAM^ 
CGACGTACGGGCAGCCTGCGACGTTTGCCT^ 
GAAGOXGCCGCCTCaTGCaXCAT^^ 
(XCTGCTGTTCTCACGCGCCCAAACOT^ 
CTCGCGOTXCTCAGTGCGCTGGATGCMCGCTO 
GCCGOCTGGAAGCCCAAGGCGCAAACCGCG 
ACACCGTCCCCGATGCCACGCTCAAAGCOTTAT^ 
TGTCGAAACCTTGGCGGCXXXSTTTG^ 

SEQ ID 8320

tfTILSDVKAIXSQQIWIiDNLS^ 

EARRLHAAIARKNAMIKVPATDAG I DALETLVS DG I SVNLTLLFS RAQTLKAY AA YARG I AKRLAAGQSVAHI QWASFF I SRVDSALDATLPDRLKG KTAIALAKAAYQDWEQYFTAPEF 
AALEAQGANRVQLLWASTGVKNPAYP^ 

SEQ ID 8321

TTGTGTCTTAATCCTGCCTTTO 
AAAACAMCTCATTCTCTTTGTTTG? 

SEQ ID 8322 

I£LNPAFCVSGLSRYNHHPNTMFWF^ 

SEQ ID 8323 

TTGAAAGCTGATTCAAATAGCGGGCMTTATATCA 

SEQ ID 8324

LKADSNSGOLYQQSHQGTSFQTAYDCPKRANFDKTGK 

SEQ ID 8325 

GTGATTAACCTATTGATTTATMTTTGTra 

SEQ ID 8326 

VINLLIYlttiFIINCISDTPIHTPLYLSVPEAPS 

SEQ ID 8327

TTGGTAGAGCACCTGCTTT^ 

GMTMCGACGaTCC^rcTTTMCAM I 

SEQ ID 8328 

LVEHLLCKQGVIGSIPFASTKTLQMK^ 

SEQ ID 8329 

TTGGTAGAGCACCTGCT'ITGCAAGCAGGG© 
GMrMCGACGCATCGA!ICITTAACAMTTO 

SEQ ID 8330 

LVEHH£KQGVIGSIPFASTKTIiQMK^ 

SEQ ID 8331 

ATGCOTCTGAMC^^ 
ACAAAAGGAAA 
SEQ ID 8332 

MPSERLSDGIALLTGHSMYQGSAIiniMGKKLMHDITRQHKRK 

SEQ ID 8333 

AA 

SEQ ID 8334

ffi^LIQPCIRVSEEGLRVVKDFCQGRKGRCQYRRFJfrVPEE 

SEQ ID 8335 

TTGGTAGAAATGCAGGCAATCAGAGAATTGCAGAr^ 

SEQ ID 8336 

LVEMQANRELQIEPSIFSRLLIjWMIQIiPKSNIMPYHI 

SEQ ID 8337 

ATACTTTTCAGTTCGTCCGCCGCTO 
A 

SEQ ID 8338 

VRGPRHSHLSVICFVKERCEI^ 

SEQ ID 8339 

ATGGCAAAAGCAJTrCGAMTCATTTC^^ 
(EAATGAGAGAATTGCAGATAGAGCC^^ 
ttSAACTGGATAMGATGAGATMCGGCTT^ 

SEQ ID 8340 

MAKALEIISPDEIYSDLIFKDPVPPHTIYTEI^^ 

SEQ ID 8341 

ATGGTAAACKTK&CAGAACMGATATTTTG^ 
AAAACGCXITCXXnTACAACAGGGCATCTC^ 
ACTTCTCAGTMMTACAGGCGACCATCC^
TCCGTTGAGGCCCAAGAAC^^ 
SEQ ID 8342 

MVKLTBQDIU^GPEDDYMNDDHLAFFRE^ 



SEQ ID 8343 

ATGTACACAGGACGCTTO^^ 

AAGACCTCCMCCCXXXKX^^ 

TGMGAAACCCTATACCGTCTGAAMCCGCCGGACTOTCT 

CGCaCCCCGGCCAACCXXXTK^^ 

GCGACATCX5GCGATTTCGTCCTGTTO 

TTCCACACCGCGCCAAATCTATTT6CAG 

GCATKK&TTTAAAtt^ 

ATATGGACAMGTGCCGAAACACGCCATTACCGCCCCC 

SEQ ID 8344 

MYTGRFAPSPTGLLHIGSLLTAVASyADARSNCXm 

WffGQRPALQGKQPSWra^ 

ALDLNRREQUiRQVFRYLKLPEAPETD^ 

SEQ ID 8345 

AT6GCGGCTGCCX3TTGCGGGC66ATTGGTCAAACAAGG® 
CCCTGCCGGAGCTTCATTCCK3ACGATC 
GTCGGTCGGTACGCTCAGCCGTTACCTCGGGGGMGCCGCCGCATO 
GAAACAGACCGCAGGATTGCCGATCGAATCATGAAATCAGTCGGl^^ 
TTTATCTGCTGGACGCATTGCAAAATG^ 
AGATTTCGAGMGCTTCAAAAAAATGTAACGTCAAMGGCGGGAC^ 
TGCCGTTCGCAGGAAATGGAACGGCAATATCAA 
SEQ ID 8346 

MAAAVAGGLVKQGGYRIHIANRGAEKRERI^ 
ETDRRIADRIMKSVGLTVWIiE^^ 



SEQ ID 8347 

ATGGAAATTCGGGTAATAAAATATACGGCAACGGCTGCGTTGTTTGC^ 
AAOTGCOXX^TCGAmTTGGGATAT^^ 
AAAATOTACTTTTATAGGAAMTAGGGAAGTTTC 
CAGGGGTTGCGGCGC^CGGCCTGTCCGAGCGCGTCCGA 

SEQ ID 8348 

MBIRVIKYTAJPAAIiPAFTVAGCIUjAGWYECSSLSGWOT 
QGLRRNGLSESVRW 

SEQ ID 8349 

ATGTTTGAGTTAAAAGMTGTGAAAACC^ATGATAAACAAGGMCT^ 

TTGCCCGCCAGATTACCCGAMTGCTTGGCTGT^^ 

GCAACTGGGCGGCAGCGATCCGTCCGATTTGGCGAAAGCCGCC^^ 

TTCGGCGCGTGTCTGATGMCGAAGTCGGGCTGGTTGC^ 

ACX^CCOTGCCGATTTCGl^^ 

GCCCTTGAAATACGATTAOTTTAC(XCCTC 

GTGATGGTCGGGCGCGAGGCATACCACMCXXX&TGG 

ACAtmAGTCCAAATCCAAG^^ 

AACGCTCTTGAMGACMC£ACGGCGGCCTGA^ 

SEQ ID 8350 

BFELKECENUQNKELI^ 
FGACLMNBVGLVATCLNAMHDAVGIPVTVKHRIGV^ 

vmvgreayhnpmvmrevtor^ 

SEQ ID 8351 

GTCCGAGGCCX^GGCACTCACACTTATCGCT 

ATACTTTTCAGTTCGTCCGCCGCTO 

A 

SEQ ID 8352 

WGPWiSHLSVICFVKERCKIimFRLSnSI^ISDIPC^ 

SEQ ID 8353 

TTGATTAAAAGAATTGATAAAACAATGGGTTTGAGTC 
GAATACTCAGAGGGCATAGGCAAAAGTTTTTCAAA 

SEQ ID 8354 

IJKRIDKTMGLSVICAI LASRRISKFCYYITIiJPLYHKILRILRGHRQKPFK 

SEQ ID 8355 

A3X3GTGCAAATGGCAA(MTAACCAAGCG^ 

AAAAGGCTTTGGCGGTCGMTGGGGGAAGAAACGGGAGGCGGAAAT^^ 

TOTAACGAAACGCTTG(&GC 

ATGCAGAGG(X3GCGCGGAATCCCG3AACTGGJO^^ 
AMTAGGCTGGCAGGMTTOAmCGC^^

TTTCCTOm^TGGCAAAra 

AAAAATGATTGTACCCGGCCGGTTCX^^ 

CGGTCAGGAMC6TATGCTT6CCAACAAA6GCATCGOT 

OTCCACGATTTACGGCACGAGGCTGCTArc 

SEQ ID 8356 

MVQMATiaraUOTSGETTOWQVRVOT 
MQRRRGIPELDIAPIAASTAI^EI^YIRSVLK^^ 
KSDCTRPVRDLKNPlSKSTGNNKEro 
RKRSTRLDFKEAMHQAQSDIKSGK 
SEQ ID 8357 
ATGACG(m3TTGCMGAACGTTATCGTGA 
ACGGCATCC(mAGmACGCTGCCGGAa 
CGATGTGCAGTCCAACAAAACCAAATTTGTCGCC^ 

TGTGCGTTGCCAAAGOCMCAGCAGTGAAACGGAGTTGAA 

GTCGGACGA^TGCCTGCCGCCATTGAGTGCGGTG^ 
SEQ ID 8358 

HTVI£ERYREVSDRIGKLVI£AGR^ 

QWCVEVNIAGEAAKHGVAPEEAVALAV^^ 
SEQ ID 8359 

ATGCAGAmCCGAOTACTCGCCTTCXXX^ 

AMTGAGCGCGC^GAGGTCGGCMTATGGTMCTTOT 

C03CGTCMCGCCTTCAACACCGGCCGC(KK 

CCG(XXX3GCATGGTATTGGTTACCGG(X^ 

TCGMTTCGTCCA(XAMGCAAAAAATCCTO 

CGGCGAGATGamCOTGAAACaTCGGOT 

GTATTCCCCGOXXaGAGAMGAMTGGT^^ 

TCCTGAJTGCCMCEttG^ 

GCTGGTGCGCCAAGGQC?TGATCGCA(XX3GAA^ I! 
SEQ ID 8360



PRGMVLVTGPTGSGKSTTLAAMINYIHETQPAHILTIEDPI 
VFPAGEKIMVRSMLSESLTAVISO^^ 
SEQ ID 8361 
atc^catcagtaaagaacaagggottp^ 
(^tgcggtgcgagctto:^^ 

GGAATATCGGAAATTCCGGTCT 
SEQ ID 8362 

MNISKEQGLLIAYGGRPHPPREVHKDFF^ 
SEQ ID 8363 

TTGCATCAAGACCGGMTTTCCGATATTCC^ 

AMTCGGAAACGGAAGCTCGCACCG(^CCCCCGATGCTTT 

TAAMGCCCTTGTTCTTTACTGATGT^ 

SEQ ID 8364 
LHQDRNFRYSDFAFSSLSAIAF^^ 

SEQ ID 8365 
ATGMTACCGATMCCTGCACGACATCT^^ 
GCCTGTGCGAAACCGCAGAAGCACAAAACGCGT^ 
GGAAACCGCOJCWTCGCCGCATCGACGAT^ 
TACTACAGCCAAGGCAGCGCAGGTTTGGTT^^ 
TGATTATCGTCGGGCCTACCGGTTCGGGCAAAIKX^ 
CAAACCGCGCCGCTGCATCmAOT^ 
AGCAGGGAAAGTATGGAATACGCGATGCAGCTCGCXXJ^^ 
AACAGCACAACCAAATACTGATTGACATCGCOT 

GcccGcamxAGGACT^^ 

GGCATCATCAGTTACGAAGAAGCCCTGCGO^ 
TCAACCTCATTTCC 

SEQ ID 8366 

MNTDNLHDILDETVQWSQKKQSRSETPAEIGTHF^ 
YYSQGSAGLVLRRIHmPQHRELGLPH^ 
SREQffiYAMQLAQTtaiLCOT 
GIISYEEALRQSVSANNLRLHI^ 

SEQ ID 8367 

TTGAGGAGTGGGGTTTTCATGGTTTOACCG^ 
ATTGGGTTPTGCGGATCAGGAAA 
SEQ ID 8368

LRSGVFMVLPVYCYEVTO 
SEQ ID 8369 

ATCAAAACCOCACTCCK^^ 

TCGGCATCATCGCCGGCGGCCTGGTCGATTI^ 

CACAGGGCTGCCCTTCATCCTCGC^ 

ACGCTTACCTACACCXXXXSJU^ 

MGAAAGCGTCGCCAATGOTACGAAGC^ 

CACCGGC(^TOCCGCCTiaACC^ 

Cm^CXX^TCAGCTCCGCCC^ 

OXKXX!AAG<X»TCC^^ 

CCACCTCAGCCGCCTTCTCGACAA!^^ 

CCCTCAACCTCAACCTC^ 
CGTCGCXTPCGCTX^CCCTACTT^^ 
M^CCATTCAGGCACTGACCAGCCTCTC^^ 
TGTGGCCAGACTGGAMTACCTCACGCTCGAACGC^ 
AGAATACCGCATCACCCG(XXXX!GCG(X^ 
CTCAAMCCGGCTAOTCCTGACC6GCTACATCTOT 
CCCACATCTTCCaACacrimm 
CCTCmCAGCTCCAAC^^ 
SEQ ID 8370 

MKTPLLKPLLITSLPVFASVFTAASIVWQ^ 

TLTyTPETYWLOTFMILCGTVLySTAIIIjFQIIL 

HERISSAHVDYQEMSEKFK^IIFRI^ 

TGSFKNTWQAIRPQLNI»ESCVFRH^^ 

rnQMjTSLSLAGLDVTAAMFVRIIiyrilGASIiAWAAVSYL^ 

LKTGYALTGH&^AH&EMHEK^ 

SEQ ID 8371 

TTGCGACMCGGCGCCGGOTCAGCTGCTTCA^ 
TGTATCACGATTCXXSGCACAAAAGGCTCAAACACGGCATC^ 

axMmxaAAcrcrc^^ 

AGCTTGAGG(X3GCTGGTTGCCGCCAAAGTCCGCC^ 
CCGTAGAGATACAGGCGGAAGCGTGTAAAGCOTGTT^ 
TCCGCCXjTTTTCCATCAAGCTAAAGGATTTGAGCG 
AACGCCCGCCGOTCGGATTCTTCATCC^^ 
TTGMGACACGCXX^TCGAAGTGCTTTTACTGOT 
AATCGMCCGGACTTGTGGCmCCG^ 
AGGTTCAGCAAMTGGTCAACGAGATAGAAGQCTGGCCGCAT^ 
GCAGCCTGCCACTOTXX3GAGGGGCGGCCGGC 
SEQ ID 8372 

LRQRRI8jQLLQIiDKIY^ 

SIRRLVAAKVRRYCGEHKJVFI^^ 

NHUffGFFIIJPGIAYIlKSEQRFRM^ 

BFSKMVNEIfiGHPHAEPDGFCDRim 
SEQ ID 8373 

TTGGG{^TTGAGAGAACXjGCATCAAAAAATC^ 

AACGO^TC/U^GCGCCGGG^ 

CGAAGCCCTGACGTGCGGCATCATGATGCGGCTGT^^ 

TACTGCAAOOGACGACO^^ 

TCXSTGCTGCGTCCGCTGTCAAATCra 

CATGOTTTTTTAGAGCTOACGGCGGTTT^ 

ATCCTACACGCCSCGAACCTGCmAC^ 

C(&TGATGAAAAT(£TTG^^ 

AGATTTC^AACGGACGGQAAOCTFGASCA 
SEQ ID 8374 

I/3IERTASKNRENKMQTVftTKW 

YCmDPMGRSVLFACREASQAVAC^^ 

ItflAANIiLYBVGIQADXELSAHYGKPl^^ 

SEQ ID 8375 

GTGGGCTTGGAAA(X3GTCAGCGTCGTCGGCAAM^ 

TCAACTTGGCmOGCTTTCGAC^ 

CGGCGAAACGGGCGATATGGCGGACTTTTCT^^ 

GTGGCGGGGCTGGTCGATGTTGCCGATGGAAAAAM 

GCATCAATATCGGACTGGGCAAAAACTTCGTGCT^ 

TTCGCAAACGGGCAGCATCGGGCTGTCTTO 

C^CGCCGACATCATCTGGCAAMGAG?^^ 

ACGGCGAC(K3T(K^ACGCACiO^ 
ACATCTGAACCGCAATGACTACCACCACGACGAAAAA

TAGAACAGGCAAATTGGGACAACTTttCXXTW^ 
GCCCGACCTOmCCXa^^ 
CAAGAACTGTAC6CACA(X56CAAGCACGTC6CCACC^^ 
GGCMTACAA!TCTGGCAGCX!TAC(^ 
CAACmTCCG(n^CGACTTCT 
CTACCCCKXaGGGAAC&TCCCTAC^^ 
TCGATGCCAATTTGGiU?EACT 
TGGCGA6TGGAATTGGTAO(3TCAAAG(X^ 
AAGTTT 
SEQ ID 8376 
VGLETVSWGKSRPRATSGLL^^ 
VAGLVDVADGKIPEKMPBN^ 



KGSWGVQYL(#KSSAIiSAIPE^ 
Q&YARGKHVATNTFEVGNKHL^ 
LPGREDHGKRPFIAQADQNAPMPAARL^^ 
KF 

SEQ ID 8377 

CGTTTCCGTCAAA(XK£AGATACCGCT^^ 
TGACAAAACAGGCAAATAACAATACMTCCCCAAT^^ 

SEQ ID 8378 

RFRQTADTA^KLIQIAGNnSKAIRVHLFRRHMIAR^ 
SEQ ID 8379 

ATGCXIGTCTGAAGCCXmGTTTCAGGCGG^ 

CGACAGCGACA 
SEQ ID 8380 

MPSEAPSFRRHIHKGASAGGEEEGGVVGGGAAFGGDiCKFyPPAT 
SEQ ID 8381

ATGTCCCGTATCTACCTACCCCTCCTATM^^ 

AGGTATTCGATTCGGAAAACTAT i 

SEQ ID 8382

MSRIYIiPUiFPHJIVERGIiLYFQQDKVRI^ 
SEQ ID 8383 

TOTT(£ATGATGACGATTTCATACGCGCGGOT 

TGGGAACGGAAAATGCCmTACGGGCCX^TCa 

TGATTTTCTGACAAATGCGATTCGTCXIAAAAATM 

TGCCATCACGTCAAACCGMGGGCGAGTGTCTTO 

GAATCTATGAAAAAGGCAAGCAGCTOGC^ 

TTTCGGCGGAGaTACCmTTTGCGAACG^ 

GGACGAGGAATCAATGCATCGAAAGCAGTGTTTCCCX^ 

GCAGCGAGGCAAAGCTMC(XXACTGCATGAMTTCCG^ 

AGAGAAGA^ATGACAMTACGCAMTO^ 
SEQ ID 8384 

LFDDDDFIRAASAKMEEIFGTCIIEKAKHSGGOT 

CHHVKPKGECI/SSDWEEE^^ 

(ftGINASKAVFPDKSKQ&FEVL^ 
SEQ ID 8385 

ATGAGCTATTTGGMGATGTAAAAAACGCATTAAGGGTMTAGATMOT 

AAGCGGATAOTCTTTGGAATTTCTAAAGGATGM 
SEQ ID 8386 

MSYLEDVKNAIiHVIOT 
SEQ ID 8387 

ATGC^GCCTCCGGTTTATATOTAGCAAGC^ 

GUXOTACAACGTTACTATGCCXjGTTTGGT^ 

AMTMTGTTTCTTGGCGGGATTTATGGTTT^ 

SEQ ID 8388 

l^PAVniiASQRimYIGVTSDLVQmQHREffl 

SEQ ID 8389 

TOTTAAAATATAAACCGCAGGCTGCATATAATA 
TOCCGCAACTOTCGTC^TTCOCGCGAM 

SEQ ID 8390 

I^JCYKPQAAYmLKYLNLYSREHHPVm 

SEQ ID 8391 

TTGGTTTTCK?rTTT^ 
OTATCGGGTAAAAAGGTTTCTCAGGTCC 
SEQ ID 8392

LVFCFCRNDEE LS FRNLSEKTETAP PSFPLRRESGSVGAETYRVKRFLR5 
SEQ ID 8393 

AMCCGGCCQCGCGGGCGGGI^GAAGAAGACTTCGCGCC^^ 
SEQ ID 8394 

IiUiSHVIiDSRIiRGHDIAFHLPPPPE^ 
SEQ ID 8395 

MGCGCAATCCOTATCGCTTATA37AAAAACCAAG^ 
AGACTACTTGGCGCGCAATATGCAGGCGGCXaG^ 
CXX»CC(MTTTATGCGTSAGGCa^ 
TTAGA 
SEQ ID 8396 

mimRCAAWQAALDYYFDOT 
PAEFMREAVKTEPDKTAIRKAIESGRQVAGAKIEGRKNLQIR 

SEQ ID 8397 

maACTGCTTCGGACGQCAACAG^ 

TCACaTCGGCAGCGACGACCTATTCATC^ 
SEQ ID 8398 

I£LLRTATEKGNKKMANIDIffQ^ 
SEQ ID 8399 

ATGMCGCGCAAAATCTGCCCGAGGTAAAGTG^ 

TATCATTCGATMTACCGATAACGTTGTTGC^^ 

(mXMCMTGCariTGATTGGATOCT 

TATAAMCCCTUOTTACCTTCGGTCMCMGG^ 

GGTTGG6CGTGTCTTTCMTTTSA6CAGC6AGAGOT 

AGACCAAMCCTCGTTTATACGACAGGCGACGGTMT 

ACCCAAATCAAAGATATCA(X5(MGGCAAAACAGTT^ 

TTGAAGATAMGGAAMGTCAGCGTC^TCTAGGCCTGCCGCMGTC^ 

CGCGCTGTGGTTCGGCCCTGTGCAAMTGGC^ 

TCCGGCAMCCCGGCCGCTATTCCX^KX^M 

AACTGMTAGAMCMGGATGA6GTCGTCACTTTTGGAAAW 

GCCTTGGACGGTTCGTGCTGCTGCCGATGA 

GTC^CAGCra^TTGTCGaSGTraGCGGCT 

ACATCXXX^GGCACGATGCCGCGCMGGATAITCAAAGCM 

TCIOTGCGCC^AmCAGATGACCMGA 

a^cGccGcnraOTGTT^ 
ACGCCGccraccTacrrc 

GCAGGGCGGCAAGGGCGGGCTTT(^TCCC(XIACGOT 

TCCMTCCTGATAMTCGTCTGTMGGACTATOT 

TGACCQAA6A!WaATCTACTGAATACG6GCGAACAATATATTO 

GCTTAAAGAGGAAAATAAAACATTATTCCIG^ 

CGTAraCCOTGTCACCATCCGaGCT^ 

TTGTG(XGGATMCCAGC?ITCCGCAATAraX^GCCA 

CAAACCGGTTAATCTGOTTACCTGGACGA 

SEQ ID 8400 

M^QKLPEVKVKSQSYSSAPKKDRERQFTHTNNFKIRR^ 

YKTQroGQQGLQRKAGKKLDIYEDKS^ 

TQIKDITQGKTVDLGTLKPRIELSKGW^^ 

SGKTCRYSLVmENDIKSKEPSFMGRETIIRLDS^ 

VNSPmVGGYlATAANIXa™ 

PAAAPIJDVKNGDNNGKMV^^ 

SNPDKWSVRTIFBGDKPXTSAPAVSRLADKRVVIP^ 

RTAmiilSyrGNDKCGAQTAILGOTM 

RCFSGKGVRTLUINDLDMITGKKXSIKRLSWREVFP ~ 
SEQ ID 8401 

ATGGGAAAACATCAATATGGtt3GCATAA^ 

GGGTTOGG 
SEQ ID 8402 

MGKHQYGGIKRGMAENLPFQSFIHILFliKFAFAGRRAANAGVR 
SEQ ID 8403 

TTGATTTOiU3TGTGAAAGA(X3AAA6 

AGCGGCAGCACAAGATTTTAGTGCAGTGGGAMTCGACCCTGA^^ 

AA(mACTGGCAACAGACCTCAAM 

GAAAGttGTGACSGCAAAACC^ 
GGGCGAAITACGGCCrCCKjAACGAG

(&AAG&XX?GCCGGAACACrc 
SEQ ID 8404

LIJaSVKDESNFKPCTAGS^ 

ESSOXTTYAmPMSNKM^ 
SEQ ID 8405 

TTGCTTTCGTCTTTCACMnTAAAA'P 

SEQ ID 8406

LLSSFTLKIHDIF&JJjKV^ 

SEQ ID 8407 

GTGCAGGOTCTGTATGGATCGATTCXHCATO^ 
SEQ ID 8408 

VQVSVHMDSSFPRRRBSRPWDNSNIQRLSAAVC 

SEQ ID 8409 

ATGAGCTTCCATCCCGAAACCGCTTATM 

AAAAACAGGCAGAAGGCCACATTAAMGCATTATCA6A 
SEQ ID 8410 

MSFHPE?AyKGGGETBPYGPSPEEimQSPETAETRRMTEKQAEGHIKSIIR 
SEQ ID 8411

GTGCXK&TATTAAOCATAACCCTT^^ 

TCEACAGCCAAATO^CTGACC 

CGAAAAA<?KXXX^TOTCAAM 

SEQ ID 8412 

VRILTITLMELSMYAVVKTGGKQYKVSVGEmCVEQ 
SEQ ID 8413 

ATQGGCAAACATATTCTPniAGGTGTAACGGGCA

CTGAATTTGTTTCT^ 

TTTTCTGAITCCGCCGGCAACT^ 

AATGTGGAAATGTGGCTKMTCCTGCCAAC^ 

GGATGCXXmCCTGCCGAATTGCTGGATT^^ 

GGGCATCACAAATACCTOAGCGGAAmTG(Xm , A(^ 

TCCGATACGGTTCMGCCGTCAGTGCCGA^ 

GTAOGCAWMTTCAAAAM 

CGCTGAAACG6AGMTGTMTOGCATATGCGCGGGAAAAACCT 
ATAGATGATGATCCGGAGTTCTX^^ 
SEQ ID 8414 

MGKHILMVTGSIMYKSCELVRLIiKKQGHSTTO 
NVEMmiNPANQRNIAQLVSIXSITVYMPGW^ 
SDTVQAVSAEDMHRAVHRLIEK^^^ 
IDDDAKI^FPETSKDEAAMRIVEIUiAVYHK 

SEQ ID 8415 

ATGCAGAAAAAAAGACATTCGCKCCGACACXTrTT^ 

AAAGCGaaTCTCGflTC^ 

TCAGATGATTTTAAAMTAAAAAAAAGITACCCAC^ 

AAGC6TAAA6GCATCTTTTCAAAAAAT 
SEQ ID 8416 

^KKRHSRRHLHPTSSmSCMI^ 

KRK6IPSKN 
SEQ ID 8417 

(HSCAOCCCXXXmACa^^ 

ATATCGGOGGTTTTCCCTTTG^ 

TTTCTATTTCOTAATTTAAM(Mm 

CCCCGGCTGACGGOX^^ 
SEQ ID 8418 

* VHPPATGFFPRIQMPLFQPNLGAQPRNLRLIjLSQRHra 
PRLTAHHSYARYEPASLPMFSASANHSRSSMA 

SEQ ID 8419 

ATGGGGCAAAACCTGATTTT^ 

(XX£GTATTCCCTGCMGACTT^ 

A(KmCATCATTATC6ACGGC(mATGCCMW 

GCAAGGATGGCGCGGGACAT^^ 
MTGCACOGCXjTGTTTGCCGAGTGGAATAAAAC^ 
GATACGG(^GG(X^AAAAGGCACGGGCAMTGGACGGGCATCM 
AGCGCX^^AGACCGGCAAGCTGTTCGCCCGA^^ 

GACGCGTATGAGGCCAATCCCGATT^^ 
TCCCTATGCCCTGCATGGCTTCGGCGATC^
CGACAAACCGCGCG«m^ 
SEQ ID 8420 

ARRGPSIMPCKEKRAWE&VKPIFQAIAAKT^ 
mGQKGTGKIPrGINALDLGIPLTLISEAVFARCV 
D%7EANFDLWIiG^PmNII£NCLF 
SEQ ID 8421 
ATGCCGGGGCACTTCOGGAGCGTTT(nCCAAAT^ 

CGTAAGTGTGCGGSCCGMGTAGTCGCGC^ 

TTGC(X^AAAACGCGCTGCGGATGATGCAGCCTO 

AGGAAATGATTTTGGAGGCAAGCAA^^ 

GAACGACGAMCACAGCGGXGMTACGGCTiraSAMTa 

TIOTCGGCCAGCGGTTCGCCGCCTOX^^ 

A66ACAGCC(XAAAa3GTCTT^ 

GTOCAACACGGTTCGCCCTCKX^ 

TC^CX)GCCGGATACGCCCGCGCCGACAAACAA^ 

ACAGCGGAAGCAGTTGTTCGACAAAG^ 

SEQ ID 8422 

KPGHFRSVCPNIWVSISAQAATIHHKAIGTSPYFIjQNKCRPNGSIXjISA 
FDRKmAPCRQAVFQNIFEIRIR^ 
ERRNTAGEYGFGNQRQRNAQIQGIDARPFARAFLPOT 
VATRFALRRFGGNRLENRLOT 
ARLFGfTV 
SEQ ID 8423 
GTGCAQGATTCGGTTTTCCGCCTCCCAACC^^ 
GCCTTCGTCCTTTCCGT^^ 
TTTCGGGCGGTTTGTTACTTACAAAACAA^^ 

SEQ ID 8424 

VQDSVPRLmQSIiPHIIiPRVISFHPPIGAAVSIffiAVIPHR^ 
SEQ ID 8425 

TTGATTCCGTTGCCGAAATGCCGTCCXaAAC^^ 

GTTATTTGGACAAACGCTCCGGMGTGCCC^GGCATACC^ 

AGAAACGCGTGCCGCCCAGCCCTTGATACGCGAGT^^ 

GCQCAATGCCGCTATCTTCOGTATGACAAAAAAACG 

GQXKCGAGCGGGTGTGCCGCTGTTTTTGGCGAA 

GGCGCAGACGGAGGCGGATGCGGCGCGGTTGGCGAAATTGGGC^ 

TITCAAAAACGCATCGGAGGCCGGCCGGTTGCCG^^ 

TGGCCXJTCX?ira:GCGCCATCaX5Ara 

GGTAGGCGAC^GTATGGGCX^GCTGTATGCGTATTACCTGTGC 

ACGACGATATTCGGCTOTCGACCTACAATTTTTCCGA^ 

TArcTGGCGAGGGGGGCGGMTGCAGATGCAGGCGCGCGTGG^ 

GCGA 

SEQ ID 8426 

LIPLPKCRPNRSIXnCFAGNRKMFQWLTO^ 
AQCRYLPYDKKTTOQFLREHRPMFGIU1ETEI 
FEKRIGGRPVAVCGSTRVyRGEDEABKLLAAWQQYRGDALLATO 
TTIFGFSTYNFSEAOIHAIASGAAVQVES^ 
SEQ ID 8427 
ATGMCGMTTMTCAGCAAAATAAACCGGGTCGGCGCGAGGGAAAAG^ 
G(X&AAGCCTCMTCACACCGOTT^^ 

SEQ ID 8428 

MNEHSKDfflVGAREKIX^SLLIiKVGEICR^ 
SEQ ID 8429 

ATGTCCGCATACCGCTTCCX£CA(^ 

SEQ ID 8430 

MSAYRFPHRIiGDSRARSAMIXIDKAVHAIUiHIiHSAP^ 

SEQ ID 8431 

TTGGAGCATCGTAT^GTATTTATG^ 
TGTC(NX£(&GGC(XXXXX£GMG^^ 
TCCTATATCGGGCGAACCGTFTGCCACT^^ 

SEQ ID 8432 

IiEHRMSIYAVmiHLYCATAFVGGVFFEV^^ 
IilVGWSKYIHAVVFTHMIiLIVFIiAKAMFyiStf 
SEQ ID 8433

GTGTTGGCAGCIATTTGGACCGGAT^ 

SEQ ID 8434 

SEQ ID 8435 

OTGTTTACCATCCTAGTCMTCCGTGTATCC^ 

GGCAGGGAGAACATCCTTTTGTCCGAC^^ 

ATGTGGCGAACAAePGTT^ 

SEQ ID 8436 

ICTILVNPOLPNR6NKPQVLSVGISSVRANI 

SEQ ID 8437 

GTGTACCCAATGCATOCCAGCAT 

OCCAAAATA(XX3AAAATA3"K3G^ 

AAAACAGCGGCGTGGCGATTTTTTTGCC^T^^ 

CTCTTCCTCATCTCC^^ 

(HSCGCAATACCAATCXX^ 

TTTCCAAAATGTAGCCGGTAMGAATTCCAATACT^^ 

CCACAAGCC(£TCCATGCCAAG(^^ 

MCGGOjACa^TGa^j(^TATTCTGTCATG^ 

SEQ ID 8438 

VYPMHHQHFHTDKTQHECQAWQ©^ 

LFLISFGLHHmSIEQERA^ 

PQAAFCQGFFDADFHAAVFLQRKHIQGNH^ 

SEQ ID 8439 

ATGACAGMTATCCCGGCATCG6GTC6CCGTTGTOT 

TCAAAGAAACCTTGGCATGGAGCGGC^^ 

ATTGGMTTCTTTACCGGCTACATTT^^ 

TACGGCGTATTQGGCGCATTGGTAT^^ 

ACATGATGAAGCCCt^GGAGATGAGGMGAGGATT^ 

GTGCTGACTTCCAATATTTT^^ 

GTGTGAAAATGCTGGTGATGCATTGGGTACAC^ 

TAAA 

SEQ ID 8440 

KTEYPGIGSFIiFYOTFAAVLVMIAL^ 

YGVLGALVLIffl/MIFVGAALWQFEWI^ 

VLTSNIFAILGMMYFLLADVAERFI^ 

SEQ ID 8441 

ATGTACGGAAATGTGCCGGATOA(&TO 

TCCTGCTCTACAAGGCAAACGOT 

MTGCCGCATGGTCXXKJTACTCTCCATGAC^^ 

GATCCMOTATGATTCGGACGCCCGGGCMGAOT 

TGACCCACAACGGGTTATGCTCGGAATGGGAAGACOT^^ 

AAGGCTGGAAGAACAGGCTTACATCMTCGCGOT^ 

SEQ ID 8442 

MYGNVPDLIXIIjLYIUCPHIIKSMRPHNDTIFQEPKAA 

DPSffiRTPflPiSAIiSEADL^^ 

SEQ ID 8443 

ATGCATAAA(X^TCACCCGATGAA(XOT 
GGAAAACAAAAAAAAfiCCGAGTTTTGAA^ 
SEQ ID 8444 
MHRASPDEOTWIIRDFREKYGS^^ 

SEQ ID 8445 

ATGGCMCTAAAAAAGCAG(XX3GCAGCAC 

GCGGTACCAAATTCCACGCAGGCGACAACGTAGGTAT(XKS^ 

CATCaSTCCTTACACC^ 

SEQ ID 8446 

HASKKAGGSTMODSEAKRLGVKAYGl^^ 

SEQ ID 8447 

GTOXaTCATTCCCAOT 

AATTCATTCGGTTTCAGCCGTTTCCGATAMTTA^^ 
CGTG 

SEQ ID 8448 

VPSF?FSLFHHSCFHAVILAKAEKSKAKTQDPIIPAQM 

SEQ ID 8449 

GTGGACAAACTGAAAATCPCCGCAAACQGCCCGC^^ 

T6AAAAA(X?TC0CTATGCTGGCA6AT6TCGC^^ 

TAOTGC6CX30CTTAC6MTTGGTTCX^ 
CGCCCXX5TCAATCAGCATTTGAM
CCGTCCXjCGCTACGGAAMCCT 
CGCGMAATCAGCGGCATCGGAACATCGACCAT^ 
(XX&^C£GGC(K£AGGOTG^^ 
ATATtKIGGCAG(XHOT3AAG6CGGTG6ACATO 
GGAAACGATTTTTGAAAAOXXTO 
GTCAAGGCGACGGAITTGCGC^^ 
AACTCGGCAGCGTCGGTGCGAACAIICGAGCGC^ 
SEQ ID 8450
VDKLKISA!K5PI^EIT^^ 
RPVNQHLKGLEAMGA^ 
MTGGRVVLRHAAjPKimVI^ 
VKATDLRASASLVIAGLAMGETVVERIYHIJH 

SEQ ID 8451

ATGOTATCOTCKX^GCCCTGCAC^ 

CTGOCGCCTAMGCA!^^ 

TGAMTCGACCCTGCAAAAATCAGTCGGCAGCA^^ 

SEQ ID 8452 

HFIPAALHKDKHSAYGVTIPDIjPGCFSCGDTVEEAY^ 

LAKAALLTMNQA 
SEQ ID 8453 

TTGAATAGCCTAGACGTI^TTGCCCTACTCAM 

CGAAAAAAGACTTGCCGACAGGTACTGTAAAAAATAOT 
SEQ ID 8454 

SEQ ID 8455 

TTGCCTGCGGOTGACAGCCCTCAAGACCX^^ 
SEQ ID 8456 

LPAADSPQDRPRYAGFLHPSLQLLPKDIFTHKYT

SEQ ID 8457 

MGGCATCTTTAAA^^ 

TCGCXSTraTCAMTACTGOTGa 

GGGCGGCCTGTTGGGCAAAGACGTGAMGTATTGMCGAC^^ 

MCGATTTAGMCTGGGCAAAGOTACGOTCCT^^ 

OT(£GC(XHXXCTATTGATGG^ 

CGMTCGITGGCGGACAAAGTCGATCAGCTTAT^^ 

TCCAAAAAMTTATGGCGAAMTGGCGGCCAMGGCGGTTC 

TTG(XX^6ACGA6ATGATTTTGGACATCGGCCCGAM 

ATTCGCAGGCGGTACGAAAGCCCTGCCGAAGC^ 

TACATCTCCACCGGCGGCGGCGC(n , TCC^^ 

SEQ ID 8458 

MAFLKLTEQNVRGKmiEADM^ 
NDLELGKAYAALCDVFVNDAFCTAf^ 
SKKIMAKMAAKGGSVPLPTDVWAKAFAADAEAVVK^ 
YISTGGGAFLEFTjEGKEIiPAVAALEKTRRVHGU 

SEQ ID 8459 

TTGGAGCATCTCAT^CCGA^ 
TtfcACTTTAmTCGAACAT^ 
GGCGGCACACTKX^GCGCCCGATCAGATACGCGCCATGC^ 
CTGAACCGCCTGTT^ 

AAA6CACMTCGGCGACCCGTCAGGCTTTCACGOT 

CATCTATTACAAGTCTGATAAGCGGATTGCC(KKA 

GAAMGGCATTGCTTGACGTATCGGGC(^ 

SEQ ID 8460 

imHNRYAMRFAVIRFMFYV^^ 

IiNRLFSYFVARSFAKNQPEAEm 

mi^SGDLBEQGIQLAimSEGnWACG 

SEQ ID 8461 

AT6AAAAAAA6AQAATTAAATGAAA!PrGAGA06GC^ 

TTACTCAAAGC^TCAAC^ 

AGAAATAAGCACTATGTCX^ 

GGOTAGTCAACeCO&ra^ 

ACAGCATGGAGCCGACCATTCCAWGAAATOATAACTTTM 

GTCCAGAGGCAAAGGCGGCGCGATACACGCTAC^ 

GTTTTGCCGCTTGAUTl"l L rlJ^CCTT 
SEQ ID 8462

MKKRELNEIETAECAELKW 

GWNPDYPQLLRSIBIPNDALFE^^ 

VLPLDFLDL 
SEQ ID 8463 

ATCCT^CACCCC^CAAGTCAAGGOTTGATO 

AGGCACGCCT03C(KXm^ 

ACAA 
SEQ ID 8464

MLTPEQVKALIAGVAKCHfflEVEGDGHHFFAVIVSLEFEGm 

SEQ ID 8465 

ATGAAGAAMTTCCGCTAMTGAGTATGTTGAGCAG^^ 

TATTTTCAGACCH&AAAA 
SEQ ID 8466 

MKKIPLNEYVEQHGQAKIAKEIGV^^ 
SEQ ID 8467 

ATGC03TCTGAAACGAAATGGTTTCAGACGGCATO
CCAGTTGGGCTTTGAGT(X!GTCTra 
TTC(XATTTTGCCAC(X:CTGC^ 
TTCAGACGGAATTTT 

SEQ ID 8468

MPSETKWFQTAFLCGCTCALA^ 

FRRHF 
SEQ ID 8469 

TTG6GTTACA6ACGTGTC6AGGGGAAAGC(XGCAC^^ 
SEQ ID 8470

LGYRRVEGKARTSSAFTLLPSENKKIARPDLNAFEIAP 

SEQ ID 8471 

TTGGCGGCGGCTTTCAGCATCGCCTTm 

TTCGGTTATATCMTTMCATCTGACCCGACGGGCTTTO 
SEQ ID 8472

LAAAFSIAPLISRLSFFIWLSCFPIFFLSGKIM
SEQ ID 8473

GMGCCAMCGCTTGGGCGTTAAAGCCTACGGCMCGAGOT 

ACACTTTOTCGCCAAAATTC^ 

GCCGCATCTTTTCACGATGCGGCTTTO 
SEQ ID 8474

EAKRLGVKAYGNELIPAGSimQRGTKFHAGDWGMGK^^ 
SEQ ID 8475 

AT(xxmramAccrira^ 

CCTATGCGTTTCGCGCCCACGACGG(Xy^ 
AGGCGTGATGCACGACGTATTGGAAGATACGGGCXjTGACMM 
GMGATCATGCGGMCATCAGGCGGAGAGTWCGCAAAC^^ 
CGATGCGCCCG(^CAAACGCCGCCGGATTG^ 
CCTGCATCCCAACCGCTACGAGACTO 
GAGGCCAmTCAAAGGCAGGGAAAAAAACCTGTACGra 
TTCCCGCCTGTTATGCCGCACT^^ 
GGTCG(XXXOTACGGCTTGCCGATT6AAGTTC^ 
GCAGTGCTGCA(^CCAAT(^TGGTTGAAAAACAT^ 
TTACTCXXaAAAGGAAAAftTCCTAACTTTGCCCAAAGGG 
GCCGCTGCGTACGAAGCTCAAAACCGGTGATTCTGTT^ 
CAATATATTAAAAATCTTAACCGQCACGATGCGGT^ 
ACCTTG(XIGATCTCAACGACAAACAGACTTCA^ 
CGGCAGCGAGGTCAGGCTCAGTTCCATTAAAGTTGATGG^^ 
AAAGGCATGATTATCCATAGGGATACCIJG^ 
AATCGGAAGACAGCCACGGCCTGTTGGCATTAATGGCGCAA^ 
ATT(^TTCmTTGAMGTCMGAATTT^ 
SEQ ID 8476 
MPAPQPSAPYDPLTAEARTL^
EDHAEHQABSFRKLILAMTKDVRVIVVKI^ 
EAKIKGREKNLYGIHQKMMAKKLRFAEV^ 
AVLHTNQWLKNILDIiQASSANAIEFLEHVK^ 
QHKNLNRHDAWLGESUiQ^^ 
KMIHRHICPTLLKSDPE^^ 

SEQ ID 8477 

ATGAACXXm^ATCCCCAftX^ 

TGTTTACCXHATTTGCATTGCAGT^^ 

GTATOTGATGTCGATGCTTTGAAAATCA 
MTGCGGATTTC&ACAAAGCCAGCACGGAAAACG^^
GTTACACCCMGCAGGTGCGAAACTGTCCCGCGGCGAACQ^ 
AGTAACCG(X£TAAAAACOTATTG^^ 
GAAGCGGCGQCGGCTMGATTCAGGCGGCTGTGCAAGAT^^ 
SEQ ID 8478 
MNRIiYPHPIIARBGWPII(MX3IALSUVSM^^ 
NADLDKASTENERNAVL^^^ 
EAAAAKIQAAVQDVLKD 

SEQ ID 8479 

ATGMGAAGCAGGACAGAAACCGCCTGTCGAAGAAAGACAGA^^ 
ACXjGCTTTGAATTACTTGGAAAGCAGCCCQAT 
SEQ ID 8480 

MKKQDRNRLSKKDRRLIKKMLKAAA 
SEQ ID 8481 

ATGAGCATCATCCACTACTTCTC^^ 
TGCCGCTGTTTATGTATCTGGGCATCCAAAGC^^ 
ACX^GOT<£TGACGCGC(&TAMC(^ 
CCCCT^CGGATGTCTTTATCGTTACC^ 
«3<SSCAAAa3CTCTAC^^ 
CCGCAMGAAGAMTCGmTCA(XAAACTCCTGGG^ 
TGC(XjTTGG(nOTCTCTGCOT 
TCGCGTTGGGCGTATTCGGCGCGTGGCTTGC^ 
SEQ ID 8482 
MSIIHYFSLHVESARSALKQLLRQPrcTLI/^ 
PLPDVFOTPDPATTPAQMQAIYRDITKIiraVESASMOTS 
CGWLLSAVRPLVEAIFKI^GLM 

SEQ ID 8483 

Gra^CCOSGTTG^ 
GCGGTTCAGGTAACGGCGAAAGGCGATTA 
SEQ ID 8484 

VETRI^RGIXJKSERRLTMTDGRPIKKNPKGAGNPPBLQGSGSGNGERRL 

SEQ ID 8485 

ATGOCGCCCTTCCT^^ 

TTATCGCGGGACACTCCGGTTCGG(Xy^TCCACC^ 
CC^TCGGCTTTATGCGCCAACACATCG(X^TCG 
AMGCCGAAGMCXXX£CCGCATCGCCATCGAAAAA^ 
TCCAC(^GCCCGGCCTGCTGATTGCCX^ 
CGCACATGACGAAACCCTCAT(X3CG^ 
SEQ ID 8486 
MPPPLTGPPMIRFEQVSKTYPGGFEALKNVSFQINKGEMIFIAGHS^ 
KAEERARIMBKVGLKGRELDDPWI^ 

SEQ ID 8487 

ATGCXIGTCCX^CTTATCOTCAT^ 

ATCAAAAACCCGMGMTCGGAGCCAMTACXXMMCGMTATTTM 
SEQ ID 8488 

MPSMSFIIflKELIRSGSI^ 
SEQ ID 8489 

ATGCOSTTAAACGCCGGATGCCGTCTGMGC^^ 
GTCTTACCTATTGTTCA(&CGGCATAT^^ 

SEQ ID 8490 

MPLNAGOttiKPLQTAFSRPAFGI^ 
SEQ ID 8491 

GW^CGCAGCPTGCGGCACGGATAA^ 

GAG0GGAAG7A 
SEQ ID 8492 

VTQLAARITDIiBGRGFVFAKPRMKAGGRGKPVTHYSI^^ 

SEQ ID 8493 

GTGGAAAATTTTGATTTCAGACATGACATTTCOT 
CXU&TCGACATACAAAG(&CAAAAC^^ 
CCC<mAACCTGCWCGCTCA^ 
GGCAGCGTCGATATSGTCXjGCATC^CXSCT^ 
TGAMTCCTACGGAAACAATGTCGGCGTACTGCCC^^ 
ACTCGCCCATTCAAAATGCCGT 
SEQ ID 8494 
VE^FRHDISLUI)NPIJIGLRRVIiIiOT 
GSWMVGIALDTSIMGNFLKQ^ 
SEQ ID 8495

TOTTTGCATTGCGTTOTCT 

TCXX3GOXMTCAGaX3^^ 
SEQ ID 8496 

IfAIALFIJHNRIiSPLPEPLFCSISGGIiPAFCGFFF 
SEQ ID 8497 

ATGAGTGCGAGGCTGATGGGGATGGCTTTCAAAACGGCT 

CGCTGGCGGMC^TACGGOTTC^^ 

CEACC6CATCAACGTCGCCCTGCTT6AAAW 

TCGGATTACGAACCTTCGGATITrcA^ 

(XXXn , MGAACCATCAGATT?TGAGCGGCXjA^ 
SEQ ID 8498 

HSARIifGMAFKTGIPRGQRFVLVK^^ 

SBYEPSDFEPSDYBPSDFDA1QTO 
SEQ ID 8499 

mAGOXSTGAACCAT^ 

CGGCGCCTCCCGAAACCGCA(XEGCGGCGAAGG^ 

CCGCAAGGCAAMCGGCAGC(XXTGACGGAAACG^^ 

AGCITTTCGCGCCGMTCGCTGCftAAACAAAACTTO 

CGGATTTT 
SEQ ID 8500 

LSGEPSDFIXjSLYVEPSVEPSGSNARGAR^ 

SFRAEWLQ^KTPGGSGNWXSPTHNQTM^ 
SEQ ID 8501 

ATGTCTGAAATCAAAAmTCCACAATCC(X 

TGTCXmTTGAA«3ATATTTTCAAC^ 

GCTGCTGCGTGCCATCGCCGATCATCCCGCOT 
SEQ ID 8502 

MSEIKIFHNPRCSKSRAAVSLLEERG 

SEQ ID 8503 

ATGGCTTTGAGGAACGCGTCTGATTTOT^ 
CGGGCTGCCCGGCCTOCGGAAGCTGGAGG 
CCCGCGGTTCAGAAACTGCCGGATTGAAAACTACGCCGT^ 
GGGCGGAGCATGATTTTTT(XK3GCAGGAGGGGCACGGG^ 
TGCGGACGGTCAAGGACAGTTTCGGCX^ 

CGGGACGKGGCGGCMGCTCATTOT 
SEQ ID 8504 

MALRNASDFM&YGGGVRVERRQ^^ 

GRSffiFSGI«GmNHLACGIAREVIAAGKSALVIWa 

RDGGGKLIPFDWBSYRA 
SEQ ID 8505 

ATCCGOTACTTGCAGAAAMGCAGGOTCACGC^^ 



TGCCCCX^X^CTCGTCCAGCACCAGC^ 
GACCAGCGCGC7ITKKXX3GCGGCGATGACTTC^^ 
AAGTTTGCCG(^TACTCGGCGGCGGCCGCCTT^^ 
TGCGTTTTTCCAGCGCGTCGCGCATCGC^^ 
GCTTITCG(XX50GTATCCGCCGTGTTCCGC^ 

GGAAGCTGGCCQGCCGCTGGC^TCGOT 
CTTGCGGACTTGCAGGi^TCCGCXGCC^^ 
GGCGCCGTTTGCGGTTTCGCAGGGTGCGGTCCGGCAGOT 
SEQ ID 8506 
VSGmKSAYIflKKQVSI^^ 
DQRAFAGGDDFAGDAAGKW^ 
AFRKVSAVFRALPPLDPHAAAVSPQE^ 
LADLQEVRRHLPRDAVVGXQRKIX^^ 

SEQ ID 8507 

atgcgcgmggcgcggaatgcggaagagco^ 

GGGGAAS!ATOCOC(XXX}GCMT(X3GOCGAA3!T 
SEQ ID 8508 



SEQ ID 8509 

GTGAACAAACGGCTGTTTTGCAGCCGCAACX^ 
CGGGCGGGCAGGTTTTGCCGCCGTATGTCCX^ 
TTACXAAAGCCGCAGGGTAGGCAAAATCGTC^ 
ATCCTGTTTGAAAACACTTCGCGCAA 
CGGCX3GAAATGGCGGAGGATTTGGGTGTCCTO

CiacCGTTTGGA^ 
SEQ ID 8510 

VNKRLFCSKflGIiRYHi^ 

HiFENTSRNTYENLNHI^ 
SEQ ID 8511 

TTGATACACATTMGCTGAAAGTAAAAATCCGM 
TACAMTGCCGOTQAAATCATCGGCATCCGOT 
CGGCAGCGGTTTGAAAOTACP^ 
GGTA3X3TGGGACAAAATTCCCGAAAACGAAAOT 

TGATMTGAAG0GCAAGTCTTTTTG6MM 

GACTACAGCCGAACCATGTTCGTCTGCCG^ 

CCAATACCGTCAmCAGTCGATACCGATAAAATCM 

CAGCA^TCCACamCCCC^^ 

CTCGGGCTTTCCAGCGCGGATATTCTGG^ 

GATTACCAAACAACTGACCGGT 
SEQ ID 8512 

LIHIKIimiHHPPlHISSTTC^ 
GMWDKIPENEIYPL!UMVLNGRSDYQI£FPQPTmGEPArc 

IX^SADILAKIJIuEAEQRGQDIGAASLLEKTC 
SEQ ID 8513 

GTGGGGATAMGAQ3GTTCATITGTCATCCTOT 

CGAGAMGTGCTACAATATCGGTTTTCCTO 

TMTTTGAAAAACATCGATTTGGACATKX^^ 

TATGTCX^GAGCCTGTCTGCCTATGWCGGCAG 

ATCCGCGTTOJm?rcGG(ACGCT 

TCAMTGGTCGATGCCGTATTAAMCTOXX^ 

TTTOCXX5TGTGCGCGTGGACGGCGAAGTCT 

AACAAQjACTGGOTGAAAGTTTTGAAACCGCGCTGCGCCAC^^ 

TTCGTACAGCCTGCCCGAATTGGMCCGCGOT 

CArcCCGMTTGTCGCKSGCA^ 

AAACTTTGCCTGAAAAAGTCAAAAAAGTCGTGTTGC^^ 

CATCCCCMTCTCCMCGCC^ 

GCACGCTACGTTTACGTCGGCGGCGAACCGTTC^ 

AAATCXTCAAAGAAATCACCGAGCGGCTCGGCTT^^ 

C^GCCAAATCGGCAGCGGCCK^rcGGCGTGAJOT 

AACAGCGTMTTCTGGTCGAAaCGAC^ 

ATGTCGTCAAATCCGAAAMTCCGTTACCGG 

CGGCAACMC£TCAAAMOTAAC£CTCGAGT^^ 

OmACTCAACCtKmmGAAGAGCCCGCXm 

AGCCTGCCAAGGCGACGGCGTMTCAAAGTCGAAATGCACTO 

GGCAAAAACATCAGCCAAGTCCTCXIAC^^ 

GOTCGGCCAGTCC^^ 

CCTGCACTTCGCCGACATCGCCCTGOT 

GACTTAGGGCCGGMGGCGGCGArcGTGGGGGTMGGTTA 

SEQ ID 8514 

VGIKTVHLSSFYSNILNSDIITESAAFRROMPSERGFRRHPRXCYNIGF 

YVBSLSAYARQFLQMUDKFimiEGLSPAISIEQKSTSH^ 

FHVRVK3EVYQLDEVPKLEKNIKHNIDTO 

HPELSLATGMIX^KRNQFYFQMIQSIAHHYKFDVim 

ARYVTOGEPLBBVSAWPLTKW 

NS\OT/EHDBDAIREADFVVD^PGAGEHGGNVLIAOT 

REUlRAQEOTAPYDDIRGLEHIiDW 

GmSQVU^VEBAREFFOAVmSRK^ 

DI£PEGGIX3GGKVIAKGSPEBVAXVRGSYT6^ 

SEQ ID 8515 

TOAAAACraXnTAAAACGCAAATTTGAAAGGAAATA^ 
ATTACAAAAACCGCGCCTTCGAGTGCGTCGGGT^^ 
AGCCCTGCEGTAOT^^ 
GCC(XX3CT(XrrTGC(^ra 

CCCC(X^CCCTGAAA(£tt - 

SEQ ID 8516 

LKTALKRKFERKYGMTVRNT^^ 

AALLAVI SAAYYIRDGEI^AlffAMCVroiiLEKMPPEAGRAPHPESPMPPETAGGGI 






SEQ ID 8517

TTC&TOCCmGCA^^ 

GCACCGCCTTGTTTTCXmGra 

GAAATGGGTCGGTCTCGATTACGACMCGCCGACA^ 

AACCCGTCGTCCGCTTCAAMCGCCTO 
TACGCCGACCTACMCTT^^ 
GGCGCGAACCTGCCC^ 
CCGMGCGATGCTCMCTATTTGGOGCGTTl^^ 
TATGGATTTC^AAAGCTC^ 
CCITO3TTGGMGAKTGTTGGCACTO 
ACTGGGACGACGAAGCCGCCGCCCGTATCCTGCGCTT^^ 
CAAAATGGGCAAACTCGOTATGOr 
SEQ ID 8518

LMPKHONSKINA^^ 

CSKEELEAMREKAEKECT^^ 

GAHLPBYGHLPMIUJEQGKKISKRSGOTVMTDFGAMGIIJ 

PALEDVLALVKDl&QDIiNALM 
SEQ ID 8519

GTGCGGGCAATGCCGTCT^TGTCGGAATGCGGOT 

GTGCGGAAGACAATTTGCPCAGCATCCMTCCG^ 

GCAGGCGGCTCAGGCAGACATCGCCXJGTTTGGAAG^ 

GC6GTTTACG606CA6G66AACGAAA6CGTC66GQ6 
SEQ ID 8520 

VRAMP SDVO^APRmfflJraLY 

AVYGAGERKRRGN

SEQ ID 8521
ATGAATTCCCTCTIT(^ 
ATAAGATTCGTGCCTATTACGGCAGATGTTTG^ 
AAATTTTCCACAGTAT^W^ 
MGTTCTTCTATCTTTATACGC^ 
AGGTT 

SEQ ID 8522

MNSLFVDNTOITRLKAGHIGRLVQ^ 
KFLYLYTPDVQIFYESFGWAVVGRHFH^ 

SEQ ID 8523 

ATGGGCAGATGCCGTCTGAGCTGTTCGGACGG^ 

SEQ ID 8524 

l^CRIiSCSDGIFin/^FPVRYSGLTKTGTALPRPGSKG^ 
SEQ ID 8525 

TTGAAMCTGOTAAATGC^ 

CAAGCAGAAAGGAAACACAA 
SEQ ID 8526 

tKTAEMPSETFQTAFLYYEPFCGFHIIiGRRPAADSHT^^ 
SEQ ID 8527 

ATGGCACGTATTACCACCt^GAClOT^ 

TCCGCAATAACAAACCGACCGWA 
SEQ ID 8528 

MARITTBDCTGHSHHFIHjTL^^ 
SEQ ID 8529 

ASX^GATATGCTT^^ 
SEQ ID 8530 

MIMiAAVKIIVHPLYPPVyPKWKVFPV 
SEQ ID 8531 

GTG(^AATCCCTATCTTAATTTCCGTTG^^ 

CAAATGGTACGGAACCGGTTCGCXXXjGTGCTTG^ 
SEQ ID 8532 

VQIRIIJSVl£AASNHMEKSVDKLY(X>L^ 
SEQ ID 8533 

AflftTGCAGCAATACCTOGCATAAAA^^ 

TCACCCTGAOaCACQCQGGa^ 

CGAACAGGTCGGCGAAGTC^^ 

GTTGCCGTGTCTC^^ 

CGGCGGAAATTTTGCTGCCGGACAGTAAAAACGC^ 

T^ACGGOCTGTCGCTCGA^^ 
CGAACCCCTCCAATGCCATC^

CGGTTTGTCCACACTCAMGTCGAGTTCAACC^^ 
^CGCTTCATCACGCCGGAACTGAAAGCCTITC^ 
TGCCGCAGCTTCAAAAA6CCGCCAAAGCCGCCG 
CATCXiAAMCG(£CGCCAra 
TCCAOTATATGCGCCAAGTCGCGCTGATTG^ 
Aa ACCTCGCCTCCAACOX^ra 
TTTOSACGGCCICGC^ 
GCCGTCAATATGCACCTTTCCGCGCT^^ 
CTGTGC3GCGCATTGAAAGCCGCCCAAW 
TGTGGACTGCTTTGTGGATAAAGCAGAGGAAAAACATO^ 
CTGAAAGATTTGTGCAAATCCGTATCT 
SEQ ID 8534 
MHQQYI£IKAQHTlHnjVFYIQO 
VAVSPDKKyiGLAWftSLQSGEFKTKLTTADK^^ 
LMLSliETDSQYIGMDAATRRMiEITQTLS^^ 



ERPITPELWEDKVIjTAQI^AIiALEKQLFDGVLKNI^TALP^ 

SITMRQVALIVLLAHTCCFV?^ 

AVNHHI£ALEQGH0IWIflQIQPGPAGmGIAVAKIiAG 

LKDUXSVS 
SEQ ID 8535 

TTGCGCTTTTATGCCGAGGTATTGCTGCATCATTG^^ 

CTGACCGTAAGCCTGTCMTAMGTTGTGGAGAC^ 

CTCAAATTTATTTMCAAAMGGTMTGAAAATATTGGATATTO 

TTGAAA 
SEQ ID 8536 

LRPYAEVIiLHHWGNGGPGHGAAIiLWj 

SEQ ID 8537 

TTGTTTGACGCAGGTGAATTTC^ 
SEQ ID 8538 

LFDAGEFRYFKNAHS IFDFKI LHMI LMIADIiLVYK 

SEQ ID 8539 

GTGGMTGTAGTTTTATTrCAGMTACMTACAGG^ 

TGCGTCAAATTGAAAAAGCCCACGGCATCEA^ 

(X^TTAAAAGAC(^TGCCATCCrreGTA^ 

TTCTTTACCMCAAAACCm^ 

TGGACGC?TA(XX5AAACTGTCATTO 

CTAttCCC^ACCAAAACACCCT^ . 



CCtmTGAAA<XXXmtta<mAT^ 

CXH , GG(XK^AMCAACTGGTGAAAACa 
SEQ ID 8540 

VECSFISEYNTGIGLYFTTGIpMS 

FFTNKTKAEAHIRAGARKWISA^ 

VIPELJX3KLIX;SAQRVFVATGSLTE^ 
SEQ ID 8541 

TTGAAOUACGarrGTT^^ 
SEQ ID 8542 

LNKMiFVRHKPQyQVT^ 
SEQ ID 8543 

ATGACTTATACCATTACCCCCATCGGCAC^^ 

TTA(XX£AGACAGCGTGCGCGGGTTG<^ 

ACAAAAAATGGGCGTGTTCGCCAreCGCAGCCC^ 

GACCTGCTGGACGGCACGCCGATTGTGGACAT^ 

AGGAAAACATCGGCGCGGAAMTTTATCTGCAAACACC^^ 

TGCGGATTACGMGTCAGGTTTC^MTOGAGGAAAACCGC^ 
SEQ ID 8544 

MTYTOTIGTARSPYKQRFGIARQPGLV^ 

DIiUX^IVDIKPHPFIESKPDAASGFV^ 
SEQ ID 8545 

OTTGCTTTCAAAC 
SEQ ID 8546 

MNRFAVSISIiAKJLQQIRIRQHFQAERl^^ 
SEQ ID 8547

GKOGCTMCMI^GGTCMAATGGTl^ 
CGCGGTTTt 

SEQ ID 8548 
SEQ ID 8549 

CTIttTAGCGGGCGOGTraSGGCAGATAGGCTAMCGGCATACCGMTAGCM 
SEQ ID 8550 

SEQ ID 8551 

ATGATTGWICCATl^TGTTCATGttCAAAAAACGTTrA 

GATGGTCGGTACGCGCGGCTATGAATTGTTGGACGGTCTQCCTICCG 

SEQ ID 8552 

laVSIE^CKSTTPLCPDELPVNEGHUVBH^^ 
IEmsmiEWMI^OTKMOTGGUTDYCVKTTV^ 
SEQ ID 8553 

GAAACTGMTCAAAAACMCAATGCMTGTTTATCTGCCTOjGGGAAAC^ 
^TTACAGGCGGCGTATCTGTTGAAACACGOW^ 
CGAraAAAAttGGCGTTCAAGTTTCGGCTATreCAAAAAATGT^^ 
SEQ ID 8554 
MaLLVRLSSMGDLimPMBD^^ 

WMYIiLKHAVGIVGVI/rGIiIiHIiAHAIiEKIWGI ***w*»umi*Miu 
SEQ ID 8555 

<»CGATGCCAAlttAWG^^ 

GCGAAGAAACGATTGCGTTCAAACTGCCTGCAGTCATCAGCGCAGATTTGCGTTTO 
KMTCTGOGATTT^ 

SEQ ID 8556 

MRALVAVKRWDYHVKVRVKADGSDVDIGNVKMS^ 

KfuiAKVI 

SEQ ID 8557 

C(7KXS3TCGIuGAATTCGOGAAGCAAGTAfiCGGGAGTGGAA 

AGCTMrrMTGTTAAAACAACCCCTGCCCAAAACCTGAGTC^ 
®^*OTraiAAAAATTCA<^^ 

GGACAAACOGGTAAGGTGGTI<jCGCCGCAGCTCTATTWGCAATC(^AT^^ 

ATGCOXBAmimTOTAG^ 
SEQ ID 8558 

MSVUIAEHDNRQIOTATLHAVAAAAKI/XVDUiVVGHGASSVVEPAKOW 

FVRPnAGNAFEOTQADSEKLVLTPRVTAFDAVAAQGGHAEVINVOT 

GQTGKWAPQLYFAIGISGMQHVAOTgDS 
SEQ ID 8559 

TTGGTATTOTITCACATCAGCCAATGTCCGAATCGQAra 

CTGAAATGTTTTQUjACGAOCTCAITAAIAACAAAGTOGrc «~»»«»>i«iu.uii imamjiujtmTCATC 
SEQ ID 8560

IjVFFHISQCPNRIGKCNKTAAFIFKLSLRRGSN^ 
SEQ ID 8561 

ATGATOCGAOTATAAAACC^TOCTTCOX^ 
CTGTTAAAGGAAGAGCATTA 

SEQ ID 8562 

KITTYKTITSPTQAEFKnro^ 
SEQ ID 8563 

TTTCAGTTTGGTTTCTTTGTACACT 

GCACCGGATTOAfnTTG^^ 
SEQ ID 8564 

FQFGFFVHYVFACaJRIEFPDFQFAGHSAFIFGSGVEV^^ 
SEQ ID 8565 

TTGGAAAATATACAGCTA^ 

GTTTGCGCGTTTCCGTGTCGCATAC^ 

GGAATACGCAGACGTATTCGGCAACT^ 

MCGCGCTGCCGGMGCTGTCGGTATCTTTATCCTGOT 

CAAGGCATGAAATCXSAACAGTOGTATT^^ 

ACTGGGATTTATTGCAGMIITGTTGGAAAATTCC 
SEQ ID 8566 

IfflQVmSAYRKGNIFIISAASGTC 

NAIiPEAVGIFILPPSFDVIiAARLKGRGTDSREVIQR^^ 
SEQ ID 8567 

GT(^CGGGATGCAGTTTCGCGGCAACGATGA^ 

ATTTCGGT6GTGCTTTAITO 

ATCTGAGTATCCGTTTTTGMTGMGCMTATG^ 

GAG(OTGTGrrGCGGCTTGGMGM 
SEQ ID 8568 

VNGMQFRGNDDGEPPGSAGRPIlOJIDSVGITDVLVVm 

BACVAAWKNTRQVEVNTSCPFE 
SEQ ID 8569 

ATGGACAATTTCTATACOCACGAGTATCAGGT^^ 

CGCCGGGCGMTTCCATCAATTCGCTK3GCT 
SEQ ID 8570 

SEQ ID 8571 

GTOTCTTTTGOTGATO 

GACCGOTATCAGGATPTGATOAATPCTGCCGTA 

CACATGATTACGTCMTCAGAMGGTGCGCCTATC 
GTTt^GGGC^TAAAATGGGCAATGC 
AGTCMGACCACAAACGTCTTTTAGACGGAGATAMGGCCCGM 
TTTTGOCGACCGTAGOGGGIATGAAAGCGGAGGGGCA^ 
CGATCCCGAAACCCAGCCGAmTGM 
GGTGTGGTACTGGC6GCX3CAAAATTAOCC^ 
AAGGCGACGTGTTGACX^CGGCGGGCGCATATTGTGTC 
ATACCGTAAAGACATTCCCGATAAAGCGATTAACCGT 
SEQ ID 8572 

VVFCLILLKFLQM<$^ 

IFGPTQYMQLESSKDFAKAFMVK™ 

SQDHKWiHXffiKGPOTXXjMM^ 

GWLAAQNYPETPKKGDVISGLDBV^ 
SEQ ID 8573 

ATGACKMGC(OTra^ 

AATCTTGTTATGGCPIKjGGGTGTCTTOT 

GCAGOOTTACTTCAMTGCCGTCTGAAA^ 

AGGCGCAGCMGCTTGCGGTCAGAGTGCCCGATCAT^ 

CGGAGAGG(^GTCAGAAGGC^^ 
SEQ ID 8574 

MTOALQTAPYIMIiPSRIIAASAQRKIiSVyiiKKGGLTO 

RRSKIAWVPDHTGARRIiC^ALGMPLVS^ 
SEQ ID 8575 

(^AAAACTTTTTTGCATACACAMGAAA 

TTGTCGTTCTGGAC^^ 

TGAAfiACATCGTOT^ 

TGTATCGGTGTCAAAACMGCGATTATraTTAGAG^ 
ACAATOACGTTTGMTAGTGAGATTGTA 
SEQ ID 8576

VKTFLHTQRKNFLDGI 
aGVKTSDYI^VLVTGRSREC^^ 

SEQ ID 8577 

ATGAMCCGTATGCGGGAAAAMCTCTGATTTCAAOT 
TTCCC 

SEQ ID 8578 

MKmGKI^PNLYTNI^SIillKMPSGII^HPREHPAIP 

SEQ ID 8579 

<m&GAAAAAAC<a^ 

GCMCMCAAAACATACCGCTTAAGTTCGCTAQG^ 
AACTGAAAAACMTATGAAACCGTATGCGGGAAAAAACTC 

SEQ ID 8580 

VRKIKanjMIFQDKKVVTW 
SEQ ID 8581 

TOGGAAACCGGATATTTTC6CG(n^ 
TTCTCCAGGTCGTCATmCCACGACAAAG 
GCGCGGCAAGCAOTKXy^GMGGCGGCAGGATAM 
CGCCAACGCATTCACGCCC^ 
TGTACGCCGTTTGOTCGCC^^ 
CCGAMTGATAMGATGTTGCCTTTTCGATAAGCGGACATAOT 
CTATTGTTT 
SEQ ID 8582 

LGNRIPSRFLGIPQQICNKSQLPRPFQTAGIHNMK^ 
RQRIHALRACAVIVAEyVCVPQKSIiIiR^ 

LLP
SEQ ID 8583 

atgaagtcmtggamtitittcgggcocgggc 

actttmotcmgcttgtttatcagggagatc 

ccttgc(xatgaagacagaagggatatgtttcctc 

tattttgmtatatotaotcg(ot 

mtatmggtttcatatcaattgttgaatgcagctga™ 

mcacattccgmgacaggttgaattgggaaaagtatgtmcggggg 

atattcgmotgaaaaaaaaccttggcaaato 

CCGGACA(^CA(&AA(Xra(aTA(&CGM^ 
mTGMGCAAAACTTOTCAAACATT^^ 
CATCTGATGGGGGTGTTGTCGGAGAAAAG(^GCCTGCACAAC 
SEQ ID 8584 

MKSLEIFSGAGGLAKGLELAGFQHASFIEI^^ 
YFEHLLRLTYPNIiGILQNEDWK^ 
IFEPEKKPWQTVRDTLSDIPHPLGNHKITGHEYRIX^ 
ffltfGVLSEKSSLffll 
SEQ ID 8585 
ttoegtttgagattg^^ 

TAATCATTGAAATCCGCTCATT^ 

CGTATGGAGGAATATGCTGCTTTTGMGAGTGT 
SEQ ID 8586 

LPFEIVPIFSHRADLSGVLQAVinGSKTKDRKIGRIRraiJ 
SEQ ID 8587 

ATGGAGGMTATGCTGCTTTTGAAGAffTGT^^ 
SEQ ID 8588 

MEEYAAFEECIMDELPITVNRPDIjFPSAYAV 

SEQ ID 8589 

TTGATTTGTTTCCGTCAGCTTATGCOT^ 
SEQ ID 8590 

LICFRQUiPSEALSDGIYFEVLWFASDimV 
SEQ ID 8591 

atggttcaaccmgaaaacccgmgtamcgttatg^ 

aoctgctgtok^^ 
SEQ ID 8592 

HVQPRKPEVNVMKKCBDIAIiLLSKH 

SEQ ID 8593 

ATGCCGCTACCCGACCTGACC^^ 

TGOTTCCGCATACAGCGCAGGCGAC^GTTTTCM 
AGTCTTTACCG(^CTGGATGA(XUGCTGCTGGAT^ 
AATAACAACGAATTCCAAAAAATTCTGCAAAGCTGOT 
GOOTATCAGCACtnCCAACTACCACACCA 
SEQ ID 8594

MPLPDLTDAELMESRKLLOT 
NNNEFQHLQSCLYNU^ 
SEQ ID 8595 
TTGCCAAAATTGCCGATTAfTCAAACGC^^ 

GAAAGMGGTTTCGTCCGCGACGCGCCT^ 
TTTGAATATACCmTTGCTGGCAGMGTTGaC^ 
ATOTACraTGATTTGCAGGAAGCGGAAA^ 
CTCOSCGCTTTC^^ 
CCCTCCTTGTCCGCCATA1TC 
SEQ ID 8596 
LPKLPIIKRVPERNTAVQPETSAQ^^ 
FEYTQLLAmQIPDIPDIHySHDSKYTPS 



SEQ ID 8597 

GTGCCATOCGAACCGGGTAAAAATATGATTCAACA 

CGGAAAACTGGCTGAAAATGGGCGGCAGGGCGQXIAAA^^ 

C6ATTT6ATAGACGGCATCAAA6AAATGATGTG0CGTTATGA^ 

GAAGAMTGGTGCATCATACGGCGCGGCGTATCCGCGAAGTGC^ 

TCGAGTTCCTCMCmSGTTGCCOGT^ 

ggacg^gggcgcgtgto:tacatccacatc«^ 
TACA<m«2rcc^ 

AATACCGCCGTGCAGCC 
SEQ ID 8598 

VPSEPGKNMIQHAGIXa^ 

EEMVHHTARRIREVQI^^ 

YTKlJ>TIPPTLLERDFNFPPFAEm 
SEQ ID 8599 

AGCTKXKXXX^GCACGCAAAra 
CGACGTGATTGCAAGTTGGGCAAAAAACGGCGTTTGGTT^^ 

AAACTTCTAAAATAGTAAAACGCGATTTATACCAC^^

SEQ ID 8600



SEQ ID 8601

TTGATGTTTTTAACGCCATATTGTAGCCGTCAGCOT 
SEQ ID 8602 

IilFLTPYCSRQPAACQINQNNSGKIWASANMK 

SEQ ID 8603 

ATGAACAAAAATATTGCTGCCGCACTCGCCGGTGOT 

CTTGCGGCGCATCCAAATCTGCCGAAGGTTC^^ 

AACTGTAAAAAMGCCC^CAAACACACCAMGCATCTAAA 

SEQ ID 8604 
MNKNIAAALAGALSLSI^ 

SEQ ID 8605 

ATOAAAATCGGMCAACTTGGCAGACGGCATCCGCT^ 

TCAA^TeAGTTTCCATTCC^^ 

GGCATTGGGGCTGATGGTCGTTACCGCCGT(^ 

CTGCTTTTCCAGGGTGCGGGCGGATGGTCGC^ 
SEQ ID 8606 

MKIGTTWQTASMiVIiRLFAAYBFLES^ 

LliFQGAGGWSLOTLIjKKIjPCPKCRLKQD 
SEQ ID 8607 

ATGGAAMTATGGTAACGTTTTCAAAAATCA^ 
MCCCGCCXX^GTGGTCGCTTO^^ 

ATCAAAGGC&AAAAGCTGCAAAAMCATCAACCG^ 
TCAATCAAGGGMTGCCGGGCAG(K^^ 
CAGTGCCGCTCGGCX3GCAGGGGGCGAATTTC(^^ 
GTTTCCX3T(mxmAACG^ 
CAGCACGTGCOGCATTGCCTGAMTCAA 
SEQ ID 8608 
IMVTPSKIRSWAIAAAAU^^ 

IKGEKI^NYINRKVGGRQIQQFPIKFAAVATDPETGKAVAFNQGNA 
VSVLQKELGQADVVIKPQVIfl^ 

SEQ ID 8609 

GTGTOTTGAGCTGGTTGATTACTT^^ 
TTCGGACGGCA 



SEQ ID 8610



OTG<^mCACCGAT^^ • 



TTGTCG 

SEQ ID 8613 



(XETCCGAACATATTTCX^^ 
S |^CFAKVSRPS^^ 

AQTGGGCGQGT 

SEQ ID 8616 

LI^IiffPKDI(mKAlASTWiKVETANR 

SEQ ID 8617

MGAATATi 



CCCCTTATCTAACAGGGGGGGTA 

SEQ ID 8620 

IJVFEDSC^GDKQGMTKAVAG 

™=^^ 